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In describing the colonization of the New World by Europeans and its inauspicious impact on the native populations, 
Jared Diamond writes 1 

...for practical purposes the collision of advanced Old World and New World societies began abruptly in A.D. 1492, with Christopher 
Columbus's “discovery " of Caribbean islands densely populated by Native Americans. 

The quotation marks about the word discovery are pregnant with meaning, calling into question what it means to discover 
something if that something already contains many people, but the history of the event further complicates interpretation. 
Columbus had to pitch his proposal to many of the European princes and royalty to secure funding for his enterprise, 
most of whom declined until King Ferdinand and Queen Isabella of Spain agreed to outfit his three ship voyage on 
Columbus's second, not first, appeal. As to the alleged discovery itself, the first sighting of land (the Bahamas) was by a 
lookout on the Pinta, whose captain alerted Columbus (who captained the Santa Maria) by cannon-fire. To Columbus, 
however, went the lifetime pension granted by the King and Queen to whomsoever saw land first. 

The point of the story is not to look with askance at a popular account of a wellknown event or tarnish its gold-obsessed 
principal investigator, but rather illuminate that although one person may claim the accolades (or pension or royalty), it 
is support from colleagues, collaborators, benefactors, supporters, institutions and funding agencies that is most needed 
for a voyage of discovery to even be possible. It was true then, no less true now, and perhaps not surprising: as put by 
Robert Wright 2 

Unlike food or spears or hides, information is shared without being actually surrendered, a fact that can make the exchange radically 
with "non-zero-sum" meaning all players can be winners. 

There is sharing, though, and then there is giving. I owe much gratitude to many colleagues, having benefitted from 
their help, support, mentorship, collegiality, encouragement, friendship, and especially forbearance. Much of what I 
have written about would not have been learned without them. I am indebted the Naval Research Laboratory for its 

1 Jared M. Diamond, Guns, Germs, and Steel: The Pates of Human Societies. 1st ed.. New York, NY: W. W. Norton & Company, 1997. Chat 3, p67. 
2 Robert Wright. The Moral Animal: The New Science of Evolutionary Psychology. New York: Pantheon Books, 1994. pi 95. 
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PART I 

Foundations 


Tell me, Muse, of the man of many ways, who was driven far journeys ...Many are they ...whose minds he learned of, many the pains 
he suffered in his spirit... 

- Homer 1 


Corner, The Odyssey of Homer (trans. Richmond Lattimore). New York: Harper & Row, 1967, p. 27. 
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CHAPTER 1 

Prelude 


Then this new kind of knowledge must have an additional quality? 

What quality? 

Usefulness in war. 

- Plato 1 

Ever since the invention of aerial bombing, military strategists have had to assume that "the bomber will always get through". Every 
technological advance since then has made this more certain - with one exception. 

- R.J. James [1] 

In late August of 1940, during the Battle of Britain, Sir Henry Tizard led a mission to the USA at the direction of Prime 
Minister Winston Churchill. Tizard flew, but the rest of his mission traveled on the Duchess of Richmond across the Atlantic, 
carefully conveying a black box holding Britain's most sensitive military technical secrets [2], an account of which figures 
prominently in the historical review by James [1]. Tizard's mission was just shy of a year after the famed Einstein-Szilard 
letter to President F.D. Roosevelt urging the USA to initiate a program developing atomic weapons. Surely it is no surprise, 
then, what the secret documents concerned. 

Radar. 

The importance of radar (an acronym of RAdio Detecting And Ranging) was (and is) great, and that importance was 
starkly revealed in the Battle of Britain (1940), the defeat of the Italian Navy in the Cape of Matapan (1940), the US 
Navy's defeat of the Japanese aircraft carriers in the Battle of Midway (1942) that marked the turning point of the war 
in the Pacific, and the elimination of the German U-boat threat in the Atlantic that enabled supplies to reach the British 
(1943), all of which testified to the pivotal and game-changing role that radar played. A summary by Skolnik [3] is to 
the point: 

There were many factors that allowed the Allies to defeat the Axis forces in Wold War II, but the introduction of radar was one of the 
most important. There is no way to know whether the Allies would have lost the war if they did not have radar, but it is quite clear that 
there would have been more losses and a longer time needed to win the war if there were no radar. 

An Allied victory was not a sure thing: both Germany and Japan in those years were formidable in military strength and 
scientific capability, and the edge that radar provided mattered. Ironically, radar's actual importance during World War 
n, compared to its public perception in the USA, is perhaps inverted from that bequeathed to atomic weapons. 2 After the 
war, spinoff peacetime applications and descendent technology from microwave ovens, television, cell phone networks, 
satellite communications, and weather imaging have caused an immense change in how people work, play, and know 
whether to take an umbrella. 

Why is all that important here? Radio wave and microwave amplifiers are made possible by extracting power from 
energetic bunches of electrons. Electrons are not by nature feral: they are yanked from their host material by violently 
ripping them out (using scorching heat, electric fields thousands of times stronger than those associated with lightening, 
intense lasers brighter than the surface of the sun, and even accelerated beams of other high energy electrons). Improving 
the efficiency of doing so made radar better, and so it and the many technologies indebted to it served as the impetus to 
drive a great deal of cathode research up to the present. The physics is at its most interesting when the conditions the 
electron sources are subjected to are the most punishing, and so those conditions are the ones gleefully considered here. 

1 Plato, The Dialogues of Plato: The Republic VII (trans. Benjamin Jowett), Great Books of the Western World, ed. Robert Maynard Hutchins, Vol. 7, 
Plato. Chicago: Encyclopaedia Britannica, 1952, p. 391. 

2 For example, "The historical evidence makes clear that the popular view about the use of the bomb is a mythological construct... there were 
other options available for ending the war within a reasonably short time without the bomb ..." J.S. Walker [4], 
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As to the technologies that benefit, there are many. Microwave ovens are perhaps the most obvious vacuum device, 
the guts being not much more than a version of the pivotal magnetron behind early radar, but in a shiny box with a 
fancy timer (an unkind but reasonably accurate characterization). Field emission microscopy [5], flash X-ray sources [6], 
and microwave devices using metallic wires [7] received intense investigation beginning in the late 1950s as possible 
alternates to thermionic cathodes (the ability of field emission to be to be strongly modulated being the driver). For 
microwave devices, using the fields that existed in cavity resonators, the field emission cathodes produced electron 
beams of high current density in bunches with high harmonic content from tungsten needles. Modern incarnations of 
high-power microwave (HPM) devices [8] use tufted carbon fiber cathodes, in which the individual fiber bundles coated 
with cesium iodide (Csl) salt can produce currents in excess of 1 kiloamp [9], 

Vacuum electronics [10, 11], and after the 1980s vacuum microelectronics, were behind much cathode development 
for microwave amplifiers and "tubes" [12]. These applications encompass the class of vacuum devices that operate 
at microwave frequencies, for either generation or amplification. The microwave frequency range encompasses UHF 
(0.3-3 GHz), SHF (3-30 GHz), EHF (30-300 GHz), and, most recently, pushing into the sub millimeter (or terahertz) 
regime. Although "tubes" evokes images of glass-enclosed triodes and pentodes, modern tubes are rather more akin to 
particle accelerators, albeit small ones. 

The most common microwave tubes are klystrons, traveling-wave tubes (TWTs), magnetrons, crossed field amplifiers, 
gyrotrons, and free electron lasers, and these are widely used in radar, communications, electronic countermeasures, 
directed energy devices, and particle accelerators. Depending on the frequency band, radar is used for long-range surveil¬ 
lance, long-range weather forecasting, airborne weather forecasting, missile tracking and guidance, marine radar, air and 
ballistic missile defense, high-resolution mapping and satellite altimetry, and airport surveillance, among others [3], with 
most modern radars at the higher frequencies. FM and television occupy VHF, microwave ovens and mobile phones 
occupy UHF, and radio astronomy and directed energy applications such as active denial operate in the EHF. The progress 
in the metric of the product of average power and frequency squared, or P m J 2 , has been remarkably steady over decades 
of development, as shown in Figure 1.1 - indeed, for high values of that metric, the playing field belongs to the "tubes", 
with solid-state devices dominating the lower power, lower frequency regimes [13], 

Many other devices reliant on electron beams have proliferated. Most people, at least those p ost-Hobbit and pre-Harry 
Potter, remember the cathode ray tubes (CRTs) that were the basis of all televisions, computer monitor displays, and 
oscilloscopes for decades. The heart of the CRT was a thermionic cathode that generated a beam of electrons that were 
accelerated towards a phosphor screen such that the beam scanned back and forth ("raster"). Before being overtaken 
by liquid crystal and plasma displays, flat panel field emission displays (FEDs) were developed in the 1990s using small 
clusters of microfabricated field emitters to individually address pixel elements on the display [14, 15], Electron beam 
lithography [16], an analog of photolithography using particles instead of light, is a method of creating very small struc¬ 
ture circuits and nanotechnology by using an electron beam to expose a resist, which can then be selectively removed. 
Similarly, field emission scanning electron microscopy (FESEM) images secondary electrons emitted from the surface of a 
sample that are created by primaries emitted from a field emission source and accelerated through a high-gradient poten¬ 
tial (generally in the kilovolts range) [17], an improvement on the typically thermionic tungsten source that required 
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long scan times. A higher brightness field emission source enables a modern field emission scanning electron microscope 
to achieve resolution of feature sizes on the order of 4 nm. Various cathode technologies are behind particle accelerators, 
radio frequency injectors [18], free electron lasers (FELs) [19, 20], energy recovery LINACs (ERL) [21-23], and X-ray 
FELs [24], Other uses include charge neutralization cathodes for ion and Hall thrusters used for satellite propulsion [25], 
and thermionic and non-thermionic cathode alternatives for electrodynamic tethers [26] for propellentless propulsion 
for satellites passing through the Earth's magnetic field [27, 28], 

Field and secondary emission are not always opportunities: they can cause dark current on the surface of multialkali 
antimonide photocathodes used in photoinjectors [29] or lead to breakdown on metal surfaces subject to high fields. 
Copper surfaces of particle accelerators (e.g., the Stanford linear accelerator (SLAC)) exhibit regions of melting and 
protrusion formation that ultimately give rise to undesirable emission off the copper cavity walls [30-32] are a precursor 
to seriously undesirable breakdown phenomena. 

In that rather rapid summary, four emission mechanisms were identified: 

1. thermal: emission by heating [33] 

2. photo: emission by absorption of photons [34] 

3. secondary: emission by scattering with primary electrons [35] 

4. field: extraction by intense electric fields [36]. 

By the late 1920s, predictive equations for all of these had been developed. An introduction to the theory of the four 
mechanisms - and space charge - should start at a level accessible to those without prior exposure, but who nevertheless 
are familiar with basic methods in quantum mechanics, statistical mechanics, electricity and magnetism, mathematical 
methods of the kind that physicists enjoy, and a smattering of other disciplines that a well-rounded graduate curriculum 
would provide. Even so, this introduction will try not to presume too much of the reader, and provide opportunities to 
either become acquainted with or refresh one's knowledge of the physics involved in a way that is, it is hoped, at least 
interesting and perhaps somewhat different than the usual narratives. For those who crave more, copious citations are 
provided. 

The ambition was to design a narrative that would be instructive, oriented to computation, and a pleasure to read - the 
kind of book that the author would like to read, which may have had unintended consequences. Sometimes this includes 
material to spice up otherwise mundane methods. There is a reason for that. Science and mathematics are essential to 
enjoy the good life, but that life also includes the world of ideas, imagination, and curiosity. Why would the two not 
mingle? Perhaps they should, particularly if it encourages reading for the pleasure of doing so. That is how discovery 
begins. 



CHAPTER 2 

Units and evaluation 


2.1 Numerical accuracy 


...quelemieuxestl'ennemidu bien. 


- Voltaire 1 


Mauves ovriers ne troveraja bon hostill. 


- 13th century French proverb 2 


A mathematician and a physicist (the gender of each is not important, whether their gender is the same is not important; 
if of the same gender, they are distinguishable) walk into a bar (the kind of bar is not important, as long as it is not a 
sports bar, although if it were a wine bar it could be in Central Square, Cambridge or downtown Menlo Park). At the 
other end of the bar is a patron (the gender is not important, the distance is not important, the aerosol optical attenuation 
coefficient is less than 40 km -1 ) that both the physicist and mathematician show an interest in talking to (why is not 
important, about what is not important). The mathematician sighs in despair and says "We shall have to cross half the 
distance remaining an infinite number of times, and so will never get there!" The physicist gets up and starts walking, 
saying "I'll get close enough." 

The point of the anecdote may seem to be that physicists should not tell jokes. 

Instead, it is a parable that something, like explaining a subject, can be done correctly in a seemingly infinite number of 
steps, or there is a point where one gets close enough. "Correctly" maybe exact or fundamental. "Close enough" involves 
hand waving, approximation, analogy, reliance on simple models, and 0.064799 grams of NaCl. But close enough for 
what? As a practical matter, it should be "close enough to do a calculation". The most accurate theory is not always the 
best theory: much can be had by way of simplified models that get to the heart of the problem, lest one rues (like Hamlet) 
that".. .the native hue of resolution is sickbed o'er with the pale cast of thought... " 3 The gap between understanding and 
doing can be large. Though number crunching (like facts) is not wisdom, wisdom nevertheless is informed by calculation. 
A parsimonious calculation is an art. 

A good place to start is therefore with a consideration of computations and unit analyses used by electron emission 
and charge transport calculations, where electrons interact with atoms. For the former, the calculation of Riemann's zeta 
function serves as a useful example of computational challenges, given its importance and presence in many equations 
encountered in describing electron emission. For the latter, the prototypical hydrogen atom motivates convenient units. 
Those units are most easily expressed using length, energy, time, and charge scales of [nm], [eV], [fs], and [<?], respectively. 

Access to astonishing computational power is the birthright of the present generation, but power does not make right, 
so to speak. Riemann's zeta function f(p) (Section A2.3 and revisited in Section 6.3) illustrates a pitfall that attends all 
calculations, namely, the accuracy of numerical results: a computer will always give a number - whether it is what one 
believes it to be is another matter. The evaluation of f (3) and f (3/2) are pedagogically valuable aside from being topical 
in the evaluation of the electron density in Section 6.4 or the Stefan-Boltzmann constant of Eq. (3.24), and go to the 
heart of how much confidence should be placed in numerically calculated integrals using packaged programs. The task 


1 ".. .the best is the enemy of the good." Voltaire, La Bfyueule: conte moral, Google eBook, http://google.com/books?id=cy4HAAAAQAAJ, p. A3. 
2 "A bad workman will never find a good tool." Martin H. Manser, Rosalind Fergusson, and David Pickering, The Facts on File Dictionary of Proverbs 
(Facts on File Writers Library), 2nd edn. New York: Checkmark Books, 2007, p. 17. 

3 Ref. [37]: Hamlet, m.i.84-85, p. 951. 
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is therefore to compare computed values with the known exact values 4 quoted to a ridiculous precision: 

C(3)= 1.20205690315959428539973816151 (2.1) 

f(3/2) = 2.61237534868548834334856756792 (2.2) 

Begin with f(3): from its series definition in Eq. (A2.8) for the first 10 5 terms, a numerical evaluation gives f(3)» 
1.2020569031097323. Compare to a symbolic math integration program value of f(3) » 1.202056909910681: though 
less accurate, an error of 4 x 10 _9 % is tolerable. 

However, the series expansion for f(3/2) does not converge as quickly and the naive series summation is less use¬ 
ful and, not surprisingly, inaccurate by the third significant digit even for 10 5 terms. A symbolic math integration of 
f(3/2) « 2.6123751617794717 is accurate to 6 X 10 -6 % (the numerical approach of Section A2.3 is better), and therefore 
differences arise after the seventh digit. Gaussian quadrature methods work well (as in Section Al.2.3) if the underlying 
integrand function is a polynomial, which \[x (which appears in the numerator of the integrand for f(p) in Eq. A2.6) is 
not. Caveat computantum: 5 the numerical technique matters. 

Conversely, unless one is a pedant, accuracy to six digits is often adequate and numerical evaluation of integrals and 
series, if done with care, is sufficient even though different numerical means give slightly different ends. Therefore, 
values arising from numerically calculated integrals, as tjfp) shall be in addition to many other examples found herein, 
are accurate to at least six digits, an accuracy that would bring the ersatz physicist of the above parable to a small fraction 
of the diameter of a human hair for initial separations typical of most bars and more than "close enough". 


2.2 Atomic-sized units 

"What's in a name?” asks Juliet. “That which we call a rose by any other name would smell as sweet ." What's in a name is that everyone 
in a language community tacitly agrees to use a particular sound to convey a particular idea.. .Now any of us can convey the thought by 
making the noise. 

- Steven Pinker 6 

...a word is not a relationship between a sound and an object. It is an agreement among people who share a joint representation of 
things in their world, and who share a set of conventions for communicating with each other about those things. 

- Jonathan Haidt 7 

There is a tendency, as in elementary particle physics and cosmology, to economize with constants that appear fre¬ 
quently by setting them equal to 1, in particular, the electron mass m, the speed of light c, the unit charge q, and Plank's 
constant ti. While this has an advantage in making certain oft-recurring expressions easy to write, rendering electric 
field as dimensionless, and avoiding the problems associated with transitioning from units like centimeter-gram-seconds 
(cgs) to meters-kilograms-seconds-amperes (MKSA 8 ), particularly with respect to the potential energy of a collection 
of charged particles and the forces between them, it is at the cost of a powerful ally in understanding phenomena and 
finding error. Finding units natural to a problem and working in them is an advantage discarded at peril. When discussing 
electrons, however, the choice of meters, kilograms, seconds, and amperes (MKSA) are not those units. 

MKSA units are the units of technology and man-sized phenomena. Using a unit of energy equivalent to that which 
an electron acquires by falling through a potential of 1 volt, namely, an electron volt [eV], is a good compromise. For 
tunneling, distances on the length scale of atoms, or nanometers [nm], and time scales comparable to the inverse fre¬ 
quency of optical (visible light) transitions [fs], are also useful. Doing this, the fundamental constants acquire values that 
do not involve large exponents and become convenient to use, as in Table 2.1. 

In particular, the fine structure constant a, which governs the strength of the electromagnetic interaction, serves 
as a bridge between various units, in particular cgs, where a = q 2 /tic, and MKSA, where a = q 2 /Ane 0 Tic and e 0 is the 


4 http://oeis.org/A002117 for f(3) and http://oeis.org/A078434 for ?(3/2). 

5 "One who calculates, beware" (more or less). 

6 Steven Pinker, Words and Rules: The Ingredients of Language. New York: Basic Books, 1999, p. 2. 

7 Jonathan Haidt, The Righteous Mind: Why Good People Are Divided By Politics and Religion. New York: Pantheon Books, 2012, p. 240. 

8 The designation MKSA has been eclipsed by the more recent International System of Quantities (ISQ), but MKSA is used here to reinforce 
the contrast to the nefq units introduced below, as the acronym reinforces the units involved. 
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Table 2.1 Fundamental constants in units characteristic of atomic phenomena: 
elementary charge [q], length [nm], time [fs], energy [eV], and temperature [K], 


Symbol 

Definition 

Value 

Unit 


4 

Unit charge 

1 

9 


c 

Speed of light 

299.792 

nm/fs 


m 

Electron rest mass 

510999 

eV/c 2 


k B 

Boltzmann's constant 

(11604.5)- 1 

eV/K 


h 

Planck's constant 

0.658212 

eV-fs 


n a 

Avogadro's number 

6.02214X 10 23 

- 


a 

Fine structure constant 

1/137.036 

- 



permittivity of vacuum, a has the same value (being a dimensionless constant) in all unit conventions; that it has a 
different defining equation depending on the units in question is a source of vexation for initiates. 

There is some practicality therefore in basing a description of forces and potential energies between charges in terms 
of a. The fine structure constant governs the strength of the electromagnetic interaction between charged particles, and 
therefore it will play an integral role in descriptions of the hydrogen atom. Thus, units and conventions which push a to 
the fore will already be of a size and character to conveniently discuss the interaction of electrons with their environment, 
atomic phenomena, and tunneling phenomena. 

Joining the electron charge with potential differences and electric fields gives energies and forces, respectively. As with 
electron charge q with potential cp giving potential energy V = qcp, it is easy to speak of a force F on an electron as the 
product of the electron charge q and the electric field £, or F = q£. 

The hydrogen atom motivates units that are agreeable to electron emission physics as it is simple, very well understood, 
introduces the right scales for atomic phenomena, resolves ambiguity, and is well studied in both classical electrodynamics 
and quantum mechanics. It shall therefore serve below as the basis for units to be adopted. 

Still, using F instead of q£ takes patient accommodation on the part of the reader. Perhaps an analogy helps: were an 
inquiry as to someone's weight W to receive an answer of "784 Newtons" rather than "80 kilograms", the questioner 
would look askance at such a response, even though the first answer is technically correct: weight is measured in Newtons. 
What is being sought differs from what is being asked: it is mass M that is the intrinsic property but ignoring the acceleration 
due to gravity g = GMq/Rq = 9.8 m/s 2 in W = Mg is so commonplace that in polite discourse responding to weight 
questions with mass answers is routine. The units of force F and electric field £ are analogous, 9 but even more so, for in 
the units of [eV/nm] in the former and [GV/m] in the latter, they are also numerically the same. Speaking of force when one 
is really interested in field or vice versa is then much like speaking of weight when one is really interested in mass, but 
the grounds for doing so are better because the conversion factor in [nfeq] units is unity (see Tables 2.1, 2.2, and 2.3). 
Still, one should not lose sight of the fact that although the difference matters for particular units, the charged particle 
being an electron and the potential being volts begs for units of [eV], and also because as far as number crunching is 
concerned in [nefq] units, it is a difference that makes no difference. 10 


2.2.1 Bohr model of the hydrogen atom 

Whatever the alteration in the laws of motion of the electrons may he, it seems necessary to introduce in the laws in question a quantity 
foreign to the classical electrodynamics, i.e., Planck's constant, or as it often is called the elementary quantum of action. By the introduction 
of this quantity the question of the stable configuration of the electrons in the atoms is essentially changed, as this constant is of such 


9 A purist might object that q is analogous to M, whereas g is analogous to £ given that the latter convey the influence of the environment on 
the dynamics of a massive or charged object, but that purist would thereby miss the point of the analogy in the first place. 

10 The popular adage "A difference that makes no difference is no difference" is often attributed to William James (William James: The Essential 
Writings, ed. Bruce Wilshire, Harper & Row (Harper Torchbooks, TB1616), 1971, p. xiii) but only appears in the Forward by Wilshire as" ...the 
notion of eternally true but as yet undiscovered propositions lacks a full complement of meaning. That is, the notion is a theoretic difference 
which makes no differencerather, the adage is from James Blish (Spock Must Die! [a Star Trek Novel], Bantam Books, 1970), which is strangely 
fitting: popular usage is wrong in detail even as the truth is captured. 
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Table 2.2 Conversion of MKSA units expressed in units of [nm], [eV], [fs], and [g = 1] parameters 
[nefq]. To go from MKSA -* nfeq, multiply by the factor. 


Quantity 

MKSA unit 

Factor 

nefq unit 

Length 

Meter (m) 

10 9 

nm 

Time 

Second (s) 

10 15 

fs 

Energy 

Joule (J) 

6.2415093 xIO 18 

eV 

Charge 

Coulomb (C) 

6.2415093 xIO 18 

q 

Force 

Newton (N) 

6.2415093 xIO 9 

eV/nm 

Potential 

Volt (V) 

1 

eV/g 

Current 

Amp (A) 

6241.5093 

g/fs 

Current density 

Amp/square centimeter (A/cm 2 ) 

6.2415093 xIO- 11 

g/(fs-nm 2 ) 

Field 

Gigavolts/meter (GV/m) 

1 

eV/(g-nm) 

Power 

Watt (W) 

6241.5093 

eV/fs 

Capacitance 

Farad (F) 

6.2415093 xIO 18 

g 2 /eV 

Resistance 

Ohm (£2) 

(6.2415093)- 1 

eV-fs/g 2 


Table 2.3 Fundamental constants in units characteristic of atomic phenomena: elementary charge 
[g], length [nm], time [fs], and energy [eV], The mixed units of the last three A-B constants reflect 
standard usage in commonly encountered units for the evaluation of current density. The value given 
for v assumes a work function of 4.5 eV. 

Definition 

Formula 

Value 

Unit 

er rest energy 

me 2 

510999 

eV 

Length (a 0 ) 

h/amc 

0.052918 

nm 

Time 

2mJ«c 

0.1520 

fs 

Charge 

t? 

1 

q 

0 

ahC/4 

0.35999 

eV nm 

2 Q/a 0 

lm(ac) 2 

13.605692 

eV 

a rld 

(mq/2h)(k B /)ch) 2 

120.1735 

A/(cm 2 K 2 ) 

A q 

q/06x 2 h) 

1.521334x10- 6 

A/eV 

B a 

(4/3h)'j2m 

6.83089 

1/(eV'/ 2 nm) 

v 

(80/9)^271)7® 

0.772808 

- 


dimensions and magnitude that it, together with the mass and charge of the particles, can determine a length of the order of magnitude 
required. 

- Niels Bohr [38] 

In the Bohr model of the hydrogen atom, an electron orbits a proton but does so in quantized units of angular momen¬ 
tum. As the speed of light is the fundamental unit of velocity, it is convenient to speak of a particle's velocity as the product 
of a dimensionless constant with c. For an electron in a hydrogen atom, that constant of proportionality is a, and so the 
quantization condition is 

macr n = nh (2.3) 

The radius r x = a 0 = h/amc is the Bohr radius. As the velocity is non-relativistic, the kinetic energy is KE = (1/2 )m(ac) 2 , 
which is the Rydberg energy Ry. 

The centrifugal force the electron experiences is equal and opposite to the attraction between the electron and proton, 
and so 

m{ac) 2 _ ahc ^ ^ 

a o a 2 0 

The potential energy is PE = -atic/a 0 , from which it is seen the magnitude of KE is half that of the potential energy PE, 
and PE = -IRy. 
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2.2.2 Quantum mechanical model of the hydrogen atom 


The electron can no longer be conceived as a single, small granule of electricity; it must be associated with a wave and this wave is no 
myth; its wavelength can be measured and its interferences predicted. It has thus been possible to predict a whole group of phenomena 
without their actually having been discovered. And it is on this concept of the duality of waves and corpuscles in Nature, expressed in a 
more or less abstract form, that the whole recent development of theoretical physics has been founded and that all future development of 
this science will apparently have to be founded. 

- Louis de Broglie [39] 


The same results are obtained directly as a proper mathematical solution of the time-independent Schodinger's 
equation, H'¥ = E y V. The solution for the coulomb, or 1/r, potential will be considered at length in Section 9.1, and is 
moreover a centerpiece of quantum mechanics texts (40-44). The problem is considerably simplified for the ground 
state (angular momentum quantum number l = 0), for which Schodinger's equation is 


\ 2 mr 2 dr\ dr) 


ahc 




(2.5) 


where E = h 2 k 2 /2m is positive, so that -E is a bound state on the right-hand side, which allows the equation to be written 

{jj + jjU K 2 = 0 (2.6) 

The ansatzxp <x e~ kr , suggested by the large rbehavior (i.e., (d 2 - k 2 )ys « 0), indicates that [( amc/h ) - k\/r = 0, or k = amc/h 
for the ground state. The expectation value 

I,- (\)y(r) 2 4nr 2 dr 

-—- =k (2.7) 

/ 0 °° y/(r) 2 4jrr 2 dr 

is the definition of the Bohr radius a^ 1 = (1/r) and identifies a 0 = h/amc. 

The size of a shows that the energy of interaction (2 Ry) is a factor of a 2 smaller than the rest energy me 2 of the electron. 
The combination tic appears very often and ahc = 1.44 eV-nmis straightforward to remember (1.44 = 1.2 2 ): it will figure 
prominently in the treatment of the image charge potential and the Schottky barrier factor. For the image charge, the 
combination appears frequently and motivates the introduction of 


Q = 


(2.8) 


Because (1.2/2) 2 = 0.36, it follows that 4 Q = 1.44 eV-nm to a good approximation (the actual value being 4 x 0.3599911 
eV-nm). 


2.3 Units based on emission 

...there is one result, though there might be a hundred anxious schemes. 

-The Yi King 11 

Say not, "I have found the path of the soul.’ Say rather, “I have met the soul walking upon my path ." For the soul walks upon all paths. 

- Kahlil Gibran 12 

There are many conventions for units, by means of which the same truths should be uncovered. Still, working with 
physical units that reflect the magnitude of the phenomena being examined makes for an easier path in the development 
of intuition and order of magnitude estimates without the requirement of tracking exceptionally large exponents. The 
emission-based units are tailored to calculate the electron number and current density associated with emission through 
and over surface barriers that exist between bulk materials and vacuum, and electron transport through bulk materials, 
be they semiconductors or metals. 


11 The Yi King, Sacred Books of the East, Vol. 16 , The Sacred Books of China, Vol. 2 of 6, Part II of The Texts of Confucianism (trans. James Legge). Oxford: 
Clarendon Press, 1882, Appendix note to Chapter V.31, p. 389. 

12 Kahlil Gibran, The Prophet. New York: Knopf, 1952, Chapter 17: On Self Knowledge. 
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Observe two consequences of shifting from [MKSA] to [nefq] units: mass [kg] is replaced by energy [eV] and current 
[A] is replaced by unit charge [q = 1] as the units defining the system. The first consequence is that the speed of light c 
can be attached to h and m to enable the more convenient quantities Q and me 2 to be used in calculations: energy-based 
units are convenient especially when treating Fermi levels and work functions, or with tunneling phenomena where the 
quantity _ 


- A [tT = i fJ-) a/2 —2 _ 4 V 1.022 X 10 6 eV = 6.831 
~ 3ft V m ~ 3 \ hc) V mC ~ 3 197.33 eVnm ~ 


(2.9) 


is easily calculated, as given in Table 2.3. The second consequence is that the electron unit charge q becomes embedded in 
the unit of energy [eV], charge density is numerically equivalent to number density, and current density to the number 
of electrons passing a unit area [nm 2 ] in unit time [fs]. The [nefq] units therefore simplify the treatment of Schrodinger's 
equation describing the wave function, and Poisson's equation relating distributions of charges to potential energy. 

A third reason, but less consequential here, is that combinations of the fundamental constants associated with the 
hydrogen atom appear in possibly surprising places. For example, a characteristic time is defined by how long a photon 
takes to traverse the radius of a hydrogen atom, or c/a 0 . The ratio of two energy scales, namely the potential energy of the 
hydrogen atom ground state 4 Q/a 0 and the rest energy of the electron me 2 , is 1 /a 2 . Joining them gives the Alfven-Lawson 
current of Eq. (33.12). Such products of factors are simply a consequence of the fundamental units of nature and have 
no bearing on what is under discussion. 13 

Realizing that F is a force rather than a field takes some getting used to, but an example helps. Consider the Schottky 
factor y/4QF, which describes how much a potential energy barrier to emission lowers when an electric field is applied to 
a surface: it is generally treated as lowering the work function ®, an energy measured in units of [eV], to ® - \J4QF = 
(1 -y)®, where 

_ V4 QF 


y=- 


( 2 . 10 ) 


The Schottky factor appears often in electron emission theory, so that familiarity with how to calculate with it is 
advantageous. 


EXAMPLE: Consider a metal such that ® = 4.5 eV. By what fraction does the emission barrier lower for fields 
characteristic of thermionic emission, where the electric field is 0.1 MV/m? 

SOLUTION: The electric field corresponds to F = 0.1 eV/pm, and so 

y = —-—-i/(l.44 eV nm) ( 0.1—] = — 

7 4.5 eVy V F m / 375 


EXAMPLE: Consider a metal such that O = 4.5 eV. By what fraction does the emission barrier lower for fields 
characteristic of field emission, where the electric field is 9 GV/m? 

SOLUTION: The electric field corresponds to F = 9 eV/nm, and so 

y = —-— J{ 1.44 eV nm)(9 —) = - 
y 4.5 eVV V nm/ 5 


13 The consequences of this are that one can, if one is sufficiently perverse, introduce fundamental constants like h where they are not normally 
expected, as in Poisson's equation in Eq. (16.1), by using atic instead of q 2 /4ne 0 = 4 Q because of the ubiquity of a in relating the electron 
velocity in a hydrogen atom to the speed of light, or m(ac) 2 /2 = q 2 /4ire 0 a 0 = 4 Q/a 0 , as in Section 23.4. This produces vexation and assertions 
that quantum mechanical units should not appear in classical equations. But one is dealing with electrons: quantum mechanical language is 
woven into the language of the discussion. 










CHAPTER 3 

Pre-quantum models 


3.1 Discovery of electron emission 

The question is, what is the origin of this current? How is it produced? Since we have within the globe a nearly perfect vacuum, we cannot 
conceive the current as flowing across vacuous space, as this is not in accordance with our pre-conceived ideas connected with higher 
vacua ...I have no theory to propound as to the origin of these phenomena. 

- Edwin J. Houston [45] 

In October, 1884, two months after the cornerstone of the Statue of Liberty was laid in New York Harbor, and two 
months before either the completed Washington Monument became the tallest structure in the world or the first pub¬ 
lication of Mark Twain's satiric masterpiece The Adventures of Huckleberry Finn in the UK and Canada (two months again 
before being published in the USA), Professor E.J. Houston's presentation [45] at the first American Institute of Elec¬ 
trical Engineers meeting in Philadelphia described a seemingly inexplicable high-vacuum phenomena first observed by 
Edison. 1 Edison had invented an incandescent lamp in which a carbon fiber was enclosed in a glass vessel, the ends 
of which were connected to a battery. In a modified lamp, as shown in Figure 3.1, an additional strip of platinum foil 
was inserted and connected to a galvanometer, for which one terminal was connected to the positive terminial (+) and 
the other connected with the platinum strip. For ordinary current levels, light was produced with no unusual effects, 
but as the current levels rose so that the luminosity of the lamp increased markedly (a factor of 3 to 12), the needle of 
the galvanometer was strongly deflected by current passing through its coils. For the physical theories of the time, and 
as Houston reported to the audience, the origin of the current was a mystery. Even more mysterious, when the leads 
were switched so that the galvanometer was connected to the negative terminal, then the current flowed in the opposite 
direction but at about 1/40 of the level. Houston was baffled. 

Eventually the phenomena became known as the Edison effect and the current as cathode rays. Up until the 1890s, 
very different conceptions about the nature of these phenomena were debated. One explanation, identified with German 
physicists, was that cathode rays were a process occurring in the aether because their motion in magnetic fields was in 
circular paths rather than straight lines. A competing explanation argued that the cathode rays were material and were 
the paths of particles of negative charge. In 1895, Perrin demonstrated the particle nature of the cathode rays, and then in 
1897 J.J. Thomson at Cambridge University settled the matter definitively by showing the deflection of the rays passing 
between, essentially, a capacitor plate [47], showing that all of the particles had the same charge to mass ratio that was 
independent of the potential between the electrodes, and indifferent to either the residual gases within the tube or the 
chemical nature of the materials from which the rays were produced. For his work, Thomson received the Nobel Prize 
in physics in 1906. 


3.2 The Drude model and Maxwell-Boltzmann statistics 


...the great tragedy of Science - the slaying of a beautiful hypothesis by an ugly fact ...it is one thing to refute a proposition, and another 
to prove the truth of a doctrine which, implicitly or explicitly, contradicts that proposition. 

- Thomas H. Huxley 2 


1 As a result of Edison's advocacy, the Naval Research Laboratory opened in 1923 on the banks of the Potomac in Washington, DC, where pulsed 
radar debuted in 1934 [46]. The venerated inventor's bust greets visitors at the entrance (www.nrl.navy.mil/about-nrl/history/edison/). 
2 Thomas H. Huxley, Presidential Address at the British Association for the Advancement of Science: Biogenesis and Abiogenesis, Collected Essays, 
8, 1870. 
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connected 
known as 



Schematic of an incandescent lamp containing a carbon fiber. The fiber (red line with current flowing in the direction of the red arrow) is 
to a battery (D) and a strip of platinum foil (P). At high current, the galvanometer (G) is deflected "violently" by a mechanism which became 
the Edison effect and was subsequently related to the thermal emission of electrons. Based on Figure 1 of ref. [45]. 


Drude developed a free electron theory of metals in 1900 that was instrumental in explaining electrical conductivity 
and its relation to thermal conductivity [48]. 3 Assuming that all of the electrons in a metal had a velocity v = \J^k R T/m 
much like ideal gas atoms, and that scattering with the lattice atoms randomized their direction, Drude found that the 
ratio between the thermal and electrical conductivities of a metal was 3 {k B /q) 2 T. Lorentz improved the model by assuming 
that the electron velocities were distributed according to Maxwell-Boltzmann (MB) statistics. The Drude model was a 
watershed event in the understanding of electrons in materials, a circumstance all the more tragic for being mistaken 
about the statistics that the electrons obeyed. What replaced it, however, was remarkable. 

The seemingly successful use of MB statistics in the Drude model lent credence to its analogous use in the first derivation 
of an equation to describe thermionic emission (Section 5.7), in which electrons were thought to be boiled off a metal. 
Boiling increases with temperature, and therefore the temperature-dependence of thermionic emission was in stark 
contrast to the temperature-indifference of field emission [49], although that was because the temperatures applied 
were not high enough in field emission experiments [50] to reveal an effect. As a result, thermal-electrons were thought 
to be a different kind of particle than field-electrons, and they were dubbed "thermions" so that emitters producing them 
were called "thermionic" emitters (unfortunately the name stuck, even after thermal-electrons and field-electrons were 
recognized as being just electrons). The first formulation of the thermal emission equation came remarkably close in its 
form to the "modern" formulation. Its derivation shows the power of statistical reasoning even when a major feature of 
the problem, the indistinguishable nature of discrete fundamental particles and what that entails, was not yet realized. 
Understanding the physics of the early theories and the problems they faced is more than an interesting historical yarn. 
It helps form a deep appreciation for the bravura on the part of many who brought forth a coherent narrative of electron 
emission in all its forms. 

First, though, the MB distribution needs to be understood if only just a little bit (for more, see [51]). Although treated 
at length in Section 5.4, a working understanding of MB distributions adequate to present needs is easy to come by. Let 
the "environment" be a small box not subject to potential gradients (like gravity or electric fields). The MB distribution 
can be thought of as the probability that a gas molecule in thermal equilibrium with its environment has an energy E. It 
is intuitive that having more energy is progressively more unlikely, and so if E v < E 2 , it is expected that /(£ 2 )//(Ej) < 1. 
It is also intuitive that the probability should not depend on where the zero point of the energy is measured from, and so if 
energy is measured from a different reference \i such that E n =E' n - p then/(£')//(£(,) = /(£, + p)/f(E 2 + p). A different 
way of expressing the equality is 

In \f{E l + p)\ - In \f(E l )] = In \f(E 2 + p)\ - In \f(E 2 )\ (3.1) 

but because p is at the moment arbitrary and therefore arbitrarily small, and because of the similarity of Eq. (3.1) to 
finite difference methods (see Section Al.3.2) and therefore differentiation, that in turn implies dln\f(E)\/dE =const. 
= -/?, where the is a consequence of/(E) decreasing as E increases, and so/(E) oc exp(-/?£). 

The discussion has been so general that the significance of /(E) could be mistakenly overlooked. To emphasize an 
important point, f{E)dE is the probability that a particle can be found with an energy E, and this will be profoundly 


5 It also treated optical properties, but wait for that until Section 31.6. 
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important in treating the distribution functions and statistics below. Probabilities are normalized so that their sum, 
or integral, is 1, and so 

m= ' ™ =pe~‘ IE dE (3.2) 

fo e-VdE 

The average energy is then the integral of E with its distribution, or 

(E) = J Ef(E)dE = i (3.3) 

If two same-sized boxes of the same kind of gas with the same number of particles are mixed, and they are characterized 
by /?, and (S 2 , respectively, then the average energy before mixing should be equal to the average after mixing, or 

|[<£i) + (E 2 )] = (E mix ) + (3.4) 

Because a measure of the average energy of a gas is temperature T, this shows that p = l/k B T, where k B relates temper¬ 
ature to energy. 

Still, why would anyone think of electrons as a gas? Well, as Drude showed, there some pretty good reasons, and 
electrical conductivity is one of them. 


3.2.1 Electrical conductivity 


One of the principal characteristics of materials is their ability (or lack of ability) to conduct electrical current... if one takes the conductivity 
of superconductors, measured at low temperatures, into consideration, this span extends to 40 orders of magnitude ...(t)his is the largest 
known variation in a physical property and is only comparable to the ratio between the diameter of the universe (about 10 26 m) and the 
radius of an electron (10~ 14 m). 

- Rolf E. Hummel [52], p. 80 

In the Drude model, electrons freely roam through the metal, their electrical charge being balanced by the ions, so the 
electrons act as if they are independent and therefore only respond to an external field within the material. Their motion 
as a consequence of the field leads to current flow. Much like a gas of particles, the electrons travel in straight lines for 
a time x until they collide with something. Drude assumed that the collisions were with fixed ions, but the causes of 
scattering are less important than the consequences. 

Particles subject to a force F accelerate, and so to prevent boundless acceleration Drude proposed that after each 
collision the velocity of the electrons was brought back to zero, and the acceleration begun afresh. If the number density 
of electrons p \q/ cm 3 ] is comparable to the number density [atoms/cm 3 ] of the metal (or equal, if each metal atom gives 
rise to a single electron), then the current density J is the product of the charge q traveling at an average velocity v ave 
with the density, or 

J q = ~W V ave (3-5) 


where the subscript q is used to distinguish a current of charge from a current of heat below. 

What "average velocity" means is nuanced: it is not simply the mid-point velocity between the start and just before 
a scattering event, and so it will not do to take v ave « (v max + v min )/2. What matters is the mean distance the electron 
goes and the mean time it takes to get there. "The mean time between collisions is x" is equivalent to saying that the 
probability that an electron experiences a collision in time t is P(t ) a e~‘E (see Section A3.30.1). To normalize it, take 


Pit) = 


e t/r 

fo°° e- t,T dt 


If* 


(3.6) 


The average velocity is the integral of v(t) = Ft/m over the probability P(t ) that the electron has that velocity, and so 




(3.7) 


The relationship between current flow and electric field is given by Ohm's law 


J q = -a£ = -a 


cdV 
q dx 


(3.8) 
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where a is the conductivity of a metal. Combining Eq. (3.5) with (3.8) gives 



(3.9) 


Observe that the current is therefore related to the product of a charge conductivity with the gradient of potential energy. 
The electron collisions are not without consequence: the kinetic energy they acquired from acceleration is ultimately 
transferred to the lattice (Joule heating) and the impediment to their motion by suffering those scatterings is known as 
"resistivity" (1 /a). 


EXAMPLE: What is the conductivity of a copper-like metal for which t = 25 fs and p = 8.41 x 10 22 cm 3 ? 
SOLUTION: Convert the density to p = 84.1 nm' 3 and the electron mass m = 5.68563 eV [fs/nm] 2 . Then, using 
Table 2.2, 


a = q 7--- f84 ' 1 r 

( 5.68563 evf —) ) 


Electrons obviously carry charge during transport from one region to another, but they also carry energy. If the tem¬ 
perature of the electron gas is uniform, no net transfer of energy occurs across a reference plane, but if a difference 
in temperature exists across that plane (a temperature gradient), then a flow of heat occurs. Metals are known to be 
both good current conductors and good heat conductors: if electrons are both the carriers of electric charge and thermal 
energy, then the assumption that the collision processes governing the flow of heat and charge are mediated by similar 
mechanisms has consequences. Therefore, consider thermal conductivity. 

3.2.2 Thermal conductivity 


. ..hot thoughts beget hot deeds... 

William Shakespeare 4 

As J q was the flow of charge, so is J T is the flow of heat, and is the product of a thermal conductivity k with a gra¬ 
dient of a temperature energy k B T or J T = - K{dT/dx ); the sign designates that energy flows from hot to cold. Electrons 
cross a boundary (say, the y-z plane) with a velocity v x . Because the +x region has a different temperature than the -x 
region, the electron velocities from the + region will differ from those from the — region. The net flow of energy is the 
difference, or 




(3.10) 


(3.11) 


and so k = 2lp(d(E x v x )/dT), where l is the average distance the electrons travel between collisions, or the mean free path 
v ave r. The factor of 21 instead of simply l occurs because, on average, the last collision an electron experiences is a distance 
/ from the x-y plane for both sides of the plane, as in Figure 3.2. 

The evaluation of the averages of a gas of particles, be they atoms of an ideal gas or a gas of electrons, requires a 
distribution of velocities. Drude made the reasonable assumption that the distribution f(v) was MB, or the probability 
that a particle has an energy E = mv 2 /2 is f MR (v) <x exp(-pE) where p = 1 /k B T and E = mv 2 /2. The MB distribution is 
useful because 

f m (v) = (3.12) 


4 Ref. [37]: The History of Troilus and Cressida, III.i.l20,p. 996. 
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from which it follows that for some quantity that depends only on v x , or 0{v x ) = O x , averages with respect to the nor¬ 
malized distribution f MB (v) satisfy 

(O x ) = J O x f MB (v)dv x dv y dv z = J O x f MB (v x )du x (3.13) 

To make the notation simpler, suppress the MB subscript on /(e). 

There are subtitles. ( v x ) = 0 as the distribution f(v x ) is an even function in v x , which makes (v x f(v x )) identically zero if 
n is odd. Finding a usable average velocity therefore takes one of two approaches. The first is to take v ave « y/(v x ), that is, 
to approximate the average velocity by its root mean squared (rms) value. The second approach is to consider the average 
velocity in the +x direction (v x ), which will be close to (v~) if the temperature gradient is small. 


EXAMPLE: What is the rms velocity of an electron thermalized with the lattice? 

SOLUTION: A straightforward calculation gives 


2 v x exp {-pmvj,/2)dv x 

(3.14) 

X exp {-pmvl/2)dv x 

2k B T fZ, s 2 e^ds k B T 

m IZ e ~ s2ds ~ m 

(3.15) 

Observe that this implies m(v 2 )/2 = (3/2 )k B T as a consequence of the equipartition theorem. At 300 K, - \J{v 2 ) = 

csj3k B T/mc 2 = c\/(3/38.7)/(5.11 X 10 5 ) = 3.896 X 10 _4 c. 



EXAMPLE: What is the average positive velocity of an electron thermalized with the lattice? 
SOLUTION: Including only those electrons traveling in the +x direction in the distribution. 


(v + x ) 


/ 0 °° v x exp(—lJmv x /2)dv x 
exp (~pmv x /2 )dv x 


or 79.8% of yfW)- 


(3-16) 


A calculation similar 


Eq 3.16 gives (Eje±> = {2k\Tl/jim) 1 / 2 or 

4<«»3C,(iT') ' 


(3.17) 
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Collecting results and using l « (v^)r gives 


6t klT 


nm 


(3.18) 


3.2.3 The Lorentz number for metals 


But that's not unusual/No it isn't strange 

After changes upon changes/We are more or less the same. 

- Paul Simon 5 


There is no apriori reason to take the t of Eq. (3.9) to be that of Eq. (3.18), but if the thermal and electrical relaxation 
times are the same or close then the Wiedemann-Franz law predicts that the Lorentz number L is given by 



(3.19) 


That this might have been anticipated is well-captured by Ziman ([53], p. 232): 6 


In electrical conduction each electron carries its charge q, and is acted on by the field q£. The current per unit field is proportional to q 2 . 

In thermal conduction each electron carries its thermal energy k B T, and is acted on by a thermal force kVT. The heat current per unit 
thermal gradient is proportional to k? B T. The ratio of these two transport coefficients must be of the order ofl^ B T 2 /q 2 ; the factor k 2 /3 
comes from the fact that we are dealing only with electrons on the Fermi surface, obeying Fermi statistics. 

with a well-placed teaser at the end about what is in store because of the statistics electrons obey. The Lorentz number 
L is therefore a ratio of fundamental constants and independent of material properties, and is approximately L crude = 
2.8365 x 10 -8 [eVlq K] 2 = 2.8365 x 10 -8 [A/L2 K] 2 , where "crude" means that consideration of quantum effects much 
improves the estimate (see Eq. (3.20)). It is a striking achievement of Drude theory that for actual metals, the ratio of their 
thermal and electrical conductivities matches over a range of temperatures for a variety of metals, as shown in Figure 3.3. 


3.2.4 Deficiencies of the Drude model 

Since.. .truth emerges more readily from error than confusion, we consider it useful to leave the understanding at liberty to exert itself... 

- Francis Bacon 7 


Figure 3.3 The Lorentz 
Quantum mechanical c 



5 Paul Simon, lyrics to The Boxer, 1968. 

6 Ziman uses e rather than q for unit charge, and k rather than k B for Boltzmann's constant; to maintain consistency, the terms were rendered 
as used here instead of adhering to the conventions of the source material. As a general matter, Ziman is an excellent resource on transport 
and scattering. 

7 Francis Bacon, Novum Organum, Great Books of the Western World, ed. Robert Maynard Hutchins, Vol. 30, Francis Bacon. Chicago: Encyclopaedia 
Britannica, 1952, Aphorisms Book If, XX. 
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Figure 3.4 Heat capacity of copper. At low temperatures, C v ss yT+AT 3 , where (C e = yT) is due to electrons and (C t = AT 3 ) to the lattice. 
Low-temperature data (black circles) from Figure 5 of Corak et al. [54]; high-temperature data (black squares) from the American Institute of Physics 
(AIP) ([55], pp. 4-106). At high temperature, the heat capacity approaches the Dulong-Petit value ( C v -* 3 N A k B ) for one mole. 

The limitations of Drude theory are traceable to its assumption that the electron distribution function is MB. The 
specific heat capacity C v = dE/dT for solids at high temperatures is known to be 3 Nk B (the Dulong and Petit value), as 
in the discussions surrounding Eq. (5.13), a finding dependent on/(u) being an MB distribution. However, metals at low 
temperature depart from that relation, where the electron contribution to the specific heat dominates and is linear in 
temperature. This is shown in the low-temperature region of the heat capacity of copper in Figure 3.4, in which the red 
line fit to the data follows the form C v = 0.00016099T + 1.1921 x 10 _5 T 3 in the units of cal/mole-K: C v has been scaled 
by the ratio of the density with the atomic mass of copper, or (8.96 g/cm 3 )(63.5456 g/mole) _1 = 0.141 mole/cm 3 . 

An early success of quantum mechanics was the explanation of heat capacity, and as a part of the new physics electrons 
were found to be described by Fermi-Dirac statistics, a distinction that corrects the Lorentz number to 



(3.20) 


as shown in the treatment leading to Eq. (7.50), and produces the correct temperature dependence of C v = C e + C, = 
yT + AT 3, at temperatures well below the Debye temperature, as shall be shown when the quantum distribution functions 
are considered. The Sommerfeld model, which relies on the Fermi-Dirac distribution, corrects the Drude model and 
therefore forms the basis on which the equations of electron emission are erected. 

3.3 The challenge of photoemission 

...a light quantum delivers its entire energy to a single electron ...the velocity distribution of the ejected electrons will be independent of 
the intensity of the incident light; on the other hand the number of electrons leaving the body will, if other conditions are kept constant, 
be proportional to the intensity of the incident light. 


- Albert Einstein 8 


Given the long run of Thomas Edison's incandescent light bulbs in dispelling darkness for much of the 20th century, 
it is almost intuitive that heated objects (in the light bulb, a tungsten wire filament) emit light, and the spectrum of the 
light they emit is related to their temperature - in the case of a tungsten wire, a temperature comparable to 2200 IC 
produces light in the visible region of the spectrum. A warm object that radiates light is termed a "black body radiator". 

In 1900, Lord Rayleigh derived the distribution in power as a function of frequency (the Rayleigh-Jeans formula). If 
light is imagined to be enclosed in a box and at thermal equilibrium with it then the energy density u(a>), or the amount 


s Albert Einstein, Concerning the Emission of Cathode Rays Through Illumination of Solid Bodies, Ann. Physik., 17, 132, 1905, as translated in 
American Journal of Physics, 33(5), May 1965, Section 8. 
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of energy in each phase space element, can be derived. The phase space concept will be revisited in much greater detail 
in Section 5.1, but for now the phase space volume is (2/(2 jt) 3 )£$, where the 2 in the numerator of the coefficient is 
for each polarization of light, a factor of 2 n is associated with each component (k x , k y , k z ), and hk is momentum. The 
average energy e w of each mode of light (represented as little harmonic oscillators for which the kinetic and potential 
energy is k B T/2, the sum of which is the total energy) is k B T. Lastly, for light, the relation between energy hco and 
momentum is hco = tike. Putting the pieces together, the Rayleigh-Jeans formula is, after integration over the angular 
part dip f* sin OdO = An), 


e m —^-—4 nk 2 dk = k B T-^—dco 
(2 ny ^ 2 c 3 


u r (co)dco 


(3.21) 


which shows both the temperature T and angular frequency co dependence (although the account here is beholden to 
modern notation and parameters). While u r {oo) captures the long wavelength (short frequency) spectrum of black body 
radiators rather well, it exhibits the disquieting feature that the total energy contained in the cavity, or the integral over 
all frequencies, is infinite. 

Shortly before, in 1896, Wien developed a relation for the high-frequency regime, thereby avoiding the so-called 
"ultraviolet catastrophe" that the Rayleigh-Jeans formula suffers, but departed from observations at low frequencies. 
Wien's formula is equivalent to 

u(w)dw=^—e~ tuo/kBT dco (3.22) 

n 2 c 3 

Enter Planck, who realized that the Wien and Rayleigh-Jeans formulae were high- and low-frequency limits of an 
equation of the form 

u P^ da = jjf Utr.i ) dm < 3 ’ 23 > 


Planck found that he could derive his formula if he assumed that, for whatever reason, the modes of the electromagnetic 
radiation appeared as integer multiples of a unit energy [43], or hco = nhco 0 , transforming the integral over frequency to a 
sum over n. The derivation is deferred to Eq. (6.4) in Section 6.1 below (there, because photon number is not conserved, 
the parameter /a, or chemical potential, is 0). The total energy U, that is, the integral over u(m) over all frequencies, 
thereby became finite, and the ultraviolet catastrophe averted. In fact, U is 


C - 1 

nWcip 4 J 0 e* - 1 n 2 h?c 3 P 4 


157i 3 c 3 /? 4 


(3.24) 


where the integral is the product of the Rieman zeta function f(4) = n 4 /90 (see Section A2.3) with the gamma function 
T(4) = 3!. 


EXAMPLE: Calculate the coefficient of T 4 in Eq. (3.24) and compare it to the constant a sb in the Stefan-Boltzmann 
equation relating total power radiated per unit solid angle, or j = oT 4 . 

SOLUTION: Inserting values for the fundamental constants. 


*4 

is {hey 


n 2 (eV/11604.5 K) 4 
I? [(197.33 eV nm) 3 ] 


= 4.7222 X 10“ 24 


eV 

nm 3 K 4 


or, in MKSA, 7.5658 x 10 -16 [J/K 4 m 3 ]. The energy dU = Pdt radiated through a differential surface element is equal 
to the product of U with differential radial distance cdt and a factor cos 0d£l, where cos 0 is due to Lambert's cosine 
law (diffuse radiation is proportional to the cosine of the angle with the direction to the viewing location). Using 
dSl = sin OdO dtp in polar coordinates, an integration over all (p and over 0 <0 <n/2 (beyond n/2 is inside the sphere) 
gives P = (c/4)17 and so 


a sb = 


_nX_ 
60 c 2 h} 


= 5.6704 X 10“ 8 W/I< 4 r 
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Frequency / c [cycles/cm] 

Figure 3.5 COBE satellite data: the cosmic microwave background radiation (CMBR) of the Universe as a function of frequency. Data digitally extracted 
from Figure 2 of ref. [56]. The three fits to the data are the Rayleigh-Jeans law of Eq. (3.21), Wien's formula of Eq. (3.22), and Planck's formula of Eq. 
(3.23). The x axis corresponds to 2 kw/c in units of cycles/cm. 


There is perhaps no more impressive a black body radiator in the Universe than the Universe itself: the spectrum of 
the cosmic background radiation, a remnant of the Horrendous Space Kablooie, 9 thermalized during the first year of 
the Universe's 13.7 billion year expansion. The cooled spectrum measured by the COBE (short for cosmic background 
explorer) satellite is reproduced in Figure 3.5 and demonstrates that the present background temperature of the Universe 
is T = 2.735 ± 0.06 K [56]: the data is extremely well-fitted by Planck's formula assuming that B = 2 nh/k B T (the units 
of brightness on the y axis are indicated as arbitrary so as to not become enthralled with factors including, among other 
things, the aperture size of the measuring instrument and other minutia). 

Einstein's genius was in taking the quantization condition that lead to Planck's formula seriously, as actually indicating 
something about the discrete nature of light. That was a rather bold act given the astonishing coherence of the explanation 
of electromagnetic radiation by Maxwell's equations, ft allowed Einstein to explain at the same time photoemission [57] 
as the absorption of one quanta of light by an electron such that the electron had sufficient energy to surmount the 
barrier to emission that otherwise kept electrons inside the metal. The models of photoemission and black body radiation 
served as a powerful impetus in the development of quantum mechanics. Einstein was awarded the Nobel Prize in 1921 
for his explanation of the photoelectric effect. 

Quantization of light particles explained why the electrons were emitted immediately for photoemission from a metal 
surface even in low illumination conditions, rather than after a passage of time while the energy of the electron was grad¬ 
ually built up as a light-as-wave description would require. As to the barrier, Einstein referred to it as "the potential.. .of 
negative electricity relative to the body" but in modern parlance it is simply referred to as "work function" and designated 
®. Einstein predicted: 10 

If the derived formula is correct, then when represented in Cartesian coordinates as a function of the frequency of the incident light, 
must be a straight line whose slope is independent of the nature of the emitting substance. 

® is the (positive) potential of the conductor such that an electron photo excited from it is just barely collected by a 
grounded conductor nearby (a technique analogous to a retarding potential analyzer in field emission studies). 

How well does that prediction play out? Although the question is rhetorical (the Nobel Prize would not reward a 
failed prediction), it invites a reconsideration of the problem that introduces concepts of later utility. Light incident on 
a metal surface will excite many electrons, and so the number of electrons it liberates (a phrasing in keeping with the 
light-as-quanta concept) is also a measure of photoemission: by comparing the ratio of the number of excited electrons to 


9 "HSK" was coined in the Calvin and Hobbes strip of June 21, 1992; the appellation "Big Bang" is an alternate reference to the Universe's origin 
coined by the English astronomer Sir Fred Hoyle - ironic because he was being dismissive to the expanding universe model by giving it a snide 
moniker. 

10 Ref [57]: Concerning the Emission of Cathode Rays Through Illumination of Solid Bodies, Section 8. Therein, Einstein denoted the work function by 
n, but he also denoted the speed of light by L. The quote has been rendered to use the modern notation of ®. 
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the number of incident photons, or the quantum efficiency (QE), another means exists for considering Einstein's prediction 
using a wealth of QE data. 

The number of excited electrons is the ratio of charge collected over the elementary charge, or A Q/q, and the number 
of incident photons is the energy deposited over the photon energy, or A E/tico. Therefore, QE is 



(3.25) 


If the light from a laser or monochromatic source illuminates a surface for a time At, long by comparison to the emission 
time of the electron (not hard, since the latter is measured in picoseconds), and the emission area is the same as the 
illumination area, then AQ/AE = or the ratio of the electron current over the light intensity. As in Eq. (3.5), 
current is the product of charge, density of electrons, and their average velocity, but if the electrons are distributed in 
velocity, then the current density should be a sum over velocities weighted by their distribution, or 



(3.26) 


where f(k) is the distribution such that J f(k)dk is normalized. Without getting into the details, what can be inferred 
from J ? Much depends on the form of f(k) for the photo excited, electrons, about which nothing so far has been said. In 
the absence of information, the pragmatic approach is the Kelly Johnson motto of "keep it simple, stupid" 11 (or KISS): 
consider it later (see Chapter 14). What is more important is how the integral in Eq. (3.26) scales. This is not as outlandish 
as it seems: f(k) is dimensionless and therefore will not affect the scaling of J q . Similar behavior is seen in Eq. (3.16): the 
MB distribution in the exponential is dimensionless, therefore the integral ratio scales as v x <x \/k n T. 

Given that hk x is the electron momentum leaving the surface, and that dk = k 2 sin 0d0d(/>, one expects the integral to 
scale as k 3 dk <x (velocity) 4 , and therefore Einstein's prediction suggests 


QE <x (hm - <t>) 2 


(3.27) 


Consider QE data taken from copper as it is a very well characterized photocathode metal for rf photo injectors given 
its ruggedness and ease of use, even though its QE is relatively low, as is true for metals in general. Here, the data will 
be from ref. [58] for measurements of the copper photocathode used at the Linac Coherent Light Source (LCLS) at the 
Stanford Linear Accelerator (SLAC), although modified from its original representation of In QE vs. k, to use Eq. (3.27). 
The comparison is shown in Figure 3.6. The intercept of the graph is quite close to the commonly accepted value of the 
work function of copper of 4.6 eV, albeit that different crystal faces have different work functions, and the extraction field 
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Figure 3.6 QE vs photon energy. Data of Figure 8, ref. [58] recast in the coordinates suggested by Eq. (3.27). The applied field at the surface was 
50 MV/m, making the Schottky factor \fkQF = 0.268 eV. The work function of copper is approximately 4.6 eV. 


11 Johnson was the designer of the world's highest and fastest aircraft, the SR-71 Blackbird, for Lockheed's Skunk Works program, so his advice 
is characteristically practical; see Ben Rich, Clarence Leonard (Kelly) Johnson (1910-1990), a Biographical Memoir, Biographical Memoirs, Vol. V67. The 
National Academies Press, 1995, p. 231. 
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used to collect the emitted charge alters the apparent work function by a small factor (as per Eq. (2.10), a 50 MV/m field 
reduces the work function by \JAQF = 0.268). Thus, Einstein's prediction is borne out when hco — <S> is small, although 
Figure 3.6 does anticipate future complications: the dipping behavior of the curve right near the intercept, as well as 
the non-linear behavior for higher photon energies, suggests physics yet to be encountered, but which will require an 
understanding of the electron distribution's statistics. 
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CHAPTER 4 

Statistics 


... intelligence... may, on the other hand, go so far as to enable a man to be moved emotionally by statistics. 

- Bertrand Russell 1 

Even a small volume of metal contains an enormous number of electrons, for example 1 cm 3 of copper contains 
» 8.411 x 10 22 electrons, comparable to the number of atoms. The physics of a large number of atoms or electrons, 
particularly with respect to the distribution of their energy, is the purview of statistical mechanics. The most familiar dis¬ 
tribution may be the Maxwell-Boltzmann (MB) distribution, but understanding electrons in bulk matter and how they 
transport and scatter requires familiarity with the Fermi-Dirac (FD) distribution (whence "fermions"), which differs in 
important ways from the Bose-Einstein (BE) distribution (whence "bosons") already encountered for photons. Under¬ 
standing the distributions is aided through combinatoric arguments. Practicing with distinguishable particles prepares the 
way and introduces randomness, probability, and the concept of entropy. 


4.1 Distinguishable particles 

And when I search a faceless crowd 
A swirling mass of grey and 
Black and white 
They don't look real to me 
In fact, they look so strange 

- The Rolling Stones 2 

MB statistics govern non-interacting particles in thermal equilibrium and form the basis of the classical description 
of an ideal gas, but to the uninitiated that is hardly helpful. Analogies help make unfamiliar ideas easier to assimilate. 
Taking a cue then from the notion that the particles are distinguishable or individually identifiable, much like people 
(even in a crowd), an analogy between the role of energy for a gas atom and the role of income (money) in a person's life 
is actually a rather useful model on which to introduce a number of needed concepts. From two very simple hypotheses, 
an MB-like distribution emerges: 

• In an employment hierarchy the bottom-tier positions (generally workers) are many and the top-tier positions (gener¬ 
ally managers) are progressively fewer the higher up the management chain. Assume the tiers are labeled by j and the 
number of positions in each tier behaves as tz J+1 /«y = 1/a, with a > 1 a dimensionless factor governing the number 
of subordinates or employees compared to bosses or employers. 

• The income Ej for each tier; of the hierarchy is monotonically increasing and crudely linear. 3 Assume the variation is 
of the form E j+l - Ej = b. 

Instead of discussing workers and managers, it will be more convenient to generically refer to "households". An exponen¬ 
tial relationship between the number of households tij and an income bracket E } exists in that the number of households 
n decreases exponentially with an increase in income E, specifically, n(E) <x exp|-(£/l?)ln(a)| so that \n(a)/b takes over 
the role of p in an MB distribution. 

Bertrand Russell, The Basic Writings of Bertrand Russell. New York: Simon and Schuster, 1961, p. 423. 

2 Mick Jagger and Keith Richards, lyrics to Salt of the Earth, 1968. 

3 Perhaps very crudely, for example the 2012 GS Schedules of the Federal Government can be fitted by Ej a 12.45 + 3.25; for ; < 11 and 
E, as 46.29 + 13.100' - 11) for; > 11 for the step 1 positions, with similar curves for the higher steps. 


Introduction to the Physics of Electron Emission, First Edition. Kevin L. Jensen. 
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The simple hypothesis clearly cannot be correct over all scales: it fails at the extremes, as on the bottom (7 -*■ 1) where 
the social safety net provides assistance to the poor, or at the top (7 » 1 ) where income is power-law distributed (the 
so-called Pareto distribution in economics). The example here is meant to be more pedagogical than exact. 


EXAMPLE: Using the data of Table 4.1, what are the median and mean incomes evaluated conventionally? 
SOLUTION: The median value is that value for which half the population is below and half above. The number 
of households is 122,459,000 (sum over all rij). If S k = tij then the median point lies between S 10 and S H . 
Interpolating gives E median = $50,694 (compare the American Community Survey (ACS) value of $51,371). The mean 
value corresponds to the weighted average. For the N b = 42 median values of Table 4.1, it is 


(E) = 


Y,%n jE] 


A point worth observing is that a median ( E ) of median values Ej (which should be represented as (Ej)) is the median 
of the population where N = tij, because 




where E jk is the klh member of the income bracket (or group) j: if all income brackets had the same number of households 
n, then the unique correspondence between l and ( 7 , k) would be / = (7 - 1 )n + k, but in general the number of members 
of the group rij varies, be they households or energy levels. 

A plot of the US Census Bureau Data 2012 distribution shown in Table 4.1 and Figure 4.1 indicates that for the middle 
class, the exponential distribution is more or less reasonable. The noise in the data (a consequence of grouping households 
into $5000 bins) makes parameter extraction challenging, so consider instead the summed data shown in Figure 4.2, which 
represents an integrated distribution function of 




(4.2) 


Table 4.1 US Census Bureau Data 2012 for median income E: (in 
$100,000) and number of households rij (in millions). 


0.01183 4.2040 0.71988 

0.07944 4.7290 0.76950 

0.12419 6.9820 0.81983 

0.17262 7.1570 0.87148 

0.22230 7.1310 0.91955 

0.27193 6.7400 0.97103 

0.32080 6.3540 1.0179 

0.37056 5.8320 1.0718 

0.42034 5.5470 1.1209 

0.47160 5.2540 1.1713 

0.51902 5.1020 1.2197 

0.57121 4.2560 1.2715 

0.61931 4.3560 1.3201 

0.67042 3.9490 1.3711 


3.7560 1.4197 1.1190 

3.4140 1.4711 0.9200 

3.3260 1.5161 1.1430 

2.6430 1.5727 0.8050 

2.6780 1.6191 0.7310 

2.2230 1.6721 0.5750 

2.6060 1.7209 0.6160 

1.8380 1.7710 0.5700 

1.9860 1.8195 0.5020 

1.4640 1.8720 0.3640 

1.5960 1.9216 0.4320 

1.3270 1.9718 0.3780 

1.2530 2.2010 2.5490 

1.1400 4.3178 2.9110 
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Figure 4.1 Household income vs number of households. Data is from US Census Bureau Data 
2012 (Table 4.1), which gives the mean household income Ej in bins of width $5000 (high 
income bins were larger and therefore hold more households). Ep t = $15,000 (approximately the 
federal poverty level for a family of two). 



Figure 4.2 Cumulative household income vs number of households. Data is from US Census 
Bureau Data 2012 (Table 4.1). They axis is the number of households below the income level E, 
(x coordinate). 


That data can be parametrically represented as a function of the form A{\ - e BE ), the derivative of which with respect to 
E is the sought distribution function/(£) = ABe~ BE . Finding the mean income is then a trivial integration: 


f 0 °°Ef(E)dE _ i 
/ 0 °°mdE b 


(4.3) 


or (E) = $71,100, from which E median = (E) ln(2) = $49,300, close to the numerical values found above. 

It is worth pausing a moment and taking note of a number of important features. To generalize the discussion, refer 
to "particles" instead of "households". 

1. The particle can be labeled by its position in the total population (E,) or its position within its subgroup or energy level 

%■ 

2. The distribution of discrete values can take on the appearance of a distribution function, from which the statistical 
quantities of interest, such as mean value, are derived. 

3. The distribution can be thought of as the probability that the particle 1 in question was chosen from a certain energy 
level Ej, where that probability Pj is nJN -» f(E)dE. 

4. Two methods can be used to calculate the mean, and they are 


±2 E i = < M > = I / Ej p j ( 4 . 4 ) 

where E, refers to the energy of the l particle, Ej to the energy of the jth level containing rij particles with Pj = tij/N, 
and the last form is J Ef(E)dE in the continuum limit. 

The income data has features that belie the simple model. Some are as follows. First, a strictly exponential distribution 
function would suggest that the number of households below poverty is [1 - exp(-B£^ p )] = 19%, rather than the 13% 
of the first three rows in Table 4.1, indicating a "hidden" mitigating effect at one of the limiting conditions (e.g., poverty 
assistance programs). Second, if the "middle" census data were plotted as log(households) versus income, a discernible 
weak but present negative convexity at the low end and positive at the high is apparent, indicating complexity in the 
relation between income and number of households missed by the simple model (due to any number of causes for 
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treating the better off even better). Third, the simple model suggests the percentage of total income held by the bottom 
97.6% of the households (<$250K) is 1 - (1 + 2.5 B)e~ 25B = 86.5% for 1/B = 0.71274 (median income in $100K), which 
seems to run counter to commonly quoted statistics that put the percentage of wealth held by the bottom 99% closer 
to 65%. Income, although correlated with wealth, differs from wealth, so speaking of them interchangeably is incorrect 
(e.g., wealth includes property). Lastly, a perennial problem with logarithm plots is that the extremes do not extrapolate 
well from the middle, especially when the data is noisy, a case in point being the putative "top earner" characterized by 
N{ n top) = 1- suggesting E top = B~ l In (N) = $11.63M, which is 0.3% of the income of Bill Gates in 2012. 

This suggests some cautionary points worth emphasizing. 

1. Data with an underlying exponential dependence always looks linear on a log plot: the non-linearity can be subtle 
and obfuscate important features (like subsets of emission sites that put curvature on Richardson-Laue-Dushmann 
(Chapter 12) and Fowler-Nordheim (Chapter 13) plots of current density as a function of temperature and field, 
respectively). 

2. Conditions at the extremities of the log plot can signal the onset of unaccounted for physics (like space charge, which 
affects Miriam curves (Section 29.1) in thermionic emission and causes turnover in field emission, or even leakage 
current in field emission). 

3. The data shown may not be the only (or even best) indication of the phenomena putatively ascribed to it, or there may 
be more than one cause for apparent trends (like thermal-field emission, dark current, and competing mechanisms). 

4. Lastly, a relation, or law, about how something behaves is not the same as having a theory about why that behavior 
is so (a point revisited in Chapter 16). 

A final feature of the income model, buried above (particularly Eq. (4.2) and the so-called important points 3 and 4), 
illuminates the transition from a discrete to a continuum model, and introduces concepts related to phase spaces that 
will be important. The transition must make reference to the size of the bin or "cell." In the census data, the bin size was 
$5000 and Table 4.1 gives the values of (E } ) in each of those cells. Thus, in going from a summation of a discrete set fj to 
an integration over fix), the size of the cell must be respected in a manner familiar from approximating sums by integrals 
(asinEq. (A1.10)): 



(4.5) 


where £2 is the volume of space holding the population of points and Ax is the size of the bin. When finding mean values, 
the factor Ax appears in both numerator and denominator, and therefore factors out, so its existence was masked. If the 
bin were reduced to such a small region of "money" space that it contained only one income of one household, if that, 
then most of the cells would in fact be empty. The idea of small cells of phase space will be revisited in Section 5.1. 

The analog for particles of a gas takes account of the fact that the dynamics of a particle are determined if its position 
and velocity are known, and so the volume element of phase space corresponds to the product AxAc. Actually, from both 
classical and quantum mechanics, it is position x and momentum p = hk that matter (e.g., Hamilton's equations in classical 
mechanics and conjugate variables in quantum mechanics). The product of momentum and position is related to the 
action, and the principle of least action governs classical dynamics. Thus, it is seen that h is the unit of action. If the smallest 
cell size is taken as Planck's constant h p = 2nti, then the differential volume in phase space is dxdp / (Inti)* = dxdk/(2x) 3 . 
Consequently, the phase space analog of Eq. (4.5) is 



(4.6) 


4.2 Probability and states 

. ..knowledge and probability are of such contrary and disagreeing natures, that they cannot well run insensibly into each other. 

- David Hume 4 

Probabilistic reasoning can be non-intuitive, but the usage of a good methodology is of great aid. Given the pervasive¬ 
ness of probability in quantum mechanics, there is value in mimicking quantum mechanics methods and notation, so 


4 David Hume, A Treatise of Human Nature, Part 5, Section 1, “ Of Skepticism with Regard to Reason". New York: Clarendon Press, 1978. 
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the representation of probability in terms of vectors \y/) and matrices P is adopted for a model to show that probabilities 
must sum to unity and the sub-populations, or rather a complete basis set, must all be represented in calculations lest 
the solution be invalid because of a neglected population. 

A well-known and counterintuitive example is assessing the trustworthiness of a measurement. Measuring a feature 
not present is termed a "false positive." To make the example concrete, if the fraction of a population that has a trait is p, 
and an individual subjected to a measurement P tests positive, what is the probability that the individual has the trait? 
To make the example very concrete, suppose the trait is a particular low probability disease and the test is correct most, 
but not all, of the time. 

The basis states are the presence or absence of the disease: let u designate "un-diseased" and d designate "diseased" 
(or, equivalently, "up" and "down" - why will be seen in a moment). Then 

W) = c u \u) + c d \d) ( 4 . 7 ) 

with | u) and \d) a complete description of the population types (or basis states), and the coefficients Cj describing the 
population fraction. The basis states are "orthogonal" in that (u\d) = (d\u) = 0. The measurement P acting on | d) will 
detect the disease and its expectation value will return the fraction p of the population that has it. If P is a perfect 
measurement (no false positives), then (\p\P\y/) = c 2 d (d\P\d) = p, but if there are false positives, then (u\P\u) = 8 where 8 
is the fraction of un-diseased (healthy) who are wrongly diagnosed as diseased. Thus, if the test is 99% accurate, 8 = 0.01. 
The question is then: given that one is diagnosed with the disease, what is the probability that one is actually diseased, 
or, put another way, how confident is one of the test result? 

The notation intentionally follows Heisenberg's matrix approach to quantum mechanics (with a nod to Dirac for the 
bra-ket notation). So, let the basis states be given by 

= w = (°) 

and the coefficients corresponding to the weighting factors are 

c « = V 1 -p. c d = Vp 

One can verify that \{\tr\d )\ 2 = p; \(\ir\u )\ 2 = 1 - p, and therefore that 

tyw) = 2>y> = 1 

where \j) corresponds to either \u) or \d). This equation is important: given that \y/) is normalized, it follows that 

= I (4.11) 


(4.8) 


(4.9) 


(4.10) 


where I is the identity matrix. Continuing, a moment's tinkering will show that 


P = 


8 0 

0 1- 5' 


(4.12) 


where 8 accounts for false positives, and 8' for false negatives (having the disease but being 
"Confidence" C is a ratio of how many people actually have the disease with how many 
and so 

, , (v\d> (d\P\v) 

(v\p\v) 


Cm 1 


told one does not). 
people are diagnosed with it 

(4.13) 


Conversely, "Doubt" D makes the substitution d -* u in C. Therefore, in light of Eq. (4.11), C + D= 100% as there is no 
other outcome. 

Consider a typical example. Suppose 5% of the population has the disease (p = 0.05) and the test for it is 99.9% 
accurate (8 = 0.001) with no false negatives (S' = 0). Direct computation shows C = 98.14%, which is fairly confident. 

On the other hand, if the inaccuracy of the test is equal to the proportion of the population that does not have the 
disease, or 8 = p = 0.001, and the false negatives are the same as the false positives (8' = 8), then one finds C = 50%, which 
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is fairly astounding: a highly accurate test is thwarted by the rareness of the disease and the deluge of false positives. In 
fact, working through the matrix multiplications, the general equation is 


P( 1 ~ <50 


p(l -<5') + <5(l-p) 


(4.14) 


with the subscript 1 indicating the number of times the test is performed. Thus, the more rare the disease (smaller p), the 
less confidence one has in a test, especially if the disease is more rare than the test is accurate, perhaps some consolation 
to those who wonder why a one-in-a-million event picked them when they get bad test results. There is still hope, at least 
until the second test (P —► P 2 in Eq. (4.13), which in the present example results in C 2 (p) = 99.9%, the subscript denoting 
the power of P). 

The example has one more interesting perturbation to consider (a subtle, but irresistible, pun). Suppose the act of 
getting the test (making the measurement) has a chance of giving one the disease. Slightly longer tinkering will convince 
oneself that this will populate P with off-diagonal matrix elements. In the parlance of quantum mechanics, this can be 
thought of as the measurement causing one basis state to change, or "scatter", to the other basis state. Another scenario 
to consider: instead of disease, suppose one is talking about disgruntled employees or terrorists in situations where they 
constitute a tiny fraction of the population: not only do those-who-test have to worry about the many false red flags 
that must be investigated, they must also consider whether their testing adds to the problem by actually creating more 
discontents (scattering). 

So much for a useful calculational method. Now what about probability itself? An ideal gas (monoatomic noninteract¬ 
ing point particles) is a very useful model on which to hone one's understanding of thermodynamic properties such as 
pressure, volume, and temperature. Some ideal gas properties anticipate those of an electron gas, but electrons, unlike 
people, are not distinguishable, and that makes an enormous difference in combinatorics. Knowing how to count particles 
is therefore necessary. Reacquaintance with probability concepts is the first step. 


4.3 Probability and entropy 


And on the pedestal these words appear: 

“My name is Ozymandias, king of kings: 

Look on my works, ye Mighty, and despair!" 

Nothing beside remains. Round the decay 
Of that colossal wreck, boundless and bare 
The lone and level sands stretch far away. 

- Percy Bysshe Shelley 5 


Early efforts to understand entropy considered the absorption of heat by gases to characterize reversible versus irre¬ 
versible processes, and find the efficiencies of engines. In the reversible Carnot cycle, the isothermal (at constant temper¬ 
ature) and adiabatic (no exchange of heat) expansions and contractions of an ideal gas between two reservoirs on a plot 
of pressure P versus volume V resembles a distorted and curved parallelogram, but on an entropy S versus temperature 
T plot, it is a pleasing rectangle: along an isothermal, entropy changes (vertical edges), and along an adiabatic, entropy 
is constant (horizontal edges). In terms of A Q, changes in entropy AS are governed by 


(4.15) 


The connection between entropy and probability, though, provides perhaps a more graspable notion of what entropy is 
about, particularly with respect to disorder, but that requires a mild detour into probability. 

A fair coin has an equal chance of landing heads (h = 1/2) or tails (t = 1/2). Thus, the probability of a coin landing 
heads is P h = h/(h + t) = 0.5. What about two coins? The probability of coin 1 being heads and coin 2 tails (i.e., the 
probability of the configuration ht where the first coin is the first position, and the second coin is the second position) is 
P ht = P h P t = 0.25. If N coins are being tossed, then the probability of the ad hoc configuration hthhtthhth is 


Phthhtthm = W = (0.5) 6 (0.5) 4 = 1/1024 


(4.16) 


5 Percy Bysshe Shelley, Ozymandias, The Norton Introduction to Literature, 2nd edn, eds C.E. Bain, J. Beaty, and J.P. Hunter. New York, 1977, p.733. 
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Extend the exercise to dice, which when thrown randomly give the numbers 1 through 6. If four distinguishable die are 
thrown, what is the probability of 1241 occurring? Generalizing the coin example, it follows 


Pi24i = n>/ J = OVVz ) 1 ^) 0 ^) 1 ^) 0 ^) 0 = 1/1296 


(4.17) 


A few points to notice: the method of calculation did not care if P, = P jr the number of times rij an outcome; occurred was 
useful in grouping the results, and rij = 0 indicates outcome; did not occur. Observe also that if the dice are distinguishable 
or their order matters, as would be the case if die 1 gave the first digit, die 2 the second, and so on, versus indistinguishable 
or when the order does not matter, as would be the case if only the sum of the faces was of interest, then different 
probabilities occur. As an example, the probability of an event that does not care about order such that face j appeared 
on Hj dice would be nU «/ ! times more likely. For the 1241 example above, the probability is therefore twice as likely 
since there are two different ways the die with a face of 1 can be placed. 

There is a very subtle concept associated with order, and it serves as a convenient introduction to the concept of 
entropy. To motivate the discussion, suppose a single die was thrown 288 times and the faces recorded. Question: which 
of the two tallies below seems more likely? 


• Case A: 

2 4 3 6 6 4 

1 6 6 2 2 2 

5 4 2 5 1 4 

6 5 4 1 3 3 

2 5 2 4 4 1 

2 6 3 1 5 2 

5 2 2 2 2 3 

6 5 4 1 2 2 

4 4 6 6 4 5 

2 3 16 16 

5 3 6 6 4 5 

• Case B: 

1 4 6 5 3 2 
4 6 3 4 1 6 
1 4 6 5 3 2 
4 6 3 4 1 6 
1 4 6 5 3 2 
4 6 3 4 1 6 
1 4 6 5 3 2 
4 6 3 4 1 6 
1 4 6 5 3 2 
4 6 3 4 1 6 
1 4 6 5 3 2 


5 1 6 4 3 3 1 

4 4 2 5 4 2 5 

6 2 1 3 4 3 1 

2 14 2 14 2 

4 5 2 5 4 1 5 

3 13 3 13 1 

2 5 5 5 3 1 5 

4 5 5 6 3 5 2 

1 6 6 3 3 6 2 

5 6 5 6 4 6 5 

2 5 2 1 3 4 2 

2 1 5 4 6 3 4 

3 5 2 1 4 6 5 

2 1 5 4 6 3 4 

3 5 2 1 4 6 5 

2 1 5 4 6 3 4 

3 5 2 1 4 6 5 

2 1 5 4 6 3 4 

3 5 2 1 4 6 5 

2 1 5 4 6 3 4 

3 5 2 1 4 6 5 

2 1 5 4 6 3 4 


112 14 11 
3 3 5 2 6 3 5 

3 3 1 1 4 2 4 

3 4 3 4 1 2 1 

2 1 2 5 5 1 4 

5 6 5 6 1 5 3 

5 3 4 4 6 2 5 

5 4 6 3 4 2 5 

6 6 4 6 1 1 4 

4 2 3 4 5 1 6 

2 2 4 2 5 

1 6 3 5 2 1 4 

3 2 2 1 5 4 6 

1 6 3 5 2 1 4 

3 2 2 1 5 4 6 

1 6 3 5 2 1 4 

3 2 2 1 5 4 6 

1 6 3 5 2 1 4 

3 2 2 1 5 4 6 

1 6 3 5 2 1 4 

3 2 2 1 5 4 6 

1 6 3 5 2 


3 16 6 15 6 

3 4 3 2 5 5 1 

2 6 6 4 3 3 6 

6 16 16 4 6 

4 3 5 4 1 5 3 

2 3 3 6 4 4 6 

5 4 6 3 1 5 6 

4 2 3 1 3 1 3 

2 2 6 3 3 6 1 

3 3 15 12 6 


6 5 3 2 2 1 5 
3 4 1 6 3 5 2 
6 5 3 2 2 1 5 
3 4 1 6 3 5 2 
6 5 3 2 2 1 5 
3 4 1 6 3 5 2 
6 5 3 2 2 1 5 
3 4 1 6 3 5 2 
6 5 3 2 2 1 5 
3 4 1 6 3 5 2 


A simple numerical criteria, like the mean of the numbers, gives 3.5 for both, which is the expected mean. A close 
eye discerns, however, that in Case B, the sequence repeats after the 18th throw, and so Case B seems to violate what 
is commonly meant by random, even if the 18 throws are random in nature. A sequence that starts to repeat after a run 
of length 18 throws contains elements that are no longer probabilistically distinct. Put another way, it would be easier 
to describe Case B than Case A, where no order is present. To describe Case A, each entry in the entire sequence has 
to be given. To describe Case B, the first 18 throws are specified followed by an instruction to repeat that sequence 16 
times. A measure of how much information is required to specify the sequence becomes a measure of entropy (Boltzmann 
followed a similar reasoning to introduce entropy into statistical mechanics [59]). Case A is therefore a plausible outcome 
of fair die throws, but Case B is not likely to occur because, in the idiom of gambling, the fix is in. 

More starkly, if all the flips in a sequence of coin tosses were heads, a very ordered arrangement would be had - it 
would be trivial to describe ("all heads") and it represents a state of zero disorder, or zero entropy. Conversely, truly 
random strings of heads and tails show great disorder and entropy would therefore be deemed large. These two notions 
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taken together lead one to expect if p is the probability of an event, that S <x £p/(p), where f{p) is a function such that 
/ s (l) = 0 but pf s (p) -*■ 0 as p -* 0. 

Joint probabilities impose another requirement. In Case A, all the throws are random, so one would intuitively expect 
that the entropy of the long sequence would be the sum of the entropies of two shorter sequences obtained by par¬ 
titioning the long sequence. In Case B, the sequence repeats and therefore looks ordered in the long view. That is, in 
Case A, the probability of the ;'th throw is independent of all others, but in Case B, the (j + 18)th throw is same as the 
;th throw. Mathematically this is expressed by saying that, for uncorrelated events as in Case A, the probably that the 
z'th and ;tb throws will take a particular value are p t and pj respectively, and their joint probability is p { pj. By compari¬ 
son, Case B is correlated such that for; = i + 18, the outcome is determined (pj = 1) and therefore the joint probability 
is different. 

Notationwise, the conditional probability is represented as P(i\f) and spoken of as the probability of outcome p { given 
that outcome pj occurred. If events are random or uncorrelated, then P(i\j) = p, Pj. The entropy definition must be cog¬ 
nizant of that: if the probabilities are independent, then the entropy should be additive. For a random sequence like 
Case A, the entropy of the whole sequence is equal to the sum of the entropies of its smaller pieces if it is subdivided. 
Conversely, consider a volume of gas which has a partition. Removing the partition should just add the entropies because 
the total box is the sum of its parts (this important consideration will be revisited with consideration of the Gibbs paradox 
below). Mathematically, the requirement of additivity demands 

mp)f,(p.p) (4.i8) 

= X p/*Cp) + x p U p) = s+ ~ s (4 - 19) 

this last demand can be shown to require f 5 (p, p) = f s (p) + f(p) by insertion into the above equation and using 
E P = E P = 1 • Th e only form of f s (p) which satisfies these requirements is f s (p) = - ln(p). That indicates that entropy S is 
defined by (known as Shannon's formula in information theory) 

N 

S = -Xp«(P z) (4.20) 

In physics, a state of thermodynamic equilibrium is one that maximizes S (in which case it is more common to append 

a factor of k B to the definition, so that S -*■ S/k B , see Eq. (5.22) below). 


EXAMPLE: What probabilities for heads p and tails q maximize the entropy of a sequence of coin tosses? 
SOLUTION: The Shannon entropy is simply S = — p In p — q In q, but q = 1 — p. Therefore, 

d p S = d p {-p In p - (1 - p) ln(l - p)} = In = 0 

or p = q = 1/2, as can also be seen graphically by plotting S(p) as a function of 0 < p < 1. 


EXAMPLE: What probabilities maximize the entropy of a sequence of die throws for which the probability of face 

j is Vp 

SOLUTION: This problem is a generalization of the previous coin example. If the die has n faces, then the require¬ 
ment is that the entropy is maximized for the condition that Pj = 1. Using the method of Lagrange multipliers 
(see Section 5.4 and ref. [60]) a constant a must be found such that 

or - In pj = a - 1, that is, all the Pj are equal, and therefore equal to 1/n by virtue of the fact they sum to 1. The 
finding confirms the intuitive understanding that all outcomes are equally probable. 
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EXAMPLE: Consider N equi-spaced Xj =jA that occur with a probability Pj, and the average of which is x g > 0. 
Find pj if the entropy S is maximized. 

SOLUTION: There are two constraints on the pf. first, they sum to 1, and second, J^jXjPj = Nx 0 . Using the method 
of Lagrange multipliers (see Section 5.4 and ref. [60]) two constants a and b must be found such that 


* Pl (s- a 'L p j- b 'L x Pi) = Q 

It is found that pj = exp (-a - 1 - bxj) and so, using e~i bA = (e M - 1) _1 , 




-X 1 - 


(4.21) 


which sets b = (l/A)ln\x 0 /(x 0 - A)] if e~ NiA «: 1. Consequently, p,- <x exp(-foq) and is an example of an MB distribu¬ 
tion. For the numerical choices N = 200, A = 4/1V, and b = 5, it is found that bx 0 = 1.0508, that is, b « 1 jx 0 . In other 
words, if N is very large, then 

Pj (X exp (-Xj/x g ) (4.22) 


an outcome reminiscent of Eq. (4.3). Observe that if x 0 is held fixed (and therefore b is fixed), then the consequences 
of increasing N are to reduce A: if x ; - is an energy, then the spacing between the energy levels consequently decreases. 


Boltzmann's equation for entropy differs subtly in appearance: his equation is referred to as the statistical definition of 
entropy [61] or entropy as the logarithm of the statistical weight [62], It relates, for example, the number of ways atoms 
or molecules can be arranged to S, or 6 

S = -k B In W((E)) (4.23) 

where (E) is the mean value of energy. Because the logarithm of the distribution function is a linear function of energy, 
then In W({E)) = (In W(E)). If the components of W are p,-, then the average of In W is the sum over the product of In p i 
with its probabilistic weight p p or simply Bq. (4.20). An example of W might be the number of combinations associated 
with putting n objects into m places, with m > n. Calculating that (as will be encountered in Eqs (4.26) or (29.56)) requires 
familiarity with combinatorics. 


4.4 Combinatorics and products of probability 


How do I love thee? Let me count the ways. 

- Elizabeth Barrett Browning 7 

Two boxes (volume) containing N particles (number) are equivalent if the distribution in energy of those particles are 
equal. If the particles all have identical characteristics (like mass), then particle a being in energy state E t and b in E 2 gives 
the same average energy as if b were in E 1 and a in E 2 . So, how particles occupy energy states is a matter of counting: for 
a gas in equilibrium, the probability that a particle has an energy Ej is the ratio of the number of states with that energy 
to the total number of states. 

Consider an analogy. An Afghan blanket is made of squares containing different patterns, as shown in Figure 4.3. 
Suppose the patterns are knitted by sequentially choosing yarns of different colors. If there are three yarns, for example 
red (r), green (g), and blue (b), the possible variations are trivial to write down as (rgb grb rbg gbr brg bgr) or, if the colors 
are labeled by numbers so rgb =123, then 

123 213 132 231 312 321 


6 The inscription on Ludwig Boltzmann's gravesite in Vienna, Austria is similar to Eq. (4.23). 

7 Elizabeth Barrett Browning, "Sonnets from the Portuguese XLM", in M.L. Rosenthal, Poetry in English: An Anthology. New York: Oxford 
University Press, 1987, p. 630. 
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Figure 4.3 Color combinatorics. Afghan blanket made using six different colors such that each square region contains three different colors. No two 
squares are identical. 


Four yarns (n = 4) take more patience, but give 

1234 2134 1324 2314 3124 3214 1243 2143 1342 2341 3142 
3241 1423 2413 1432 2431 3412 3421 4123 4213 4132 4231 
4312 4321 

For n = 5, writing the combinations without using an algorithm is tedious: 

12345 21345 13245 23145 31245 32145 12435 21435 13425 23415 31425 

32415 14235 24135 14325 24315 34125 34215 41235 42135 41325 42315 

43125 43215 12354 21354 13254 23154 31254 32154 12453 21453 13452 

23451 31452 32451 14253 24153 14352 24351 34152 34251 41253 42153 

41352 42351 43152 43251 12534 21534 13524 23514 31524 32514 12543 

21543 13542 23541 31542 32541 14523 24513 14532 24531 34512 34521 

41523 42513 41532 42531 43512 43521 15234 25134 15324 25314 35124 

35214 15243 25143 15342 25341 35142 35241 15423 25413 15432 25431 

35412 35421 45123 45213 45132 45231 45312 45321 51234 52134 51324 

52314 53124 53214 51243 52143 51342 52341 53142 53241 51423 52413 

51432 52431 53412 53421 54123 54213 54132 54231 54312 54321 

By n = 6 the task is tedious and difficult, and left to an algorithm. 


EXAMPLE: Develop an algorithm to list all permutations for n = 6. 

SOLUTION: Let C" be an n\ x n matrix containing all permutations of [1,2, ...,«], and let C n be the jth column 
of that vector. Let n be a column vector of length n - 1 with all entries equal to n. It is obvious that C 1 = [1] and it 
follows that 



(4.24) 


which generalizes to 



(4.25) 


cr l q-' cr 1 
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The n columns of C when concatenated to a single column give the permutations sought and which can then be 
sorted. As n -*■ n + 1, the number of columns increases by 1 and the number of rows by a factor of n + 1, showing 
that the number of rows grows as a factorial. An algorithm is given in Section A3.1. 


Counting the number of entries for each case, the number of permutations N p is found to be N p ( 3) = 6, N p (4) = 24 and 
N p ( 5) = 120, or N p (n) = n\. As a verbal instruction, it amounts to saying "There are n ways to pick the first yarn, (n - 1) 
ways to pick the second yarn...", and so on down to the last yarn, or n x (n - 1) x ...2 X 1 = ni. 

Now let there be more than one spool of yarn for each color, that is, let there be rij spools of yam for color;. Clearly, if 
the spools are numbered (the spools are distinguishable), the algorithm above is going to vastly overpredict the number 
of unique patterns because, for example, whether the red yarn came from spool 1 or 2 does not change the pattern. In 
other words, the yarn colors are equivalent (they function the same way), and as a consequence the number of distinct 
combinations is reduced. If P is the number of unique patterns of k colors, then the verbal description of how to calculate 
it might be "If P is the number of unique permutations using N spools of yam, then it is given by the number of possible 
permutations N p (N) divided by the number of permutations N p (n j) of color 1, and so on for all colors up to k." The 
equation is easier: 

N P (N) N , 

P= ---=-—- 4.26) 

N p (n l )N p (n 2 )...N p (n k ) \n 2 \...n k \ 

Revisit now the case of five yarns considered above and, for sake of argument, say three of them are red (n 1 = 3) and 
two are green (n 2 = 2). Then there are 5!/3!2! = 10 combinations, and they are 

11122 11212 11221 12112 12121 12211 21112 21121 21211 
22111 

so the equivalence of the yarn colors makes an enormous difference in the number of distinct patterns. 

Probabilistic reasoning can take on peculiar turns, an iconic case being when evaluating the probability that something 
occurs, it may prove more profitable to consider when it does not occur. A practical example of that will be encountered in 
the treatment of scattering in the Monte Carlo method (Section 32.2.3; see also Section Al.2.5): here, consider a slightly 
more amusing and accessible variation instead that also serves the dual purpose of providing an example of how to deal 
with factorials such as n\ when n becomes large. 

Let there be N objects that have n distinguishing variations (like color, or weight, or number of spiny ridges, or the 
contents of fields on forms labeled "DOB"). Assume IV » n and that each variation is weighted equally in the population. 
If all the objects are in a container and k of them are removed, what is the probability P n (k) that any two of the k 
objects will be of the same variation? Solving this problem straight up is challenging (try it), but because the maximum 
probability is 1, the question can be answered by asking instead, what is the probability W n (k ) that there will not be a 
match? Clearly, W n (k) = 1 - P„(k). 

The reasoning can then proceed as follows. Think of "variations" as being unique containers or slots into which objects 
are placed. With the first extracted object, there are n empty slots out of n total in which to put it, and so W n ( 1) = n/n = 1. 
The probability W n ( 2) is easy to figure out: there are (n - 1) remaining empty slots into which to put the second withdrawn 
object, which now has to be multiplied by W n { 1), that is, the total probability is the product of the probability of the first 
event with the probability of the second (they are independent or uncorrelated). Therefore, W„( 2) = \{n - 1 )/n] x W„( 1). 
Now the reason for large N becomes apparent: if N is sufficiently large, the removal of one of the objects will not have 
much of an impact on the fraction of its brethren amongst the remaining (IV- 1) objects, and so the small change in 
the distribution can be neglected. The reasoning can then proceed iteratively as removal of objects does not change the 
distributions, and so with the third selection there are (n - 2) slots, and by induction, with the Mi selection there are 
(n - k + 1) slots in which to put it so that 

W n (k) = (n ~ k+l) w n (k- 1) = n! (4.27) 

n n k (n-k)l 

where the second relation follows from mathematical induction. 

Although Eq. (4.27) solves the problem via P n (k) = 1 - W n (k), the solution is lacking because it requires repetitive 
multiplications that get cumbersome if n and k are large. Moreover, Eq. (4.27) is highly dependent on how the mul¬ 
tiplications are performed due to the size that the numerators and denominators reach when factorials are used (see 
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Figure 4.4 The 



. evaluation of W n (k) using Eq. (4.27) compared to the approximate form of Eq. (4.28) using n = 100, for which W 100 (12) » 1 /2. 


Section A3.2 for an appropriate algorithm). Better to now use Stirling's approximation of Eq. (A2.4) to find 



(4.28) 


where (n - k) p = n p \ 1 - ( k/ri)Y for a power p has been used, as in Figure 4.4. 

Matters become amusing when applied to real life. For the particular case n = 365 (the number of days in a year), the 
description (sometimes called the "birthday paradox") becomes the same as finding the number of people in a room that 
must occur so that the probability that at least two of them have the same birthday is 1/2. Using direct calculation with 
Eq. (4.27) or the algorithm of Section A3.2, one finds that k a > 22, or about 23 people, a surprising result to the intuition 
of lay people. 8 


8 Violating expectations is an astonishingly poor reason to call something a "paradox" given the accuracy of general intuition amongst lay people. 
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CHAPTER 5 


Maxwell-Boltzmann distribution 


5.1 Classical phase space 

On contentious ground, I would hurry up my rear. 

- Sun Tzu 1 

Position and velocity are of fundamental importance (not only in battles). To describe a box of particles requires 
knowledge of where they are are and how fast they are going. This takes six numbers, x,y,z, v x , v y , v z , per particle. Figure 5.1 
shows just the x coordinate of 1000 such particles for which the positions are randomly distributed. Figure 5.2 shows the 
corresponding v x velocities, which are Maxwell-Boltzmann (MB) distributed. Equivalent figures could also be generated 
(but are not) for the y and z components. If v x , v y , v z are are MB distributed, then so is |u|: 

m „ exp (-£t) , exp (-^L) exp (-^j exp (-^) (5.1) 

Understanding comes from doing, giving some impetus to see how an MB distribution can actually be generated. 
A simple and elegant method to do so makes use of the central limit theorem, which asserts that the average of a sum 
of n random numbers is normally distributed. 2 An algorithm to do so is given next 3 (contrast it with an alternate one 
utilized in Sections 33.1 and A3.33 for generating MB distributions of v x and v y simultaneously). 


EXAMPLE: Generate N = 1000 random velocities v that are MB distributed. 

SOLUTION: An MB distribution in velocity/(b) is obtained from a normal distribution 1V(0,1) oc e^! 2 by a special 
case of a more general algorithm in ref. [63]. 

Require: R is a 12 x N matrix of uniform random numbers 0 < Ry < 1. 

Require: | v) is a vector of length N. 

Generate R 

for j = 1 to N do 

°, = (££,*«)-« 

end for 

|f) corresponds to 1000 v values scaled by kv/yfl. 


The phase space representation shown in Figure 5.3 is far more useful, in which v x is plotted against x, along with 
an overlay showing small regions, or "cells", containing a smattering of particles - or none at all. By summing over all 
the cells that share a common position, a histogram of the velocities is generated in Figure 5.4. The histogram takes on 
the appearance of an MB distribution in velocity f(v ). As the number of particles becomes large and the bin size of the 
histogram small, a continuous representation of the histogram scaled by N results and corresponds to (note that/(F) is 
independent of position) 


f( v x ) dv x = 


J dxf dvydvjiv) 

f dxj dvf(v) 


(5.2) 


f(v) oc exp (—pm ( v 2 x + V 2 + d 2 )/2) 


(5.3) 


1 Sun Tzu on The Art of War (trans. Lionel Giles), Project Gutenberg Etext, Entry 47, p. 1682 of 2062. 

2 The designations normal distribution, Gaussian distribution, and Bell curve all designate mathematical creatures that have the same appear¬ 
ance. The MB distribution for E x = mv 2 /2 presently under consideration is a special case with (v x ) = 0. 

3 The algorithm is based on Algorithm N5 of ref. [63] but with n = o 2 = 12, choices which make the procedure extraordinarily easy to write and 
implement. 


Introduction to the Physics of Electron Emission, First Edition. Kevin L. Jensen. 
© 2018 John Wiley & Sons Ltd. Published 2018 by John Wiley & Sons Ltd. 
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0 200 400 600 800 1000 

Particle Label 

Figure 5.1 The x positions of 1000 particles labeled by particle number. The positions have been randomly chosen. 



Figure 5.2 The v velocities of 1000 particles labeled by particle number. The velocities have been chosen according to an MB distribution. 



Figure 5.3 The v x velocities of 1000 particles plotted versus their positions. Based on Figures 5.1 and 5.2. The grid overlay serves to suggest "cells" in 
which a particle (and sometimes many particles) finds itself: each cell is characterized by an energy e j where j refers to the cell, if the cells are made 
sufficiently small. 
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Figure 5.4 A histogram of the v x data summed over all cells with a given v x coordinate. In the large N limit, it would resemble Nf(v x )dv x of Eq. (5.2). 

All this talk of "velocities", though, is a bit old-school for three reasons. First, speaking so is the language of 
classical ideal gases, in which the particles are big, and although their speeds are respectable (e.g., a neon atom 
at room temperature travels about 600 m/s « 2 x 10 _6 c), the particles of greatest interest here are electrons that 
have velocities thousands of times greater in metals (e.g., 0.005 c for an electron at the Fermi level of copper) and 
have wave-like properties that matter. Second, if those electrons are accelerated to large energies, as is common in 
particle accelerators, a phase space in which v 2 -* c 2 is dreary, and adopting the language friendly to the beam physics 
studies of Chapter 33 is handy for the transition. Third, the conjugate coordinate to position is momentum hk (see 
Section 8.2.2). Best to just get used to it. Nevertheless, insofar as intuition can be gleaned from comparisons to ideal 
gases, speaking of velocity a bit longer will be occasionally useful; that usage, though, will be dropped at the earliest 
opportunity. 

5.2 Most probable distribution 

A probabilistic inference is a prediction today based on frequencies gathered yesterday. But that was then; this is now. How do you know 
that the world hasn't changed in the interim? 

- Steven Pinker 4 

Counting may be hard, but counting with constraints is far more difficult. Working up from a simple problem to 
more complex arrangements is therefore good practice. But the simple problems had best be "timeless" in the sense that 
the conditions for how they unfold should not have changed from one day to the next: the probability that a person 
will do something is affected by something as trivial as weather and as consequential as mortality. Thus, while even 
faceless people are more interesting for understanding MB statistics, generic particles in a box are better for understanding 
probability. 

Suppose that N particles are in said box and in thermal equilibrium, characterized by an average energy (E) such 
that the total energy N(E) = U (the designation of the total energy of the box by U is by convention and precedent 
[62, 64, 65], not because it makes sense). To be clear, the box, which constitutes a "system", can have many possible 
energy states Ej, which are a consequence of a particular packing of the phase space cells. It is therefore useful to refer to 
the energy of a particle (rather than a state) by a small-case e ; - that is interpreted as the energy of the particle if it resides 
in the jth cell. Clearly, some of the e ; - can be equal, even if they reside in different cells: in Figure 5.3, for example, all the 
cells with the same vertical coordinate have the same energy because f(v x ) is independent of x. 

The question is, what is the most probable arrangement of the {« ; }? Simple counting says there are N\ ways to choose 
the order in which the particles are assigned to the cells (i.e., there are N ways to chose the first particle, (N - 1) ways 
the second, and so on), but if the order of the particles in each cell does not matter, then the number of arrangements 


‘Steven Pinker, How the Mind Works. New York: Norton, 1997, p. 351. 
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associated with the occupation numbers {«, } is instead Nl/(n l \n 2 !...). Overlooking the importance of this factor is unwise: 
when encountering Bose-Einstein (Section 6.1) and Fermi-Dirac (Section 6.2) statistics, it is this factor that changes. That 
is pleasingly ironic, in that even particles in a box are not quite as timeless as one would like. 

Now impose the constraints n j = N and tijEj = U using the method of Lagrange multipliers [60], which allows 
for the maximization of the arrangements subject to constraints on the occupation numbers {w ; } and associated energies 
{ tij£j} (recall the treatment of Eq. (4.21)). For ease, observe that maximizing the arrangement is equivalent to maximizing 
the log of the arrangement, and so the later shall be done. Thus, letting A be the multiplier for £ tij and B for £ n j £ j> 
then 


In 



-A^n j -Bj j n j E j = 0 


(5.4) 


The minimization shall occur with respect to the ttj, therefore taking the derivative d/drij of Eq. (5.4) and setting the 
result to 0 gives 

— In [rijl\—A — Bej = 0 (5.5) 


Invoking Stirling's approximation of Eq. (A2.4) for n\ = riT(n), then d n In («!) « In (n) + (1/2 ri), which approaches In (n) 
when n is large. Although tij can be small, and as suggested by Figure 5.3 many of them can even be 0, it is reasonable 
to conclude that some of the tij will be large if N » 1. Thus, the most probable arrangement is characterized by 


tij « exp (-A - Be/) = A exp (-Bej) (5.6) 

otherwise known as the Maxwell-Boltzmann law. For e ; - = mv 2 / 2 the MB distribution of Figure 5.4 and Eq. (5.2) results. 

Since the ratio U/N = £ e ; « ; / 2 n j is seen to depend only on B, the conclusion is that B = B{J) -» p. Lastly, if Z is 

defined by 

Z= £exp(-/?e,.) (5.7) 

then A = N/Z. The quantity Z is the "partition function," and is the sum over all energies that a particle has access to, so 
Pj = Z -1 exp(-p£j) is the probability that the particle has an energy e ; -. Importantly, 

H = z -2^ = -z-'a,z = -a,(inz, (5 . 8) 

which was foreshadowed by Eq. (5.2). If the box is not subject to any forces, then the energy is the kinetic energy 
mv 2 /2 = (hk) 2 /2m, and the summation can be converted into a phase space integration over k. Thus, with e = Ti 2 k 2 /2m, 

(5 ' 9> 


Consider the simpler problem of integrating a Gaussian e axl where a is a constant. Letting x -*■ r cos 6 and y -»■ r sin 9, 
it is seen that 


(£> 


(5.10) 

Thus, 

J e- ax 2 dx= ^ 

(5.11) 



(5.12) 

Applied to Eq. (5.9), U/N = -d jS (In (^ _3/2 ))+ constant = 3/2 p, or 



U= | Nk B T 

(5.13) 
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Thus, the average energy in each cartesian direction is U x = U y = U z = k B T/2, a restatement of the principle of equipartition 
of energy. 

An important conclusion is thus revealed: for an ideal gas, the average energy U/N depends only on T. Equipartition entails 
that k B T/2 attaches to each degree of freedom enjoyed by a particle. As a well-known example, an atom vibrating in a 
solid can be treated as a harmonic oscillator with three kinetic energy components (e.g., one such being mu 2 /2) and three 
potential energy components (e.g., one such being k> e 2 /2, where k is the oscillator constant). That suggests U = 3Nk B T, 
for which the heat capacity for one mole (N = N A ) is 3 k B N A = 3 R = 0.2585 meV/mole-K= 24.94J/mole-K, where R is the 
gas constant. The relation, experimentally confirmed in the early 1800s, is known as the Dulong-Petit law, as evidenced 
in the high-temperature thermal conductivity of metals such as copper and shown in Figure 3.4. 

If the probability of particle a having an energy e l is p x , and an independent (non-interacting) particle b having 
an energy e 2 is p 2 , then the probability for both possessing those energies is P ah = p a (£f)p h (£ 2 ) = exp (-/?(e, + e 2 ))/Z 2 , 
and generalizing to N particles. 


f N ex P y-P X £ jJ 


(5.14) 


By labeling the particles, they 


for the moment - being treated as distinguishable. The factor Z N is given by 


Z N = ( X ex P (-Ptf) 


(5.15) 


If the particles were ^distinguishable, then Z N -* Z n /N\. In what appears to be a wondrously bad idea, the power of that 
sum can be expanded by generalizing the binomial theorem relating the sum of two quantities u and w raised to a power 
n, given by 


(u + wf 


X 


X 

i k+j=n) 


(5.16) 


where the {...} term at the foot of the summation in the last representation is a restriction on the ks and js that they must 
be jointly considered: they have to sum to n. So, the expansion of Z N is 


z N = X 



(5.17) 


The coefficient should look delightfully familiar from the earlier combinatorics exercises. Still, that is one big equation, 
hard to write compared to its predecessor, and just about impossible to calculate. Why writing it is a good idea is borne 
of experience not yet but invariably encountered. 


5.3 Energy and entropy 


I was moving/toward something/1 thought 1 understood 
There was an order/that was clear to me/a lawful order 
Then... connections/were broken/we were sent/in a thousand directions. 

- Joseph Chaikin and Sam Shepard 5 

The energy of the system can be changed by changing the cell size (e.g., by squeezing the container), changing the 
distribution of particles (e.g., by having them absorb energy), or changing the number of particles (e.g., by tossing more 
in). Whatever the origin of the thousand directions of change, they are represented by 

dU n j d£ j + X e i dn i (548) 


5 Joseph Chaikin and Sam Shepard, The War in Heaven (Angel's Monologue), Performing Arts Journal, 9 (2/3), 1985, pp. 249-262. http://www 
jstor.org/stable/3245529, p. 251. 
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Changing the cell size is the application of a force to the box. A force F ex applied to the box face of area A that causes it 
to move a distance dL has changed the energy by -F ex dL = —{F ex /A)AdL = -PdV. That is, changes in cell size are classically 
associated with -PdV. 

Changing the distribution {rij} of particles having energy e ; - is related to how much heat dQ is added, and that heat 
changes the entropy 5 by dS = dQ/T. For fixed particle number, dU can be written 

dU = —PdV + TdS (5.19) 

When N is fixed, the independent variables of the free energy F = U-TS are the volume V and temperature T, 

from which the pressure P and entropy 5 are defined by 

P = —(d v F) T ; S = -(d T F) v (5.20) 

Its relation to the partition function Z is 

F =-k B T \n(Z N ) (5.21) 

from which the thermodynamic quantities can be specified. If the particles interact, then E(x, k) contains interaction 
terms, making the partition function for N particles more involved than the form Z N arising from a non-interacting ideal 
gas model, but such a complication need not concern the present discussion. Recalling the Shannon form of entropy in 
Eq. (4.20) 

S=-k B J j p i \n(p i ) (5.22) 

where k B is Boltzmann's constant, shows that /? = 1 /k B T. 

Particles entering and leaving the box are characterized by an average particle number (N) = f° r which the 

Gibbs function G = U + PV - TS is used in place of the free energy F. A modification to Eq. (5.19) to 

dU = TdS - PdV + pdN = TdS - PdV + p'^ j dn j (5.23) 

results, and requires a third Lagrange multiplier in the generalization of Eq. (5.4) that results in e ; - -* £j - p in the proba¬ 
bility pj and the partition function Z N . The quantity z = involving the chemical potential p is often referred to as the 
"fugacity" [64-66], 

The consideration of non-constant N allows the requirement £ m ; - = N to be relaxed in Eq. (5.17), leading to an enor¬ 
mous simplification: a summation that was just shy of impossible has become almost enjoyable. The MB distribution will 
be considered first, with the Bose-Einstein and (more to the point for electrons) the Fermi-Dirac distributions treated 
subsequently. 


5.4 The Gibbs paradox 


He who confronts the paradoxical exposes himself to reality. 


- Friedrich DurrenmaU 6 


Individuality amongst particles gives rise to Gibb's paradox. Consider an ideal gas (a box of non-interacting sin¬ 
gle species particles) separated into two regions of volume V x and V 2 containing N x and N 2 particles, respectively; it 
does not change the argument to assume that the volumes are equal (V t = V 2 = V) and contain the same number of 
particles (N x = N 2 = N), but that assumption does make the exposition of the paradox unfold more rapidly. If the parti¬ 
tion is removed, then the only real change is that, compared to one of the regions, V -*■ 2V and N -*■ 2N, and the entropy 
S should be additive so that S 25. 

If the particles are distinguishable, are these expectations realized? The surprising answer is "No": removing the par¬ 
tition makes a difference where there should be no difference. This is seen as follows, where the partition function Z is 
used because the particle number is fixed (the grand partition function being employed when particle number can vary). 
The subscript "d" on F d is to emphasize that the particles are considered distinguishable. It is convenient to introduce the 


6 Friedrich Diirrenmatt, The Physicists (trans. James Kirkup). New York: Grove Press, 1964, Appendix: "21 Points to The Physicists"-. Point 20. 
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"thermal wavelength" defined by 



and to become reacquainted with how pressure and entropy are evaluated. 


(5.24) 


EXAMPLE: Calculate F d for a gas of IV » 1 distinguishable particles. 

SOLUTION: If N is large, the energy levels are finely spaced, and the summations can be approximated using 
integrals. Thus, using the prescription of Eq. (4.6), 

Z ” = (2 

For an ideal gas, E = £'=] h 2 k?/2m and is independent of x so that 

■ dk N e 



(*)' 

(5.25) 

from which F d = —Nk B T \n {V/P T ). 




EXAMPLE: Using Eq. (5.25), calculate the pressure P. 

SOLUTION: Using Eqs (5.20) and (5.21), direct differentiation shows that 


(dFA Nk B T 

V dv ) T v 


and the result is seen to be a variation of the ideal gas equation PV = Nk B T. 


(5.26) 


EXAMPLE: Using Eq. (5.25), calculate the entropy S. 

SOLUTION: Using Eqs (5.20) and (5.21), direct differentiation shows that 



(5.27) 


The effect of removing the partition between regions of equal size and particle number is to let V -*■ 2V and N -* 
2N. Does this entail S -*■ 2S as would be required if entropy is additive? By inspection, the answer is "no" because S is 
augmented by an extra factor Nk B In(2) that arises because ln(2V) = ln(V) + In(2). 

If the particles are ^distinguishable, then Zj Z^/1V! (the subscript "in" denoting "indistinguishable") because the 
ordering of the N particles is no longer unique, and the effect of N\ makes a subtle change in Z m and S, but not P. 


EXAMPLE: Calculate F in for a gas of N » 1 indistinguishable particles. 

SOLUTION: The procedure is as before, but using N\ « (N/e) N y/2nN (Stirling's approximation). 


Z^ 

N\ 


I ***”•**)' 


/ 1 ( eV 

y/2^N\N2 \ 


from which F in « -Nk B T In (eV/NA*) if N » ln(JV). 


(5.28) 
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EXAMPLE: Using Eq. (5.28), calculate the entropy 5. 
SOLUTION: Direct differentiation shows that 



where the factor 5/2 = ln(e) + 3/2. 


(5.29) 


Observe that U = F+TS= (3/2 )Nk B T as before. Now it is clear that performing V -*• 2V and N -*■ 2N results in 
S -*■ 25: the problematic factor of Nk B In (2) does not arise because in the logarithm, the ratio V/N is found, which 
does not change when volume and particle number scale together (the density is constant). Consequently, there 
is no paradox: the removal of the partition adds the entropies of the two compartments, as intuitively it should. 
The same findings result if the compartments are unequal in size and particle number, so long as the density of 
the gas in each is the same. The factor of N\ in Eq. (5.28) goes by the name of correct Boltzmann counting and is 
required if the Bose-Einstein and Fermi-Dirac statistics are to merge into the MB distribution in the appropriate 
limits. 

5.5 Ideal Gas in a potential gradient 


Sors salutis/et virtutis/michi nunc contraria, 
est affectus/et defectus,/semper in angaria. 

- "O Fortuna", Carmina Burana 1 


The MB distribution for non-interacting particles has so far been considered under field-free conditions, so that E(x, k) = 
n k e k , but that is not how life works: the pressing down of incessant forces is the fate of all. Let the potential energy V(z) 
vary with the z coordinate such that V{z) = -mgz where m is the mass of the gas atom and g = 9.8 m/s 2 is the acceleration 
due to gravity at the Earth's surface. How that modification results in the variation of pressure with height is known as 
the barometric formula. 

The energy e of an atom of gas is now e = Ti 2 k 2 /lm - mgz so that Z in Eq. (5.25) becomes 

Z = 2“ 3 f dxe~ Pm3Z = —— (1 - e-*"*) (5.30) 

J mgk^. 


where A = J dxdy and h is the height, 
recognition that dV = Adh, it follows 


V = Ah. From Eq. (5.21), the relation for pressure of P = ~(d v P) T , and the 



If P -*■ P 0 = p 0 k B T as h -* 0, where p 0 = N/V is the density of the gas at h -* 0 (say, sea level or some equally convenient 
reference point), and if the exponential argument is small such that 1 - e~^ m9h « pmgh, then 


(5.32) 


although the relation can obtained by simpler means by considering the change in pressure dP associated with a thin 
layer of gas -pg dh and the relation PP= p from the ideal gas law. The point is to obtain a familiar result from a proper 
analysis. 


7 "Fate is against me, in health and virtue, driven on and weighted down, always enslaved." https://en.wikipedia.org/wiki/O_Fortuna. 
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EXAMPLE: Find the pressure difference between the top and bottom of the Grand Canyon, assuming the depth is 
1.8 km and the atmosphere is nitrogen, N 2 = 28.135 g/mole. Assume T = 300 K at the bottom. 

SOLUTION: The exponential factor pmgh is 

mah ( 28 - 135 dk) ( 9 - 8 ?0 (1800 m) 

= V -moieM- sj - = o 19897 

k B T f 8.6173 X 10 -5 y) (300 K) 

Thus, P/P 0 = 0.19897/(e 0 19897 - 1) = 0.90 3 8, compared to Eq. (5.32), which gives 0.8196. 


The assumed constancy of the temperature in the previous example is not strictly true: an ideal gas expands adiabati- 
cally as its altitude increases such that PV Y = constant, where y x, 5/2 for a diatomic gas [66]. Coupled with PV = Nk B T, 
a crude estimate is found to be T(h)/T 0 « (P(h)/P 0 ) 2 / 7 , or 8.54 K lower at the top of the Grand Canyon compared to the 
bottom. 


5.6 The grand partition function 


The one is made up of all things, and all things issue from the one. 

- Bertrand Russell 8 

The grand partition function Z, obtained from summing over all particles in Z n /N\, is defined by 

, exp \fSnj (p - ty)] 




and it contains all the thermodynamic information possible about MB (as well as Fermi-Dirac and Bose-Einstein) gases. 
Observe that the consequence of dropping the requirement that the sum over all has been to allow each nj to appear 
in its own summation. By knowing that e x = all of these sums can be performed and it follows that the grand 

potential £2 is 

Q ( T, V,jt) = ~k B T In Z = ~ £ exp [fi (g - £; )] (5.34) 


where the summation follows from taking the logarithm of Z■ The grand potential £2 is so named because it is the basis 
for the calculation of the thermodynamic quantities of pressure P, density p = N/V, and entropy S via the relations 
[62, 65] 


P = -(<)Q/dV) Tlt 

(5.35) 

N = —(d£2 /dp) T y 

(5.36) 

S = ~{dQ./dT) Vli 

(5.37) 


where the outside subscript indicates the terms held fixed (compare Eq. (5.20)). In the continuum limit, the (number) 
density p MD becomes (compare Eq. (4.6)) 

Pmd = (2*0“ 3 J dkf UD (E k ) = (2tt)- 3 j dke**-**> (5.38) 


s Bertrand Russell, A History of Western Philosophy. New York: Simon and Schuster, 1945, p. 44, Chapter IV: Heraclitus. The translation is the same 
as that of John Bumet, Early Greek Philosophy, 3rd edn. London: A.&C. Black, fragment 59, 1920. The Latin translation of the phrase is similar 
to E pluribus unum, the motto on the Great Seal of the USA. 
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EXAMPLE: Calculate the chemical potential p of an MB gas (neon) at standard temperature and pressure (STP). 
SOLUTION: First express the summation as an integral over phase space: 

£{...} ^(2^)“ 3 4 nV {...}tfdk (5.39) 

where f dx=V and the integration over the angles of k give An. It is seen that the remaining integral is a variant of 
Eq. (5.12), but half as large due to the lower limit of integration on k. Performing the integration, it is found 


Q 


P \2nh 2 p) pX\ 


(5.40) 


where the thermal wavelength X T = (2 nh 2 p/M) 1 ^ 2 is used (Eq. (5.24)). The mass M of neon is 20.1797 g/mole, and 
standard temperature is 298.15 K (P = 38.922 eV' 1 ), so that X T = 0.0225 nm, or about 0.425a,,. With P = 101325 Pa 
= 6.3242 eV nm” 3 , it is found that p = p~ l In (pPX\) = -0.3876 eV. 


Observe that the introduced length scale of the thermal wavelength X T is a measure of when differences in statistics 
become important. If the average inter-particle spacing is shorter than X T , then quantum effects on statistics matter. 
For neon, p = pp = e^/2 3 = L -3 , where L = 152.72 r , so that clearly MB statistics are adequate for the ideal gas example 
considered, and the usage of quantum statistics is overkill for gases such as neon. 


— 

EXAMPLE: Calculate the entropy S of an MB gas (neon) at standard temperature and pressure (STP). 

SOLUTION: Taking the derivative of £2 with respect to T and expressing in terms of the chemical potential p, it is 
found that 

S=h B pPV(5-2pp) (5.41) 

or, using numbers from the prior example along with pp = -15.086, it is found that S/V = 0.4329 k B . 


Eq. (5.41), which goes by the name of the Sakur-Tetrode equation, is the expression for entropy S (albeit in a different 
notation) that informed the derivation for the vapor pressure model of thermionic emission explored by Dushman [67] 
and recounted in Section 5.7. 

However, for an average metal like copper where the electron number density is comparable to the atomic density, the 
thermal wavelength is about 4.32 nm, compared to an atomic diameter of copper of 0.256 nm. For electrons in a metal, 
statistics matter with a vengeance. 


5.7 A nascent model of electron emission 


We may then consider the evaporation of electrons from a metal as thermodynamically equivalent to the evaporation of a monatomic gas, 
for the electrons in the space necessarily have the same specific heat as the molecules of a monatomic gas. 

- Saul Dushman [67] 

The efforts to understand statistical methods and the MB distribution enable an understanding of first formulation of 
thermionic emission, in which electrons were modeled as evaporating off the surface of a metal. In contrast to modern 
derivations of the Richardson-Laue-Dushman (RLD) equation that begin by considering the tail of a Fermi-Dirac distri¬ 
bution, Richardson's original derivation applied a Carnot cycle to the electron gas in equilibrium with a metal. Dushman 
[67] reconsidered Richardson's equation and his expression for the coefficient A RLD agrees with more modern treatments 
apart from a factor of 2 associated with electron spin (a purely quantum mechanical feature of the electron revealed 
by the Stern-Gerlach experiment in 1922) that Fowler [68] argued was required. It is a pleasingly different method of 
determining current density from thermionic emission using thermodynamic arguments without invoking the quantum 
mechanical ideas that form the backbone of most modern treatments. 
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Current density J, as in Eq. (3.10) and the expression for (v + ) in Section 3.2.2, is the product of charge, density, and 
velocity. For a gas, the density N/V = P/k B T, and since only half of them move in the direction of the surface, it follows 


2\V 


)« = - 


(5.42) 


The pressure of the electron gas at the surface of a metal is, from thermodynamics, analogous to a transition wherein 
a liquid becomes a gas: a quantity of heat called the latent heat of transformation (L) induces the change. As known 
from Eq. (4.15), this means AS = L/T. But when a gas and a liquid are in equilibrium, the change in the Gibbs energy 
of the gas dG g is matched by an equal change in the Gibbs energy of the liquid dG lf or, as in the discussion surrounding 
Eq. (5.23), 

d(U -TS+ PV) gas = d(U -TS+ PV) Uquid (5.43) 


which, when coupled with Eq. (5.23) itself, shows that 

-S g dT + V g dP = -S,dT + = g 

and as a result it follows that 

L _ dP 
TAV ~ dT 


(5.44) 

(5.45) 


Gases are far less dense than liquids, suggesting that V g » V). From Ay =V g -V l &V g = Nk B T/P from the ideal gas 
equation, the starting equation of Dushman's analysis (Eq. 1 of ref. [67]) is obtained, and is 


L _ dln(P) 
Nk B T 2 ~ dT 


(5.46) 


a relation known as the Clausius-Clapeyron equation, and the starting point for early derivations of the thermionic emis¬ 
sion equation (e.g.. Appendix A in ref. [69]). Dushman proceeds by considering L to be the sum of a work function term 
L 0 = ®/N a and terms dependent on the specific heat at constant pressure C p for the electron gas and for the electron liq¬ 
uid c p (as peculiar to modern sensibilities as that sounds). A more economical approach makes use of the Sackur-Tetrode 
equation (Eq. (5.41)) for the entropy of an ideal gas obeying MB statistics, from which 


d In (P) _ l / 2 n\ 

dT 2T \ k B T J 

In (P) = - - In (T) - y— + constant 
2 k B T 


(5.47) 

(5.48) 



Figure 5.5 Thermionic data of Dushman. i 
tungsten on an RLD plot specified by In (J^p/T 2 ) vs 1 /k B T. The slope of the line y = A + Bx gives the work function. The data therefore suggests 
= —B = 4.473 eV. 
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just as Dushman found. Coupled with Eq. (5.42), it has been established that J <xT 2 exp {n/k B T), where p in this analysis 
is equivalent to the depth of the well the electrons find themselves within inside the metal, that is, the negative of the 
work function 

The constants of integration following Eq. (5.47) are difficult to intuit from the analysis, and it is easier to bring the 
full power of statistical mechanics to bear, beginning with the grand potential £2 of Eq. (5.34), in the form of Eq. (5.40), 
from which P is 

k T 

P = -dykl = (5.49) 

a t 

Using the definition of X T from Eq. (5.24), switching over to the work function <t> notation, and inserting by hand a 
coefficient of 2 to account for electron spin as Fowler did gives 

Jsld = J^( k BT) 2 exp {-Q>/k B T) (5.50) 

The performance of Eq. (5.50) was quite good: data given by Dushman (Table n in ref. [67] and reproduced in Table 
A3.1) was remarkably linear on a plot of In (Jjud/T 2 ) vs 1 /k B T, as shown in Figure 5.5. The least squares methods of 
Section A3.3 give the best fit parameters which can then be used to infer work function. Curiously, the fit to the data 
suggests C 1 = 61.375 amp/(cm 2 K 2 ), which tends to obscure the contribution of electron spin of 2 given that A RID = 
120.1735 amp/(cm 2 K 2 ) (Table 2.3). A more modern perspective, faced with e A < A RJD , often attributes the discrepancy 
to crystal face variations and work function non-uniformity, more of which will be spoken of in subsequent sections. 

Such is the celebrated Richardson-Laue-Dushman equation. Now begins the difficult task of achieving the same 
equation using the Sommerfeld model of a metal, in which the electron distribution function is not MB, as in the Drude 
theory, but rooted in the same Fermi-Dirac statistics required to understand field emission. 
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CHAPTER 6 

Quantum distributions 


Statistical mechanics, together with quantum mechanics, provides a foundation for modern physics which aims at the thorough 
understanding of physical phenomena from the microscopic viewpoint of atomic physics. Fundamental knowledge and training in 
statistical mechanics are therefore of vital importance not only for students studying the physical properties of matter but also for those 
who study nuclear physics or even astrophysics... 

- Ryogo Kubo 1 

Quantum particles are indistinguishable and interchangeable, and that feature means that with regards to the 
combinatorics, the order the particles are put into the energy states does not matter, and the arrangement within 
the energy state does not matter. Whereas the (generally large) combinatoric factor of N\/n y \n 2 \... arose when order 
mattered and particles were distinct, now each arrangement counts as one unique state, and such a factor is no longer 
included. This applies for Bose-Einstein (BE) as well as Fermi-Dirac (FD) statistics. Blow that factor plays out makes all 
the difference for how the distributions behave. 

6.1 Bose-Einstein distribution 

Brian: You 're all individuals! 

Crowd: Yes, we're all individuals! 

Brian: You 're all different! 

Crowd: Yes, we are all different! 

Homogenous man: I’m not. 

- Life of Brian 2 

BE statistics are similar to Maxwell-Boltzmann (MB) in that the range of each tij is unlimited. Therefore, the Grand 
Partition Function becomes m ^ 

z be =n z exp \.p n j - e /)] ( 6 - 1 ) 

but that is an easy sum: making use of Eq. (A1.4) with x n -*■ 0 as n -> oo for x < 1, it is found 

- ln(Z BJ5 ) = X In (1 - (6.2) 

7=0 

and so the occupation factors (tij) are given by 

(N) = -d H (\nZ) T V = 2>,> = X ^ ( 6 - 3 ) 

1 i= o i =o 

Observe that the quantity (tij) is the mean number of particles in a state characterized by an energy e ; -: in the continuum 
limit £j -> E k , it becomes the BE distribution f BE (E k ), so that 

p BE = (2n)~ 3 J dkf BE (E k ) = (2zr) —3 J dk(e KE ^ - l) _1 (6.4) 

The form of E k depends on the boson under consideration. For bosons, p is always negative, making the term e~^ 
potentially large. 

A common claim is that the high temperature limit of the BE distribution becomes the MB distribution. A more 
nuanced account is that at high temperature, higher energy levels can be occupied until all the occupation probabilities 


' From the Preface to the Japanese edition of ref. [61], p. vi. 

2 Monty Python (Graham Chapman, John Cleese, Terry Gilliam, Eric Idle, Terry Jones, and Michael Palin), screenplay to Life of Brian, 1979. 


Introduction to the Physics of Electron Emission, First Edition. Kevin L. Jensen. 
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(rtj) become small (< 1). When that happens, » i ; and the MB distribution is recovered. In contrast, when the 

temperature is very low, (nj) can be quite large, and BE condensation occurs [65], meaning a large crowd of bosons with 
the same quantum numbers can collect together in the ground state. 


6.2 Fermi-Dirac distribution 


...to get agreement with experiment one must assume that two electrons are never in the same state. This rule is known as Pauli's 
exclusion principle. It shows us that electrons are fermions. 

- Paul A.M. Dirac [40], p. 211 


FD statistics differ from MB and BE in that the range of each tij is constrained to be either 0 or 1: fermions cannot share 
the same quantum numbers (of which momentum states are one). Therefore, the Grand Partition Function becomes 


Zfd = n Z ex P [ pn i (l* - e /)l 


(6.5) 


but that is an even easier sum than the BE case, and so 

-ln(Z OT )=|]ln(l + e^>) 


(6.6) 


for which the occupation factors (rtj) are given by 

(N)=df J {e p(£ r» > + l)~ t (6.7) 

i =o 

where a factor g = 2 has been inserted by hand to account for two electrons of opposite spin being allowed in the same 
momentum state. In the continuum limit, this becomes 

Pfd = 9 ( 2 * r 3 J dkf FD (e k ) = g(2*r 3 f dk(e l>{t ^ + l)" 1 (6.8) 

For electrons, e = Ti 2 k 2 /2m. Densities typical of metals entail that the chemical potential g can become positive: for most 
metals, it is in the range of 2 to 11 eV, with copper in the middle (/< [Cu] « 7 eV). 

The relation between number density p = N/V and chemical potential g is particularly important for electrons as it 
explains why charged particles packed extremely tightly can nevertheless be treated as "non-interacting" in the Som- 
merfeld model of nearly free electrons in a metal. Recall that for an MB gas, p MB = exp (g)/2? T , as in Eq. (5.38). For the 
densities of typical gasses, the thermal wavelength X T is far smaller than the length scale of the interparticle separation 
(l/p M8 ) ] / 3 (a requirement if quantum mechanical effects are to be neglected) and the average energy (3/2 )k B T is in excess 
of the interaction energy between the gas atoms, which for neutral atoms is very small as per the Lenard-Jones potential 
(a requirement if the gas is to be treated as an ideal gas). The tight packing of electrons in a metal belies these "ideality" 
conditions, so why can they be treated as "free" (or nearly free)? The answer will be seen to be closely related to a key 
feature of fermions: from the Pauli Exclusion Principle, they do not share quantum numbers. 


6.3 The Riemann zeta function 

Now (1) the infinite qua infinite is unknowable, so that what is infinite in multitude or size is unknowable in quantity, and what is 
infinite in variety of kind is unknowable in quality. But the principles in question are infinite both in multitude and in kind. Therefore 
it is impossible to know things which are composed of them; for it is when we know the nature and quantity of its components that we 
suppose we know a complex. 

- Aristotle 3 


.. .the more I give to thee/The more I have, for both are infinite. 


- William Shakespeare 4 


3 R.P. Hardie and R.K. Gaye, Aristotle Physics Book I (Physica), Great Books of the Western World, ed. Robert Maynard Hutchins, Vol. 8, Aristotle I. 
Chicago: Encyclopaedia Britannica, 1952, p. 262. 

4 Ref. [37]: Romeo and Juliet, Il.ii. 134—35, p. 870. 
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Pondering infinity has consumed philosophers since antiquity. The f function series that appear to be infinite but are 
not were characterized by Euler and put to use since then, as in quantum field theory [70] where Eq. (6.12) below is part 
of the renormalization of the effective action. While a source of amusement to mathematicians, such musings are the 
bane of the innumerate. The Greek philosophers struggled mightily, as the above quote by Aristotle in his disquisitions on 
Zeno's paradox makes plain. Zeno argued that an infinite number of bisections were necessary to close the gap between 
two spatially separated objects in a finite time, and therefore the concept of infinity was problematic [71], an argument 
lampooned in the joke opening Chapter 1. 

Why does any of that matter here? The form of the FD (and BE) distribution insures that Riemann's zeta function f (p) 
appears with relentless regularity in the calculation of quantities such as the electron density p in its relationship to the 
chemical potential p. f (p) is a remarkable function, and some familiarity with its calculation is necessary. The function is 
defined by the series relations (Eq. (A2.8)) with the form of interest here being 

c (p) = 73^ £ (-i r l r = £r p (6.9) 

where the second summation is the more common definition of f(p). The series for f(p) do not converge as quickly as 
the series agitating Zeno, but familiar expressions result from them all the same. For example, with p -* 1 

£ f-iy 11 / 1 = ln(2) (6.10) 


is well known because of the Taylor expansion of ln(l + x) as x -* 1. It is the second series of Eq. (6.9) that is the more 
disquieting because of consequences resulting from the fact that the series is infinite. To see why that matters, observe 
the integral definition of f(p) with p -*■ 0 in Eq. (A2.7): the integral becomes logarithmic-infinite at the same time that 
T{p) becomes infinite. Those infinities can be handled by isolating the divergent parts of the integrals via 


1 f°° dx _ 1_ fo 2 jci-P(e»+l) 

Tip) J 0 x l -”(e x +l) ~ 2 f o °°xr- l e-*dx 


(6.11) 


where the integral definition of Tip) is used. Now, as p -*■ 0 the numerator integral is finite on the right-hand side, whereas 
the denominator retains its logarithmic-infinite nature, and so f(0) = —1/2, the minus being from (1 — 2 l ~ p ) -* —1. This 
is unremarkable until the series definition is considered, resulting in 


£l = l+C(0)=i 


(6.12) 


a relation found by Euler. Such relations do not sit well with those who treat oo as just another number, or view infinite 
sums as a limiting process. Their logic may be akin to the following. Consider the easier series (Eq. (A1.4)) 

l-x N 
1 -x 


as N increases. As x -*■ 1 the ratio is evaluated using L'Hopital's rule /(x) /gix) f (x„)//(x 0 ) as x -> x 0 if x : is a zero of / and 
g, and primes denote derivatives with respect to argument. As a limiting process, therefore, the sum is N, and therefore 
approaches +oo as N does. 

Not so fast. 

First, sums such as J^°l 0 x j = S(x) are different than finite sums. For one thing, it is evident that adding (or subtracting) 
1 to x5(x) is still 5(x), or 

1 + x5(x) = 5(x) ^ S(x) = (6.13) 

and this is true for any x, even negative ones. Because Eq. (A1.4) contains an extra factor x N , it differs from Eq. (6.13) 
because x N = 1 when x = +1, regardless of how big N is (the case x = -1 shall be ignored, but it is related to the philo¬ 
sophically infamous light-bulb problem of predicting the final on/off state of a light bulb switched an infinite number of 
times [71]). Note that 5(1/2) = 2 was encountered in Zeno's paradox. 
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Now consider S(x) + S(-x) = 2S(x 2 ). It is already known that S(-l) =1/2 fromEq. (6.13), and so S(l) =1/2=1+ f(0), 
where the equality holds because two series are numerically equal if all of their component terms are equal. 5 Thus, f (0), 
which by its series definition is adding one to itself an infinite number of times, is equal to (-1/2). If that is not bad enough, 
what about the sum of all positive integers? Consider the derivative of S(x), or 

S’(x) = d x S(x) = - TV = —= (6.14) 

xp{ (1 -xf 

so that S'(l) = 1 + 2 + 3 + ■ ■ ■ = f (-1), which is the sum of all positive integers. Observe that S' does not have a j = 0 term. 
Then (and being careful that d x S(-x) = -S'(-x)) it follows that 

S'(x) - S'(-x) = 4xS\x 2 ) (6.15) 

which, forx = 1, gives S'(l) - S'(-l) = 4S'(1) -»■ S'(l) = -1/12. Lastly, S'(l) is numerically equivalent to f(-l), as shown 
by comparing the series with s = -1, along with the r(s)r(l - s) relation of Eq. (A2.5) and the particular value f (2) = ji 2 /6 
(as shown in Eq. (A2.7)). Either way, 

C(-l) = £; = -^ (6.16) 

that is, the sum of all positive integers is a negative number, and a not so big one at that, an even greater insult to the 
intuitions of many. Properly (i.e., mathematically) speaking, f(p) is not defined at p = -1, but by analytic continuation, 
4TC—1) is, leading to a conclusion that does not endear mathematical physicists to the lay public. 

Therein lies the cautionary moral of Riemann's zeta function: having a formula does not mean it is understood or used 
correctly, or appropriate for the parameters for which it is used. More generally, using series expansion formulae requires 
care. Mathematicians are a bit more blunt: as remarked by Niels Abel (1828), 6 "Divergent series are the invention of the 
devil, and it is shameful to base on them any demonstration whatsoever..." 


6.4 Chemical potential 

There is no light in earth or heaven/But the cold light of stars; 

- Henry Wadsworth Longfellow 7 

Think how it wakes the seeds, -/Woke, once, the clays of a cold star. 

- Wilfred Owen 8 

The allure of poetry is in the ambiguity of its language, so that using words not literally appropriate conveys understand¬ 
ing that would not be possible otherwise. But still, poets who speak of "cold stars" seem to have passed from ambiguous 
to oxymoronic. Before one speculates that either stars are emblematic of a destiny indifferent to human aspirations, or 
that poets literally do not know what they write about, ask if stars can indeed be understood to be "cold" in some sense 
that makes sense. The outer layers of the Sun are described by a hot MB distribution, but paradoxically the center (as 
well as stars like white dwarfs) can be better described by a cold electron gas (Section 6.6). How this is even possible, 
though, requires an understanding of what the chemical potential is, how it is evaluated, and what limits it takes on for 
the quantum distribution functions as a function of particle density. 


5 Be cautious: some series here include a ; = 0 term, and some do not. 

6 As quoted in Berry and Howls, Infinity Interpreted, Physics World, June, 1993, p. 35. 

7 Henry Wadsworth Longfellow, "774. The Light of Stars", English Poetry III: From Tennyson to Whitman. Vol. XLII, The Harvard Classics. New York: 
P.F. Collier & Son, 1909-14. Bartleby.com, 2001. www.bartleby.com/42/. 

8 Wilfred Owen, "Futility", A Concise Treasury of Great Poems, English and American, ed. Loius Untermeyer. New York: Pocket Books, 1953, p. 502. 
Owen was killed in action on the Western Front one week before the end of World War I. His "Anthem for Doomed Youth" in the same 
collection as "Futility", targets the wasteful slaughter of war. 
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For energy parabolic in momentum, or E k = (hk) 2 /2m, Eq. (6.8) can be rewritten 
coefficient and a dimensionless FD integral function F p (x) [72] by 

as the product of a dimensioned 

T> =^ (zjp) m s ■jmW* 

(6.17) 

p Jo l+e*-“ J-u 1 +e 1 

(6.18) 

The first form of Eq. (6.18) is more common, but the second form is more useful when 

u is large and positive, as it allows 

for a partitioning of the integral into forms seen to be related to the Riemann zeta function. 

EXAMPLE: Evaluate N c in Eq. (6.17). 

SOLUTION: Rewriting N c gives 


i/2^ 1 (Ry\ V2 3/2 3 

N c = -3——— = - - = 6.8122 eV“ 3/2 nm“ 3 

(An) 2 \ Q 2 ) 

(6.19) 

where R y = 13.606 eV and Q = 0.35999 eV-nm are used. 



Three methods are considered for fermions for three regimes of u = ftp, large positive, small positive, and negative, to 
illustrate three different methods that are generally useful for evaluating integrals related to density apart from a brute 
force numerical integration. They will be asymptotic methods, recursion techniques, and series expansions, respectively. 


6.4.1 Large positive u 

The integral for F p (u ) can be partitioned into three regions 

ill _ 

for x < 0, and collecting the remaining integrals as 

F,(u)=-u i/2 + [ ^ U + X ^ U — dx+ f ^ X+U dx (6.21) 

1/2 3 J 0 e* + 1 J u <?+1 v ' 

Asymptotically (in the large u limit), e~ u is negligible, allowing the last integral to be neglected. The integrand of the 
second can use Taylor expansions of \Ju±x to obtain 


, where the first region is obtained by making the replacement 

:l ~YV^ (6 ' 20) 




8u 3 


7x 5 
128 w 5 


} 


(6.22) 


followed by the replacement of the upper limit to oo. The middle integral of Eq. (6.21) is then well approximated by 




(6.23) 


where numerical values of f(2) and higher are as in Section A2.3. 

Zero temperature corresponds to the limit u = ftp -* oo. The value of p(Q) m p 0 is known as the Fermi energy (more 
commonly encountered as E p ), and the corresponding momentum hk F = \j2mp 0 is called the Fermi momentum, in 
terms of which the density is 

2 k 3 

Pfd (T -+ 0) = ^N c p]' 2 = (6.24) 

An increase in temperature does not correspond to an increase in particle number, and so the relation p FD (T > 0) = 
p FB (J = 0) can be used to develop a low-temperature expansion of p(T) in terms of p g . This can be performed by replacing 
p -► p a (l - ad - b8 2 - c<5 3 ), where <5 = {\/2)(n/2u g ) 2 , in F, /2 (m) = (2/3 )uj, /2 and Taylor expanding to order <5 3 . Equating 
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u=Pp 


Figure 6.1 FD integral function F 1 / 2 (.a) (Eq. (6.17). Numerical evaluation compared to the approximations of Eqs (6.23), (6.28), and (6.30) for u large 
positive, small positive, and negative, respectively. 


the coefficients of the powers of <5 separately to 0, then a = 2/3, b = 4/5, and c = 1976/405 is found with some effort, 
leading to the result 

For a metal like copper at room temperature, where p 0 = 7 eV, u 0 « 270.77, and so p is 99.9989% of p g . Increasing 
the temperature to 3000 K (u 0 « 27.077) changes that to only 99.888%, making the temperature dependence of p for 
metals often ignorable even past their melting points. Clearly, the expansion is reasonable as long as p 0 > nk p T\ at room 
temperature that would demand p 0 > 81.2 meV, or an electron density just above 10 20 cm -3 . In fact, Eq. (6.25) works 
reasonably well for slightly smaller p 0 , as evidenced in Figure 6.1, but then one is employing the first few terms of a 
divergent series for calculation (occasionally done by physicists, but distasteful to mathematicians). 

A Fermi energy of 7 eV corresponds to k F = 13.55 1/nm, indicating that p = 8.411 x 10 22 q! cm 3 . Compare this to 
copper, which has 8.49 x 10 22 atoms/cm 3 , and it is clear that to a reasonable approximation each Cu atom gives up its 
outermost electron to make the electron gas of "free" electrons. Using Eq. (5.24) for room temperature conditions. 

Pm <M4 = 1024< ? (6-26) 

or a large number of electrons packed into a volume of sides equal to the thermal wavelength. Given the wave properties 
of electrons, at such densities the electrons are so overlapping that any connection to the Drude model's reliance on MB 
statistics of distinct, separated, and non-interacting particles is opaque, making the prediction of the Lorentz number 
(more or less, as in Eq. (3.19)) or Dushman's version of Richardson's equation (as in Eq. (5.50)) rather surprising. 


6.4.2 Small positive u 

In semiconductors, the number of free electrons is determined by the dopants. The doping density is such that even 
heavily doped semiconductors (e.g„ with concentrations approaching 10 19 atoms/cm 3 ) have electron number densities 
(or carrier concentrations 9 ) that are orders of magnitude smaller than metals like copper. In fact, for such an electron 
density, p(p)X\ = 0.143 q, or a fraction of an electron per thermal wavelength cubed, where X T is given by Eq. (5.24) with 
m n replacing m as the electron's mass: for many semiconductors, the effective mass m n < m (e.g., the ratio of effective mass 
with rest mass mjm is 0.066 for GaAs and 0.55 for Ge, although for Si it is 1.062 [73]), making the thermal wavelength 
even larger. Therefore, fractions of an electron per thermal wavelength cubed require that p < 0. The boundary between 
small and negative is given by u -*■ 0. Comparing F 1/2 (0) to the integral definition of f(p), 

*1/2(0) = (1 - 2- 1 ' 2 ) r (f ) C (|) * 0.678094 (6.27) 


’Although two types of carriers, electrons and holes, exist, the discussion of one can be bent to describe the other. If that dodge is unsatisfying, 
then a richer description can be found in Section 22.4. 
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Table 6.1 Values of from Eq. (6.29) obtained via numerical 
integration. 


/ 9, 1 9j i 9j 


0.67809 11 

0.53608 12 

0.43647 13 

0.36465 14 

0.31146 15 

0.27105 16 

0.23961 17 

0.21461 18 

0.19434 19 

0.17762 20 

0.16360 21 


0.15170 22 

0.14146 23 

0.13257 24 

0.12478 25 

0.11788 26 

0.11174 27 

0.10624 28 

0.10128 29 

0.096778 30 

0.092679 31 

0.088930 32 


0.085486 

0.082311 

0.079374 

0.076649 

0.074114 

0.071749 

0.069537 

0.067464 

0.065517 

0.063684 

0.061955 


At room temperature, this would imply an electron density of p = 1.1770 x 10 20 qlcm? = q/k\. Thus, even heavily doped 
semiconductors have negative chemical potentials, but as will be seen, band bending near the surface of the semicon¬ 
ductor brought on by high applied field can render the electro- chemical potential small but positive. Thus, a method to 
find F l/2 (u) for small u is necessary. Repeated use of Eq. (6.20) gives rise to the series 

=9o + £ d - e- u y 3j (6.28) 

B} = /o°° (hSfV** (6-29) 

Observe that g 0 = {n/A) [2 - f (3/2). Values are tabulated in Table 6.1, but for j> 9 the approximation 
«s 1.2502j -0 ' 86697 is reasonable (error < 5% by ;' = 9 or < 1% by j = 17). The performance of Eq. (6.28) is shown 
in Figure 6.1 with the series truncated at; = 32 and using the numerically calculated values of 3 j given in Table 6.1. 


6.4.3 Negative u 

When u < 0, then e x ~ u > 1 for all positive x in Eq. (6.18), and so the expansion of Eq. (6.13) may be employed in the 
form of 1/(1 + e* _H ) = e u ~ x S(—e u ~ x ). Each term in the resulting series is integrable in terms of the r(x) function, and so 

F l / 2 ( U <O)=^% ( -=0^ e ]u (6-30) 

where M is chosen for a desired accuracy: the choice of the integer closest to M & 6/\u\ provides agreement to better than 
five significant digits. Note how F l/2 ( 0) compares to the series definition of f(3/2) in Eq. (A1.8). For a very dilute gas 
(-u » 1), g decreases as 

g(T) = (3/2)k B Tln [(16/9* ) 1/3 g 0 /k B T] (6.31) 

Consequently, g(T) passes through zero at a temperature of 

T(g = 0) = p^( 2 “V5)C(3/2)J (^= 0.99873^ (6.32) 

where Eq. (6.27) is used. From Eq. (6.31), it is seen that at high temperature, g(T) varies as -Tln(T). The behavior for 
various g 0 cases is shown in Figure 6.2. 

The regions separated in Figure 6.2 by the dashed line at g = 0 are referred to as a degenerate gas where g > 0 and a 
non-degenerate gas where g < 0, where non-degenerate gases obey classical statistics, but degenerate ones require quan¬ 
tum statistics. Instead of the imputation of moral turpitude normally associated with the word, degeneracy refers to the 
situation in which more than one particle can share quantum numbers: in a degenerate electron gas each set of quantum 
numbers (e.g., momentum) are occupied by two electrons differing only by spin. That factor of 2 increased the value of 
A rld in Eq. (5.50) to its modern value. 
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T [Kelvin] 


Figure 6.2 Variation of u as a function of temperature for u 0 = 1.0 eV (top), 0.5 eV (middle), and 0.1 eV (bottom). Red lines correspond to Eq. (6.25) up 
to quartic behavior, and blue to Eq. (6.31). The dashed gray line corresponds to u(T) = 0, and where ii(T) crosses it is specified by Eq. (6.32). 


6.5 Classical to quantum statistics 


If you want to find out anything from the theoretical physicists about the methods they u 
don't listen to their words, fix your attention on their deeds. 


e, I advise you to stick closely to one principle: 

-Albert Einstein 10 


The passage from classical (MB) to quantum (FD and BE) statistics (and vice versa) can be charted by fixing attention 
on the grand potential encountered in Eq. (5.34) and its transformation into Eqs (6.2) and (6.6). Close inspection reveals 
that all three can be obtained from ^ ^ 

\fE\n[\ + se^-^dE (6.33) 

2 P Jo 

where N c is from Eq. (6.19) and s —► [-1,0,1] to obtain the BE, MD, and FD representations, respectively, 11 from which 
pressure P and number density N/V = p are obtained using Eq. (5.35) to find 


' 3/?V2 J 


k \/2 


—dx 


(6.35) 


s Jo e*-“+s' 

where an integration by parts has been performed on P s to make its comparison to p s more agreeable, u = fip, and x is an 
integration variable. Before, the narrative focused on the accurate evaluation of p for a given density p, but now consider 
instead the departure of FD and BE statistics from the ideal gas, or MB, behavior, so that the fixation on numerical 
accuracy can be relaxed a bit. Therefore, Taylor-expand the denominators of the P and p integrals in Eqs (6.34) and 
(6.35), and integrating the resulting series term by term to find 


It is immediately deduced that 


s 4 P /2 jri P /2 

, „v5iy <dt±p 

ls 4 p^P pP 


zr =l f 5/2 (-se“y 1 


(6.36) 


V p / S E”i r 3/2 (-se«y 1 


10 The Herbert Spencer lecture, delivered at Oxford on June 10, 1933, reproduced In Albert Einstein, Ideas and Opinions. New York: Crown 
Publishers, 1954, p. 270. 

11 Clearly, the choice of s = 0 should be viewed as a limiting procedure as s -* 0. 


www.ebook3000.com 







Quantum distributions 57 



i red circles correspond to numerically evaluated values, and Hi 


(6.35) (for N). 

Eq. (6.40). BE (r = -1) andFD (s = +l), respectively. Gray dashed lit 


(MB) corresponds to the ideal gas relation PV = Nk B T. The coordinate limits for the left (Pa < 0) and right (Ppi > 0)) differ; otherwise the red circles and 
gray line would be continuous across the origin. 


For MB statistics (s -*■ 0), the series are understood to truncate after the first term. Therefore, for an ideal gas described 
by MB statistics. 


an alternate way of rendering the ideal gas law PV = Nk B T. For BE and FD statistics when pp = u «c -1, the leading order 
approximation to pP/p is obtained by truncating the series after two terms, giving 


- => 
V p /mb 


p Ps _ 1 - (se"/2\/8) 
Ps 1 _ (seK/ys) 


(6.40) 


and it is evident that both the BE (s = -1) and FD (s = +1) distributions, pPjp s , approach the MB (s = 0) result of 1 as 
u -» -oo, but BE does so from below and FD from above unity. Conversely, and only for FD and BE statistics (s = ±1), 


I^^ = i( X+5 + Vi(X+1) ) 


£(5/2) 
f (3/2) 


(6.41) 


or 0.51351 (BE: s = -1) and 1.1334 (FD: s = +1), which are also below and above unity, respectively. For BE statistics, 
there is no going further {p BE < 0 always), but for a degenerate Fermi gas (p > 0), the final limit of u » 1 is obtained by 


Eq. (6.42) is a rather astonishing result. Unpacking it, it says that as the temperature gets very small such that (pp » 1), 
the pressure goes to P -* Ipp/'h which is the zero temperature result: unlike MB and BE statistics for which the pressure 
vanishes as the temperature vanishes, the Fermi gas has a finite pressure as T ^ 0. This behavior is seen in Figure 6.3, but the 
interpretation could easily well be that the density is becoming very large if the temperature were high. Considering 
such a regime seems like pompous scholasticism as the electron density even for metals, which tend to be the largest, is 
nowhere near such values - on Earth, anyway. 


6.6 Electrons and white dwarf stars 

There are more things in heaven and earth, Horatio, than are dreamt of in your philosophy. 

- William Shakespeare 12 

The black dwarf material is best likened to a single gigantic molecule in its lowest quantum state. On the Fermi-Dirac statistics, its high 
density can be achieved in one and only one way, in virtue of a correspondingly great energy content. But this energy can no more be 


! Ref. [37]: Hamlet, I.v. 166-167, p. 943. 
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expended in radiation than the energy of a normal atom or molecule. The only difference between black dwarf matter and a normal 
molecule is that the molecule can exist in a free state while the black dwarf matter can only so exist under very high external pressure. 

- Ralph Howard Fowler 13 

Although physics is interesting in the extremes, temperatures up to 6000 K in Figure 6.2 appear overblown for 
any metal or semiconductor. Yet there are natural circumstances where it does occur in stellar evolution, when a 
middle-weight star like the Sun nears the end of life and transitions to a white dwarf. 

The Sun is a plasma sufficiently rarified and hot that the charged particles satisfy MB statistics. Near the end of life of 
such stars, the outward pressure opposing gravity weakens as the nuclear fuel is consumed, and the star can collapse to 
a point where the pressure opposing gravity is supplied by electrons: as per Eq. (6.40), an electron gas has a non-zero 
pressure even at zero temperature. The star continues to collapse until the electron pressure counterbalances that due to 
gravity. It is on the electrons to do so: the much higher mass of the ions (typically helium) means that their statistics remain 
well-characterized by MB statistics; moreover, He+ + is characterized by BE statistics, and the pressure of a boson gas drops 
as the temperature drops even faster than that of an MB gas [62], a consequence of the BE condensation encountered 
in Section 6.1. In that case, the bosons increasingly occupy the ground state where their thermal agitation, for lack of 
a better phrase, is low, and insofar as thermal agitation and pressure go hand in hand, the pressure drops. Fermions, 
particularly electrons, cannot do so: they squeeze into whatever low energy level presents itself, with the average energy 
being 60% of the Fermi energy, as in Eq. (7.4). For very massive stars, the electron pressure is eventually insufficient to 
counterbalance gravity, causing the electrons to be forced into combination with protons to create neutrons, and the star 
collapses further into a neutron star, or, for the really massive, a black hole. 


EXAMPLE: What is the chemical potential of the electron gas in a white dwarf star such that the e~ density is 
p = 10 30 cm -3 (a generally accepted value)? Assume the electron density is uniform. 

SOLUTION: Using the zero temperature relation between chemical potential and density (see also Eq. (7.3) below). 


Vo = t-(3^ 2 p) 2/3 
2m 

_ (197.33 eV-nm) 2 
1.022 Xl0 6 eV 


3x 10 9 2 
nm 3 


) 


2/3 


= 0.3646 MeV 


(6.43) 

(6.44) 


Observe that for a temperature of 5000 K (temperature of the photosphere of the Sun), Pp is ridiculously large because 
p in Eq. (6.44) is large, but that even at a temperature of ~ 1.5 x 10 7 K, as in the center of the Sun, pp = 282, which 
is sufficiently greater than 1 as to simplify Eq. (6.45) below: such numbers mimic a zero-temperature approximation, 
allowing the star to be characterized as a "cold" gas. Observe also that the chemical potential of the electrons in a white 
dwarf is comparable to the rest energy of the electron: this means that within such a star, the electrons are relativistic, 
which modifies the analysis. 

For most stars, "more massive" goes with "larger." In contrast, white dwarf stars are smaller the more massive they are, 
and the explanation of that can be illuminated as follows. The largeness of Pp » 1, when applied to the Grand Partition 
Function Q. FD = -k B T\n{Z FD ) (Eq. (6.5)) and using the continuum distribution approach of Eq. (4.6), results in 




,= -^- [ ln(l + ^-^)4 nk?dk 
(2 n?p Jo 

y?l 2 p 5 l 2 V 


(2nYp J 
Asjl 
\5n 2 h i h 

From the relations for pressure P = d v £l and between electron density p and p of p = (h 2 /2m){1n 2 p) 2/2 then 


(6.45) 

(6.46) 


(6.47) 


13 Quote from ref. [74]. A black dwarf is a white dwarf that has radiated away all of its energy. Fowler was addressing a seeming paradox 
whereby a portion of black dwarf matter, brought to free space, would not have the means to resume its gas-like (normal density) state in the 
absence of FD statistics. 
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This takes some getting used to. It was assumed that the electrons are more or less uniformly distributed throughout, and 
in fact, that is so; the nuclei follow suit and become fixed with respect to each other in a crystalline-like pattern termed 
a quantum solid, with densities in the center of a white dwarf close to 10 10 -10 u kg/m 3 , compared to the 162200 kg/m 3 
for the center of the Sun [75], 

The electron pressure P el will be opposed by a gravitational pressure trying to collapse the white dwarf star: the evalua¬ 
tion of the later can be estimated crudely. On one side of an area of size A = r 2 AQ a distance r from the center of the star, 
where A Q, is a small solid angle, the force pushing out is due to the electron gas; that pushing in is due to gravitational 
attraction. Gravity is an inverse-square law force analogous to the interaction of charged particles, and from the latter it 
is known from Gauss's theorem that the force on a test charge a distance r from the center of a uniformly dense sphere 
of charge depends only on the amount of charge within a sphere of radius r, not the charge outside. So, too, with gravity 
(as an aside, in a problem expressly designed to torment physics graduate students, a mass dropped through a bore hole 
passing through the center of a non-rotating planet of mass M 0 will exhibit harmonic oscillations: the gravitational force 
on the particle would be F(r ) = —GM(r)m/r 2 where G = 6.6738 x 10 _n m 3 / kg s 2 is the gravitational constant. Since 
M(r) = p(4^r 3 /3) = M 0 (r/R ) 3 with R the radius of the planet, then F = -k g r with k g = GM 0 /R?\ a force linear with dis¬ 
placement is the signature defining feature of a harmonic oscillator). The mass within the thin slab is pA£lr 2 dr, so that 
the gravitational force attracting it to the center is 


dF = ^ ^ m h p(^j ^ ( m h pA£lr 2 )dr = ^nGm 2 h AQ.p 2 r i dr 


(6.48) 


The pressure gradient dP = dF/A is therefore dP/dr = ( 4n/3)Gm 2 p 2 r . Integrating that from the center of the star 
outer radius R gives a crude estimate of the average pressure due to gravity, or 

_ 3 GM 2 
G 8jtR 4 


to the 

(6.49) 


where the average density p = 3M/(4jrR i ) has been used. Equating P d = P G and placing all the star parameters on one 
side and the constants on the other gives 


This is a weird and remarkable equation, and relates to work by Chandrasekhar to provide an upper bound on the mass 
of a white dwarf to 1.4Mq, where the © signifies the Sun, before gravity overcomes the electron pressure. What is 
weird about Eq. (6.50) is that the left-hand side refers to stars and the right to constants associated with gravity, quantum 
mechanics, chemistry, and (if the calculation were done acknowledging the near-light velocity of the electrons glimpsed 
in the discussion after Eq. (6.44)) relativity that are not usually expected to keep company. What is remarkable is that 
the right-hand side is composed only of physical constants, and therefore is constant itself. In other words, the mass M of the 
star is inversely proportional to the cube of its radius, that is, counter to general expectation, as the radius decreases, the 
mass increases. 


EXAMPLE: Use Eq. (6.49) to estimate the pressure at the center of the Sun (Mq = 1.9885 x 10 30 kg, Rq = 6.96 x 
10 8 m). Compare it to the commonly accepted value of 2.477 x 10 16 Pa. Explain why the estimate appears defective. 
SOLUTION: Inserting the values. 




8tt 


^6.6738 X 10 -11 


[ m 3 1\ (1.9885 XlO 30 [kg]) 2 
[kg s 2 J / (6.96 X 10 8 [m]) 4 


= 1.3424 x 10 14 Pa 


(6.51) 

(6.52) 


or about a factor of 185 smaller than the currently accepted value. A substantial reason for the disagreement is the 
approximation of uniform density: the average density of the Sun is Pq « 1408 kg/m 3 (about 40% more than that 
of water), whereas the center is 162200 kg/m 3 , or a factor of 115 larger. 


Harping on the solar density variation, but as a pedagogical exercise only, if instead the density p(r) is taken as a power 
law for the form 


(6.53) 
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for some n, to mimic the concentration of mass in Sun's center, where the coefficients insure that the integration over 
dr gives the mass of the Sun and Pq = Mq/Vq is the average density as the ratio of the mass of the Sun to its volume, 
then the pressure Pq would be instead 


p _ GM 2 (n + 3) 2 (n + 2) 2 (n + 1) 
° " 96;rP 4 (2 n + 1) 


(6.54) 


A factor of n « 8 is therefore suggested. A power-law representation, however, is too simple a representation of the 
many layers involved, and much has been neglected, but the power law ansatz makes plausible the importance of density 
variation in stars where the electron gas is not degenerate. 

A discourse of any length on white dwarf stars may appear a bit odd in a treatment of electron emission, but there is a 
subtle nod to the human dimension of the physics involved. Ralph Howard Fowler was an exceptional physicist, having 
made (as seen above and will be seen below) seminal contributions to the emission equations for thermal and field 
emission. He was wounded at Gallipoli 14 during World War I, was Lord Rutherford's son-in-law, was elected a Fellow 
of the Royal Academy in 1925, and had students such as Lennard-Jones and Dirac under him. He clearly possessed an 
excellent mastery of statistical and quantum physics, but he was equally (better?) well known as an astrophysicist and 
for his work on.. .yes, white dwarf stars [74]. Paying homage to him by using another of his passions to illustrate a point 
in a tome on electron emission, a field he had much to do with creating - how poetic is that? 


14 Other physidsts there were even less fortunate: Henry Moseley, who worked under Fowler's father-in-law Rutherford, and whose 1912 
study of X-ray spectra from elements bombarded with high-energy electrons resulted in his demonstration that nuclear charge and not weight 
determined chemistry (for which he might have won the Nobel Prize), was killed at Gallipoli on August 10, 1915 [76]. Physicists were kept 
further from the front lines during World War II and subsequent conflicts, and thereby had much greater impact on how their countries 
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CHAPTER 7 

A box of electrons 


7.1 Scattering 

...the Argives and their allies advancing with haste and fury, the Spartans slowly and to the music of many flute players - a standing 
institution in their army, that has nothing to do with religion, hut is meant to make them advance evenly, stepping in time, without 
breaking their order, as large armies are apt to do in the moment of engaging. 

- Thucydides 1 

...by the early sixth century, the charge of a well-trained and well-armored phalanx, its thousands of men acting as one and keeping 
pace by means of rhythmic shouts of the paean, proved itself all but irresistible. 

- William H. McNeill 2 

At the Battle of Marathon (490 BCE), the huge invading force of 50,000, sent by the Persian king Darius I to crush 
the Greeks, suffered humiliating defeat at the hands of an outnumbered Athenian force (the Spartans arrived after the 
engagement because of their decision to conclude religious festivities first). The destruction on the Persian side, a seventh 
of the forces, was in stark contrast to the small losses suffered by the Greeks. The deployment of the Athenian hoplites 
in a phalanx had much to do with that. The phalanx, in which troops are arranged shoulder to shoulder with locked 
hoplons (shields), many rows deep, gave little opportunity for scattering, and as a result the seeming irrationality of 
being pressed onto the end of an opponent's sword or spear resulted in much fewer collective losses: being embedded 
in a phalanx made sense to those who viewed the risks of battle as shared equally, particularly when being routed was 
deadly but the urge to flee an individual matter that once acted upon could spread panic and therefore disarray [77]. 

Scattering has consequences ...but only if it is possible. 

The electron density in a typical metal is so high that even though the electrons are cheek by jowl (i.e., their average 
nearest-neighbor separation is smaller than their thermal wavelength), unless they are near the Fermi level, scattering 
with their neighbors is not possible because there is no open space, no available state they have the energy to jump to, that 
is not already occupied. Sommerfeld's usage of Fermi-Dirac statistics and the newly developed quantum mechanics to 
describe metals resolved difficulties encountered in the Drude theory related to conductivity [78], but more significantly 
here paved the way for a reconsideration of the thermionic emission equation by Fowler [79] and the development of 
the field emission equation by Fowler and Nordheim [36], An important application of that theory is that electrons will 
not scatter if their final state is occupied, and that realization fixed Drude's theory of conductivity quite nicely. 


7.2 From classical to quantum mechanics 

...there is nothing more difficult to carry out, nor more doubtful of success, nor more dangerous to handle, than to initiate a new order 
of things. 

- Niccolo Machiavelli 3 

A key assumption in Sommerfeld's model was that the electrons behaved much like a non-interacting, or "free", gas 
of particles, and as quixotic as that seemed, it worked. The relation between number density p = N/V and chemical 
potential p hints why charged particles packed tightly can nevertheless be treated thusly in the Sommerfeld model of a 


Thucydides, The Landmark Thucydides: A Comprehensive Guide to the Peloponnesian War, eds Robert B. Strassler and Richard Crawley. New York: 
Free Press, 1996, Book 5.70, p. 343. 

2 William H. McNeill, The Rise of the West: A History of the Human Community: With a Retrospective Essay. Chicago: University of Chicago Press, 1991, 
p. 198. "Sixth century" refers to the sixth century BCE of Ancient Greece. 

3 Niccolo Machiavelli, The Prince. New York: New American Library, 1952, p. 49. 
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metal. Usually, an ideal or Maxwell-Boltzmann gas satisfies several conditions. First, as inferred by the density relation 
p MB = exp(/?^)/2 3 where the thermal wavelength X T is given by Eq. (5.24), X T must be far smaller than the length scale of 
the inter-particle separation L = (l/p MB ) l/3 . Observe that the smallness of X T /L requires that the chemical potential p be 
negative so that exp(pp) < 1, and recall that p can be made negative either by the gas being rarified or the temperature 
being so large that the gas is no longer degenerate. Moreover, the average kinetic energy (3/2 )k B T must be comparable 
to or larger than the interaction energy between the particles if the "interaction" can be ignored. 

Previously in Section 5.6, neon was taken as an example ideal gas, so for variety, consider argon (M = 39.948 
amu = 0.039948 kg/mole) now. For it, L/X T « 200 at standard temperature and pressure (STP) conditions ( T = 273 K, 
P = 101325 Pa). The energy of interaction between argon atoms is well described by a Lennard-Jones (LJ) 6-12 potential 

y 1J (r) = 4e„[(^) 12 -(^) 6 ] (7.1) 

for which the minimum of the potential is at 2 l ^ 6 r 0 and takes a value of —e LJ . Also, r 0 = 0.3067 nm and e LJ = 8.04 meV, 4 
where the average separation r m L= 10.89 r g , making the energy of interaction V U (L) very small. By comparison, the 
kinetic energy of an argon atom at STP is (3/2 )k B T =35 meV. 

So much for a gas. What about a metal? As per Eq. (6.25), p(T) « p 0 due to (n/pp 0 ) 2 «: 1, so that f PD (E) -*• &(p - E k ) 
and so 




N 

V 


= P(T= 0) = 


3n 2 


k 2 dk 


(7.2) 

(7.3) 


where the Heaviside step function @(x) = 1 if the argument is positive 


n electron in a metal 


ana 0 otnerwise. Similarly, tne 






A 1 f k * h 2 k 2 ,2 ,, 3 


The ratio between the average kinetic energy KE and the energy of interaction PE = 4 Q/r of two electrons separated by 
the mean spacing r = 1/p 1 / 3 with p from Eq. (7.3) is then 

KE _ (9j[) 2 ^h 2 k F 

PE ~ 10 amc ( ' ’ 


which increases as p (and therefore k p ) increases. At the limit of white dwarf conditions, the average kinetic energy is well 
beyond the potential energy of interaction of two electrons; even for more terrestrial metal-like conditions the kinetic 
energy is not trivial, as there, Eq. (7.5) is of order unity. Thus, fermions oddly become more "classical" as the density 
increases. 

An "interaction" requires something to change: electrons scattering off each other swap energy and momentum, and 
therefore change their quantum numbers. If they are foresworn from doing so because the quantum numbers of any final 
state they could possibly scatter into is already taken, they cannot be said to be "interacting" in the usual sense - rather, 
they go on doing what they were doing and behave as though they were free. Hence, the Sommerfeld assumption is a 
bold but effective one. 

A final distinction between classical and quantum but with regard to statistics resides in the consideration of the spacing 
of the energy levels. Recall in Section 3.3 Einstein's insight that the discrete energy spectrum used by Planck to recon¬ 
cile Wien's law with the Rayleigh-Jeans formula, leading to Eq. (3.23), was fundamental to the nature of the photons. 
Discrete energy states are quite different from the continuum energy spectrum associated with classical physics, and 
that distinction is the last metric by which quantum descriptions go over into classical ones and vice versa: when the 
energy difference between adjacent energy levels is small by comparison to the (average) particle energy, classical statis¬ 
tics are good. That distinction can be viewed as a requirement on the size of the quantum numbers, for example the 
well-understood harmonic oscillator [44] has an energy spectrum that behaves as hco n = (n+ 1/2 )fiw 0 , where w 0 is the 
ground state energy. The condition for a classical description is then that hco n+l — tico n = tico 0 «: (n + \/2)Tico 0 , or 2n » 1. 


4 See Table 11.1 of ref. [65], 
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7.3 Moments and distributions 


Not all those who wander are lost... 


- John R.R. Tolkien 5 6 7 


Having carefully explored the landscape leading to the Sommerfeld model, the quantum refinements to Drude's model 
giving the correct coefficient to the Lorentz number of Eq. (3.20) can be determined (it was encountered already in the 
red line of Figure 3.3). The agreement is profound, and significantly strengthens the grounds by which the Sommerfeld 
model serves as a basis for the electron emission equations which follow. The journey shall (i) begin with Eq. (3.5) for the 
charge current density J q but recast in the language of distribution functions/(x, k), ( ii) recast the momentum integration 
to an (easier) energy integration by introducing the density of states, {Hi) place scattering in the context of phase space 
(leading to Boltzmann's equation), (iv) incorporate the effects of occupied states on scattering, and finally (v) arrive at 
the quantum mechanical form of the Lorentz number L. The language to do so shall make use of the moments of the 
distribution. 

Recall that the distribution of phase space points /(x, k) is normalized according to 


N — j dxdkf (x, k) 

(2 ny J 


(7.6) 


where N is the number of particles, so that the charge density is qN/V = qp. Observe that the "2" in the numerator of 
the coefficient (previously called g) accounts for electron spin. For a constant potential, the distribution of particles will 
be independent of x and so 


f(k) = ^ j dxf (x, k) 


(7.7) 


The moments M n (k ) of a distribution function/(k) are often defined with respect to a mean value, but here take them to 
refer to integrations of powers of k with the distribution function such that 

M n (k.) = f k’lf(k)d~k (7.8) 

(2 nf J 

where kj is one of the cartesian components. This form will allow for generalizations leading to the supply function in 
the one-dimensional emission equations. The number density is the zeroth moment, or p = M 0 (k x ). Average quantities 
such as (k”) are the ratios of the nth moment with the zeroth, a particularly important one being current density 


'•-•w- si?/*(£)** 


(7.9) 


where the average velocity is ratio of the first moment M y (k x ) with the zeroth M 0 (k x ), the later of which is proportional 
to the number density p, so that 

p(v x ) = ^M x (k x ) (7.10) 

Thus, Eq. (7.9) is seen to be J q = (qh/m)M y {k x ) and therefore equivalent to Eq. (3.5), except for a tiresome negative sign 
that indicates direction of flow but which is otherwise inconvenient to retain. 6 Eq. (7.9) is the defining equation for charge 
current density in a bulk material, and therefore the starting point to consider conductivity. Later, with modifications, it 
will be the starting point for the development of the one-dimensional emission equations. 

Given that MB, FD, and BE statistics are more easily expressed in terms of energy E rather than wave number k, 1 
modifications to the integrals result and introduce the density of states (DOS) T){E), allowing number density to be 
written as ^ 

p= f D(E)f{E)dE (7.11) 

Jo 


5 J.R.R. Tolkien, The Fellowship of the Ring, The Lord of the Rings. Boston: Houghton Mifflin, 1987. 

6 In units of q = 1 , the current density and the number current density are the same. 

7 Observe the change in notation to a more conventional E in the emission nomenclature, rather than the past usage of e k to indicate the energy 
of a single particle characteristic of the statistics nomenclature. 
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In polar coordinates, the phase space volume element is dk = k 2 sin OdkdQdcj). A parabolic relation for energy E=ti 2 k 2 /2m 
depends only on the magnitude of k. Finding the expression for the density of states D(E) is then obtained by considering 
the zero temperature number density or 

"» (T = 0) = ^ C ,7 ' I2) 

where An is from integration over the angular coordinates. By introducing 

D i (.E) = ^y/2mE = N c VE (7.13) 

as per Eq. (6.19), then p = f£ D(E)dE at zero temperature, and where the subscript 3 distinguishes it from the one- and 
two-dimensional versions. The discussion seems to be about point-like particles, but the same relations are obtained when 
the states in question are manifestly quantum mechanical in origin, as shall be seen in Section 21.2 and the discussion 
leading to Eqs (21.10)-(21.12). 


EXAMPLE: Find D(E) for two dimensions. 

SOLUTION: In two dimensions, the area element is kdkdO and so 

1) 2 (E)dE = -~^k (^]dE f d0= -^rdE 
(2 nf \dEj J 0 nh 2 

or T) 2 (E) = m/nti 2 and is independent of energy. 


EXAMPLE: Find D(E) for one dimension. 

SOLUTION: In one dimension, the line element is dk and so 


' (In) 1 




Pleasingly spherical distributions render the calculation of V(E) no more difficult than finding the surface area of a 
sphere and an expression for dk/dE, but this is a consequence of the triviality of the model, not the simplicity of atomic 
arrangements: the potential was taken to be constant, whereas in the presence of atomic cores, forbidden regions open up 
where ranges of k values are excluded. In other words, band gaps form when atoms and the periodic arrangement of core 
potentials associated with them are involved. In such cases, the surface area is no longer simple, and finding all k for a 
given energy E is more difficult. A parabolic density of states is a model of convenience and often adequate, but not always. 

Lastly, revisit the distribution of electrons given by the product D(E)f FD (E), the integral over which gives p(T) as in 
Eq. (7.11) and shown in Figure 7.1: the top plot is for a large positive Fermi level (p = 7 eV), for which not much 
variation is visible in the distribution; the middle plot is for conditions similar to a very heavily doped semiconductor 
(p = 0.5 eV), for which the temperature dependence is becoming more pronounced near E » p; the bottom plot is for 
when the chemical potential is negative (p = -0.1 eV), for which the differences are rather stark, a circumstance that 
anticipates the longer discourse on semiconductors in Chapter 22. 


7.4 Boltzmann's transport equation 

Socrates: Heraclitus is supposed to say that all things are in motion and nothing at rest; he compares them to the stream of a river, and 
says that you cannot go into the same water twice. 

- Plato 8 


8 PIato, The Dialogues of Plato: Cratylus (trans. Benjamin Jowett), Great Books of the Western World, ed. Robert Maynard Hutchins, Vol. 7, Plato. 
Chicago: Encyclopaedia Britannica, 1952, p. 94. 
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Figure 7.1 The integrand of Eq. (7.11) for various p. for a copper-like ^ = 7 eV (top), for a very heavily doped semiconductor p = 0.5 eV (middle), and 
for a doped semiconductor p = -0.1 eV (bottom), which anticipates the electron distribution of Figure 22.7, with the difference that the zero of energy 
is at the bottom of the conduction band here. 


Perpetual change is a better model of life than unchanging equilibrium, although if the phase space distribution is 
disturbed, it will relax back to the equilibrium Fermi-Dirac distribution, with the electrons thermalizing by scattering off 
each other (e-e scattering), off the atomic cores or rather groups of cores in collective motion (e-p scattering, where p 
refers to phonons), off individual atoms of a different type than the bulk material (impurity scattering, be they ionized 
or neutral), or various other mechanisms which complicate the transport of electrons away from a simple ballistic model 
[80, 81]. There will be a time to consider the individual processes - what matters here is that they exist, that they 
thermalize the energy of excited electrons to an equilibrium distribution, and that the equilibrium distribution tends to 
the thermal Fermi-Dirac distribution. 

The waters may not be the same, but the river looks familiar: if the overall density p of the electrons does not change, 
then 

p = — f f(x,i\t + df)dxdi= f f(x,k;t)dxdk (7.14) 

(2 ny J (2 icy J 


so this suggests 


—^—r f ( 4-f(x,k;t))dxdk= 0 
{2k? J \dt JK ’7 


(7.15) 










66 Chapter 7 


which in turn indicates df /dt = 0 because the electrons are neither created nor destroyed and therefore have to go 
somewhere in phase space (the invariance of the phase space element is treated in greater detail in Eq. (33.26)). It is well 
known that the total time derivative is composed of three parts: the first is how / depends directly on time t, the second 
is due to changes in position as the particle moves with a velocity, and the third is changes in velocity as a consequence 
of forces, or, in short, 

fit fit . ^ ^ 

(7.16) 


| = | ti-Vd-V 


where the odd dots over x and k indicate their time rate of change (the "over-dot", or Newton, notation). Dots, vectors, 
partials - in short order, the notation is maddening, and the urge to a simpler one-dimensional problem irresistible. 
Therefore, consider just one component of Eq. (7.16), say the x direction, and assume uniformity in the transverse 
directions, so that 

ft = Tt +kdxf+kdkf (7A7) 

Temporarily ignoring the subscription^ (and force F below), then i = hk/m is velocity, and hk/m = -F/m is acceleration 
(forces being associated with the negative gradients of potential energy and F being a positive quantity in the present 
convention). Now, if the distribution is pushed away from its equilibrium configuration/^, then df /dt is not zero, but 
relaxes back to the equilibrium distribution with some characteristic time, the relaxation time, represented as t, or 
df/dt= —if — f eq )/r, a form known as the relaxation time approximation. Put together, a simple form of Boltzmann's 
transport equation (BTE) results, and is 

d l 


dt 




( f~f eq ) 


(7.18) 


Integrating over dk on both sides of Eq. (7.16) recovers the definitions of p and J from Eqs (7.6)-(7.9). The same integral 
over the force term vanishes because /(±oo) = 0. Finally, for the scattering term, integrals over dk for both / and f eq give 
the same density, and so the right-hand side vanishes. Alternately, one can say that in equilibrium as many electrons 
scatter into a state as out of it with no net noticeable effect. The cumulative effect is to regain the continuity equation 


d t p + d x J = 0 


(7.19) 


where d x J -*■ V ■ J for three dimensions. With scattering, density of states, a Fermi-Dirac distribution, and a moments 
approach, the BTE allows for a correction to the Drude model for, in particular, conductivity and heat capacity. 


7.4.1 Electrical conductivity revisited 


And enterprises of great pitch and moment 
with this regard their currents turn awry. 

- William Shakespeare 9 


With the thermalized Fermi-Dirac distribution f PD as the equilibrium condition, consider afresh the conductivity under 
the conditions that (i) the problem is time-independent (d t f = 0), (ii) the current density is constant (dj' = 0), (raj the 
force term is only in the x direction, (zV) the difference between/(k) and f pn (E k ) is small, and (v) f FD (E k ) is symmetric in k 
because E k is parabolic in k, then the current flows only in the x direction. Looking closely, it is assumptions (zV) and (v) 
that seem simple but are not (compared to the others which seem simple and are), so examine each. 

The smallness of the difference assumption means that / » f FD + 8f, where 8f is a perturbation. Thus, the right-hand 
side of Eq. (7.18) is -<5 //t. What causes the disturbance, the force F, is therefore small as well. In short, there are big terms 
like f PD , small terms like F and 8f, and negligible terms like products of F and 8F. To leading order then (considering just 
the big terms), d t f PD + d x f FD = 0 which is clearly true given that the FD distribution is independent of time and position. 
The next order terms are what matters, and so neglecting F8f, 



Recalling Eq. (7.19), if the density is time independent, then the current density is spatially uniform ( d x J = 0), which 
implies d x Sf -* 0. Now, reconsider the current density itself: it is obtained by integrating over k x the distribution / with 


9 Ref. [37]: Hamlet, IH.i.86-87, p. 951. 
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Tik x /m. That means that any symmetrical function in k x can be subtracted from/ and not change the current density because 
the integral of a symmetric function with an antisymmetric one vanishes. The Fermi-Dirac distribution is symmetrical 
in k x , and so 

2a f - hk x 

J q = < 7 - 21 > 

(2 nY J m 

which allows f -f FD = 8f = (rF/h)diJ Fn to be inserted. As a result, the current density is given by 

,7 - 22) 

where the likely energy dependence of the relaxation time is made explicit, and where q is appended as a subscript to 
J to indicate that it is a charge current density. A common visualization of Eq. (7.21) is that of a spherical distribution 
being rigidly translated a small amount in momentum space [82]. The symmetry observation also entails that the number 
density depends on the symmetric part of the distribution, but the current density depends on the antisymmetric part of the 
same distribution, a natural consequence of the moments viewpoint. 

That the Fermi-Dirac distribution is the equilibrium distribution eases matters considerably, foremost because it 
depends only on the magnitude of k and not the angular coordinates. The quantity k x d k J FD is therefore easy to resolve. 
Recall spherical coordinates: a function g(r) that depends only on the magnitude of r entails 

J xd x g(r)df = X -J Cr-V)g(r)dr= 1 - j rd r g(r)df (7.23) 


Likewise, if g(r 2 ) is the function, then 2 d r g(r 2 ) appears in the integral and similar relations hold for k and V k . Thus, 
k x d k J PD = (2/3 )Ed E f FD = ~(2/3)pES, (IKE - g)) where 


8,(s) = 


(e s + 1) (e~ s + 1) 


(7.24) 


the notation intended to be evocative of the Dirac delta function in that 8,(s) is sharply peaked compared to other terms 
in the integrand, and Sfs)ds = 1. In fact. 


8 l (ME - g)) = f FD (E) (1 ~f PD m (7.25) 

The full width at half maximum (FWHM) 2AE, determined by where 4<5; (pAE) = 1/2, is therefore 2 A E = 2k B Tln(2y/2 + 
3) = 3.5255k B T, or about 10 meV for T = 329.16 K: such a small FWHM allows terms that vary slowly by comparison to 
8 t (s) to be evaluated at E = g and taken out of the integral, to quite a good approximation. 

Recall the relationship between current density J and conductivity a of J/F = qo: conductivity is then given by 

= T~P [ ET)(E)t(E) 8, (ME - M) dE (7.26) 

3 m Jo 

First consider this equation, then evaluate it. From Eq. (7.26), the first factor of f FD (E) is the probability that the energy 
state characterized by E is occupied, that is, there is an electron there to scatter. The second factor (1 - f FD (E)) is the 
probability that it can scatter into a different state characterized by energy E. That is, f FD (E ) is the probability of occupation, 
and (1 —f FD (E)) is the probability of vacancy, and the probability of a scattering event occurring depends on both. But 
only electrons within a few k B T of the Fermi level participate, and so all the terms other than SJE) can be evaluated at 
E — g and pulled outside the integral, a figurative way of speaking about how to deal with sharply peaked integrands 
described in Section Al.2.4. 

The quality of this approximation can be assessed as follows. Change the integration variable to s = ME - g), let u = pu, 
and make use of properties of 8,(s). From Eqs (7.13) and (7.24), a becomes 

a = ^-gT(g)D(g) j (l + ^8Js)ds (7.27) 

For u = Pg 1, 8 t (s)ds = 1/(1 + e~ u ) « 1 — e~ u , which is unity to an excellent approximation. This means that the 
lower limit of the integral can be well approximated by -oo, so that the symmetric nature of 8,(s ) = <5,(-s) can be exploited 
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(Eqs (A1.33)-(A1.35)). If (1 + ( s/u )) 3 / 2 is Taylor expanded, the linear (antisymmetric) term may be discarded because it 
is an odd function of s and its integration with S,(s) vanishes. The next term is of order u~ 2 and therefore small itself. 
Therefore, 



(7.28) 


For a metal like copper (g = 7 eV) at room temperature (T = 300 K), the correction term is close to 1.68 x 10 5 . Conse¬ 
quently, the conductivity is, to a good approximation. 



(7.29) 


and equivalent to the Drude result of Eq. (3.9) except that the meaning of rig) has changed because only the relaxation time 
near the Fermi level matters, that is, the same words are used, but the meaning has become profoundly different. 

7.4.2 Thermal conductivity revisited 

Cold indeed, and labor lost: 

Then farewell heat, and welcome frost! 


William Shakespeare 10 


Thermal conductivity will be easier having gone through charge transport in such detail. Recall that for heat transport, 
electrons from a higher temperature region transport to a lower temperature region and thermalize through scattering 
events. The amount of heat they take with them is understood by reference to Eq. (5.23) for fixed volume (dV = 0) or, 
isolating the heat term dQ, 


dQ = dE — gdN 


(7.30) 


where using E instead of U for energy reflects the parlance of the present section, and where dQ is the quantity of heat 
transported per electron (dN = 1), therefore, each electron transports a quantity of heat of E — g. Thus, instead of charge 
transport q in Eq. (7.21), the transport of heat (h) current makes the replacement q -*■ (E- g), or 



(7.31) 


where/(k) is evaluated for a temperature at a particular location: because the temperature varies with position. 



(7.32) 


where the differential element AT = (dT/dx)Ax has Ax = v x r = hk x r/m on the right-hand side, using arguments familiar 
from Section 3.2. Then, taking J h = x(dT/dx), and the arguments that led to J q in Eq. (7.21), results in 



(7.33) 


Some of the same trickery from the evaluation of a remains useful. First, the integrals can be processed according to 



(7.34) 


where (■ ■ ■) sweeps up all the k 2 (or E) dependent terms. Nevertheless, show it directly by using k x = k sin 0 cos cp so that 


f k 2 dk f* sin 3 Odd fo* cos 2 cpdcp 


1 

3 


(7.35) 


f k 2 dk J* svnOdO ft* dcp 


Second, 


^f PD (E) = -k B p 2 (E - g) 8, (fl(E - g)) 


(7.36) 


10 Ref. [37]: The Merchant of Venice, n.vii.75-76, p. 225. 
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as can be shown explicitly by simply working out the derivatives. As before, terms within the integrand that vary slowly 
by comparison to the peakiness of 8j(s) can be evaluated at the chemical potential and taken out of the integral, resulting 

pr(p) f°° 2 n 2 pr(p) 

k « - T J s8 l (s)ds= — - T (7.37) 

where, to a good approximation, p « (2/3)pD(p), as shown using Eq. (7.12). 

7.4.3 Heat capacity revisited 

Man's capacity for justice makes democracy possible, but man's inclination to injustice makes democracy necessary. 

- Reinhold Niebuhr 11 

Capacity, or the ability to hold a quantity of something desirable, is often of great interest. Heat is one such thing, 
and in a material it can be held by either the lattice of vibrating atoms or the electrons, although each has a different 
capacity for doing so. Their differences affect, for example, how fast heat can transport through a material: generally, 
good thermal conductors make good electrical conductors because energy is transported by free electrons. Insulators, 
on the other hand, which have a paucity of free electrons, have to transport heat by lattice vibrations (phonons), so 
that electrical insulators tend to be thermal insulators as well. The calculation of heat capacity C v , therefore, includes 
a contribution due to electrons (C e ) as well as the lattice (C,), or C v = C e + C{. as in Figure 3.4, at low temperatures, C e 
dominates, and at high temperatures, C, dominates. The heat capacity due to the electrons behaves as C e = yT. At high 
temperatures, C ( - asymptotically approaches the Dulong-Petit value of 3 Nk B . 

Focus on the electrons first. The electron heat capacity C e is 

C e = jfJ ED{E)f PD {E)dE (7.38) 

but to simplify the evaluation of the integral, notice that the integral can be modified to subtract pp without changing 
much, as that product is relatively temperature-insensitive for metals, even more so at the low temperatures where C e 
becomes noticeable. Replacing p by its integral definition, C e can therefore be rewritten as 

c e=jf f (E- d)EKE)f FD (E)dE (7.39) 

As a result, the integration becomes familiar after the approximation T){E) « T)(p) is made, and then removed from the 
integral, 

C e « k 2 B D( M )T J s 2 S,(s)ds = ’jI&fipjT (7.40) 

Comparison with the thermal conductivity factor k demonstrates 

v = ^z(p)C e (T) (7.41) 

3 m 

which are useful when heat diffusion phenomena associated with photo and secondary emission are considered 
[83-85], 


EXAMPLE: Evaluate y = CJT and k for copper-like parameters at 300 K assuming r(p) = 25 fs and p = 7 eV. 
SOLUTION: Using k F = y/lmp/h = 13.555 nm' 1 , CJT = y can be written 


from which k is 


mk F kl , P v t 

= —-J = 4.4031 X 10~ 7 , , = 70.546—f— 

3 h 2 K 2 nm 3 K 2 m 3 


* = -r^JpjCJD = 0.0027105- 


1 Reinhold Niebuhr, The Children of Light and the Children of Darkness. New York: C. Scribner's Sons, 1944, p. xx. 
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Compare the value of y to the value determined from Figure 3.4, found to be 0.00016099 (cal/mole K 2 ). Recalling 
that the ratio of density to atomic mass for copper is 0.141 mole/cm 3 and that 1 cal = 4.1868 J, the fitted value 
corresponds to 95.04 (J/m 3 K 2 ). 


As C e depends only on Fermi level, electron mass, and temperature, of which \x and T are either well-known by 
other means or specified, the discrepancy between theory and experimental values is said to reside in the mass term: 
an electron "thermal mass" is therefore defined by q = m th /m = Y eX p/Ytheory- F° r copper, r) = 1.375, whereas for silver, it 
is closer to 1.0136 [52, 85], Even so, the previous example tends to gloss over that electron scattering and relaxation 
times are temperature dependent and theoretically complex. Something more than approximation by a constant may be 
needed, even though representative values are useful. 

The second contribution to the heat capacity is due to vibrations of the lattice. Without going too deeply into scattering 
(which will occupy Section 32.3.4), observe that the "phonons", or the lattice vibrations represented as a particle, follow 
Bose-Einstein statistics, and therefore a simple model is sufficient to determine the temperature dependence of heat 
capacity due to them. Recall the discussion of Planck's correction (Eq. (3.23)) to the Rayleigh-Jeans formula. Using those 
methods, each of the three directions that characterize a lattice contribute to the density of states. As with Eq. (3.21), the 
phase space volume couples with the relation k = m/c s , with c s the speed of sound, to give for the density of states factor 



(7.42) 


where the longitudinal (cj) and transverse (c t ) sound velocities are presumed to be the same and given by (c s ). 

A difference with the photons of Eq. (3.23), however, is that phonons are lattice vibrations, and therefore the number 
of modes is limited by the number of atoms N, the highest mode begin when the atoms are oscillating out of phase. 
Therefore, an upper limit will exist such that m < a> D , where the subscript refers to Debye, in turn related to the Debye 
temperature T D of a material by hw D = k B T D . In terms of N, then, where the factor of 3 is from the vibration directions. 



(7.43) 


Thus, the Debye temperature can be specified by knowing the density of the metal and its sound velocity due to acoustic 
phonons. As specific heat is the change of energy with temperature 


(/: 




(7.44) 



(7.45) 


Substituting N from Eq. (7.43), 



(7.46) 


The Debye temperature for various materials varies [55] (determined, for example, by the methods of Section 32.3.4); for 
copper, it is T D « 343 K [86]. It is therefore clear that for metals that are at room temperature and higher, the denominator 
of the integrand can be expanded and is approximately x 2 to third order (it is an even function), so that 



(7.47) 


recognized as the high-T limit, or Dulong-Petit result, seen in Figure 3.4. Conversely, when the temperature is low, the 
upper limit may be extended to oo and 1/(1 - e~ x ) » 1 - e~ x . The resulting integrals are well approximated by f(p) and 
the gamma function, or 



(7.48) 



(7.49) 
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EXAMPLE: Evaluate A = Cj/FT 3 for copper-like parameters at 300 K assuming T D = 343 I< and copper values for 
density of 8.96 g/cm 3 and atomic mass of 63.5456 g/mole. 

SOLUTION: The factor (24?(5) + (3/4) = 25.6363 « 282/11. Thus, Q/VT 3 « 2538^/1 IT 3 = 6.7031 J/K 4 m 3 , 
where p t = 0.141 mole/cm 3 . Scaling to the units of Figure 3.4, it is found A = 1.13545 x 10~ 5 calorie/K 4 mole, 
reasonably close to the fit. 


7.4.4 Lorentz number revisited 

Because things are the way they are, things will not stay the way they are. 


- Bertolt Brecht 12 


The Lorentz number is the ratio of Eq. (7.37) with Eq. (7.29). It should be constant, but because of the quantum 
statistics analysis, the form encountered in the Wiedemann-Franz law and crudely given by Eq. (3.19), is now corrected 
to be 



(7.50) 


and shown compared to experimental data as the red line in Figure 3.3. The agreement at higher temperature is remark¬ 
able. 

That tightly packed and charged electrons meandering through a lattice of vibrating ionic cores in a metal can be 
modeled as a non-interacting electron gas in a box without fields with the electrons subject to Fermi-Dirac statistics is 
an inescapable conclusion. Modeling the electrons as a gas in the metal that come to the surface only to be turned away 
by the surface barrier - but not always - results in the emission equations that, like the Weidman-Franz law and the 
specific heat evaluations, enjoy a notable ability to predictively model experimental data. The particle statistics are now 
on firm footing. What now remains to be done before encountering modern versions of the electron emission equations 
is to acquire a comfortableness with the methods of quantum mechanics. 


12 As quoted in John Cook, Steve Deger et al.. The Book of Positive Quotations, 2nd edn. Minneapolis: Fairview Press, 2007, p. 390. 





CHAPTER 8 

Quantum mechanics methods 


I think 1 can safely say that nobody understands quantum mechanics. 


- Richard P. Feynman 1 


8.1 A simple model: the prisoner's dilemma 

[Athenians:].. .you know as well as we do that right, as the world goes, is only in question between equals in power, while the strong do 
what they can and the weak suffer what they must. 

-Thucydides 2 

The peoples of the world must unite, or they will perish. This war, that has ravaged so much of the earth, has written these words. The 
atomic bomb has spelled them out for all men to understand. 

- J. Robert Oppenheimer 3 

At the conclusion of World War II in 1945, America was the sole possessor of atomic weapons. Relations with the Soviet 
Union deteriorated and the Cold War was underway after 1947. After the Soviets acquired their own atomic weapons in 
1949, a fierce and public debate arose as to how the USA should react. The brilliant physicists and mathematicians of Los 
Alamos, the birthplace of the new power, were part of that discussion from the start, with J. Robert Oppenheimer (director 
of Los Alamos) and John von Neumann (a founder of game theory) publicly making opposing arguments. Dealings with 
the Soviet Union were couched in game theoretic ideas, as the utterances of Oppenheimer and von Neumann (who 
could quote Thucydides' excerpt above verbatim) make evident. The choices were to launch a preventive attack, as the 
Athenians did to the Melians and as von Neumann advocated, or take the risk of cooperation to enable mutual survival, 
as Oppenheimer advocated. 

As shown by von Neumann, a rational course of action in a conflict (or game) between two antagonists is always 
possible if their aims are opposed (i.e., if one wins, the other loses). Such games are called "zero-sum", but cooperation 
(and survival) is a different breed. Von Neumann was hired as a consultant by the RAND corporation, where he and 
others applied themselves to strategic studies of intercontinental nuclear war. Two of von Neumann's new collegues at 
RAND, Merrill Flood and Melvin Dresher, devised a simple game for which the best outcome was not the rational one: 
the Prisoner's dilemma (PD) [87], which figures strongly in studies on the evolution of cooperation [88], Here, it affords 
a convenient discussion of ideas that introduce a methodology illuminating to the quantum mechanical treatment of 
electron emission, and so the PD is used to introduce the techniques in a pleasingly simple model. 

The PD [87] is concisely described thus: two criminals are charged with an offense, but held separately. If both do not 
confess, both will be convicted on a lesser charge and serve a year in prison. If both confess, each will serve three years 
in prison. But, if one turns state's evidence (confesses) and the other does not, the confessor will go free and his partner 
spend four years in prison. To avoid being duped, both prisoners have a powerful temptation to confess, even though 
collectively both would do better to trust his partner by cooperating and not confessing: in the parlance of game theory, 
mutual defection is the Nash equilibrium (the circumstance where no player can do better by switching his strategy 
unilaterally). The beauty of the problem is that it can be formulated in the language of a payoff matrix and vector states, 
and thereby in a simple-to-grasp and easy-to-manipulate manner allow for the exploration of recurrent ideas that will 
be subsequently encountered. 

Richard P. Feynman, The Character of Physical Law. Cambridge MA: MIT Press, 1989, p. 129. 

2 J. Robert Oppenheimer, Acceptance speech for Los Alamos Certificate of Appreciation, October 16, 1945, quoted in W. Poundstone [87], p. 70. 
3 "Melian Dialogue'', quoted in Thucydides (trans. R. Crawley), The Landmark Thucydides: A Comprehensive Guide to the Peloponnesian War, ed. 
Robert B. Strassler. New York: Simon and Schuster, 1998, p. 352. 
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Although the PD is traditionally formulated as a parable involving the unscrupulous, it is easier to make the payoff 
matrix actually refer to payoffs and to speak of players and a game show host: instead of serving a sentence, let the players 
gain a reward depending on whether they cooperate (act in good faith) or defect (betray the other player). 4 The payoff 
matrix then refers to the amount the host pays out: if the players cooperate, the host pays 6 units (3 units to each player), 
if both players defect, the host pays 2 units (1 unit to each player), and if one player defects and the other cooperates, 
the host pays 4 units (all to the defector, none to the cooperator). The essence remains the same: there is an incentive to 
defect (or cheat) and gain a higher reward, in addition to a strong incentive not to be a dupe (one who cooperates when 
the other player defects) where there is no gain at all. In the bigger picture, observe that if both players cooperated, the 
host would have to pay out the most and the players' collective gain would be the greatest, an outcome John Stuart Mill 
would embrace but Ayn Rand would excoriate. 

In contrast to game shows where contestants interact with each other only once, a richer game is afforded when they 
repeatedly interact: such a game is called an iterated PD. In a computer simulation that pitted different strategies against 
each other, one strategy worked surprisingly better than the others and was dubbed "tit-for-tat". This strategy cooperated 
in the first round, but then simply did what was done to it in subsequent rounds: it cooperated as long as it was cooperated 
with, but when it suffered a defection, it defected on the next turn. A strategy that performed better in the long haul was 
tit-for-tat with occasional forgiving of a defection. Altruism, even in this very limited sense, can perform well. Seeing 
why this is so is possible using the language of matrix operators and bra-ket vectors. 


8.1.1 Matrices and vectors 


A great deal of my work is just playing with equations and seeing what they give. 


- Paul A.M. Dirac 5 


Using the notation introduced in Eq. (4.8), let \u) refer to a cooperator, and | d) to a defector but now both players 
must be considered, so that the state vector of the players \y/} must take both into account. The cases where they are 
both cooperating or both defecting are straightforward to write: 


'"“ >= A (!“>)'A 




A 


(8.1) 


(8.2) 


where the factor l/^/l assures that (uu\uu) = (dd\dd) = 1 , that is, the states are "normalized" and orthogonal in that 
(uu\dd) = 0. A similar reasoning shows that when one player cooperates and one defects, two more state vectors arise 
depending on who does what: 


Y 


V 

0 

; 1*0 = 4 = 

1 

0 

i 

A 

1 

0 


(8.3) 


Here is the rub. To construct the payoff matrix, which shall be designated H to bring to mind in the future the Hamil¬ 
tonian, four such vectors \y/j) for; = 1 to 4 are needed, with the first two, |y/,) = \uu) and | y/ 2 ) = \dd), already found. But 
| ud) and \du) are not the other two. 


4 A British daytime game show called Golden Balls, airing between 2007 and 2009, contained just such a PD game but with a different payout 
matrix and an intriguing wrinkle: the contestants could talk beforehand and agree, or pretend to agree, to cooperate, making the betrayals all 
the more acute. 

5 PauI A.M. Dirac, quoted in the essay "A Piece of Magic: The Dirac Equation", Frank Wilczek, contained in ref. [89], p. 102. 
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Therefore, find vectors |y/ 3 ) and |yr 4 ) such that they are orthogonal to each other and those of Eqs (8.1) and (8.2). That 
is rather straightforward: change the sign of one of the non-zero elements, or 



as the two remaining orthogonal states. H is then constructed from the orthogonal basis states by 

H = CjJvqXvql + C 2 \w 2 )(v 2 \ + CjlV'aXV'jl + QIvgXuqI (8.5) 


where \yfj)(yfj\ are matrices, in contrast to (yfj\Wj) = b which is a scalar. Observe that this entails 


I ud) = -(Wi) + W 2 ) + tr 3 ) - lv 4 )) 
I du) = i(|vri) + \y/ 2 ) - \y/ 3 ) + W A )) 
| ud) + | du) = lyq) + | yr 2 ) = \uu) + \dd) 
The matrices are straightforwardly evaluated, for example 


(8.6) 

(8.7) 

(8.8) 


V'iXvhl = 


0 0 0 0 


0 0 0 0 


in which the column element of the first vector multiplies the row of the transposed second vector to give the row of the 
matrix. Then, by explicit expansion of Eq. (8.5), 


0 0 
0 0 


if C 3 = C\ and C 4 = C 2 are chosen and, from the payoff values, C, = 6 and C 2 = 2 arise because H\uu) = 6| uu) and 
H\dd) = 2\dd). As a result, the problem has been solved in the reverse of its usual narrative encountered when, for 
example, given the Hamiltonian of a dynamical system, its eigenvectors are found, in terms of which the Hamiltonian 
is expressed as a diagonal matrix of its eigenvalues (see, for example, the discussion of Euler's theorem and rigid body 
motion in Goldstein [90]). Finding eigenvectors from eigenvalues, as is more common, is considered next. 


8.1.2 Eigenvectors of tit-for-tat 

Wir mtissen wissen/Wir werden wissen. 

- David Hilbert 6 

Now extend the matrix methods to consider the time evolution of stable strategies. The power of such methods was 
appreciated early on by David Hilbert, whose deep and profound appreciation of matrix mechanics in particular, and 
mathematics in general, enabled him to advise Heisenberg and Born to seek out a differential equation with the same 
eigenvalues when they were unsure why matrix mechanics appeared in the physics of the atom, as in Section 9.1. Had 


6 "We must know, we will know." From David Hilbert's 1930 address to the Society of German Scientists and Physicians, reproduced and 
translated by the Mathematical Association of America. Available online at http://www.maa.org/book/export/html/336071 (trans. James T. 
Smith, San Francisco State University). It was a stinging rebuke of the maxim ignoramus et ignorabimus ("We do not know and will not know") 
that was fashionable at the time, and unfortunately still is. 


www.ebook3000.com 
















76 Chapter 8 


Heisenberg followed Hilbert's prodding, it is speculated that he may have come across Schrodinger's equation before 
Schrodinger. 7 

The matrix U defines how the tit-for-tat strategy is implemented. Let two players repeatedly interact with each other 
over time. If a superscript designates the interaction number, for example \y/(t n )) = \y/ n ), where t„ is the nth interaction 
(or time step) of the game, then 

IvO = U\ ¥ n ) (8.11) 


The task is then to ferret out the mathematical meaning of "do what was done to you". A moment's reflection reveals 
that the matrix should be 


r 0 0 1 0 
0 0 0 1 
10 0 0 
k 0 1 0 0 


( 8 . 12 ) 


as follows from (using the original ud notation) U\uu) = \uu), U\dd) = \dd), U\ud) = \du) and U\du) = \ud ): what U does 
is swap the the states of the players. Clearly, there are three states where the action of fj does not change the vector 
describing the state of the players, or tj\cp) = \cp), and they are | uu), \dd), and (| ud) + \du))/\fl. Observe that V is such 
that 

Wu=(j 2 =I (8.13) 


where f is taken to mean transpose and complex conjugate (or "self-adjoint" in the appropriate parlance), although a 
matrix with real entries begs what is meant by complex conjugate, and U~ l = U anyway: the stage is simply being set for 
later. 

The question is then: what eigenvectors \q>) are associated with U and how do they compare with the eigenvectors 
\y/j) of H? Finding the eigenvectors of a matrix is a standard problem in linear algebra (e.g., see refs [60, 91]). The matrix 
equation 

0 - Xt)\q>) = 0 (8.14) 

only has solutions if the determinant \ U — Xl\ =0. That is, 

-2 0 1 

0—2 0 
1 0 -2 

0 1 0 

Thus, for U, the eigenvalues are degenerate (more than one of the same value occurs) and are equal to ±1, or 2 ; - = (-1) 7 for 
j = 1 to 4. Each eigenvalue 2 ; - is put back into Eq. (8.14) where the components of \q>j) are unknown and the components 
solved for. 


= (2 - 1) 2 (2 + l) 2 = 0 


EXAMPLE: Find the eigenvectors associated with each 2 ; - = (—iy. 
SOLUTION: Let {cpj\ = (a, b, c, d). Then from Eq. (8.14) it follows for 2, that 



7 See Manfred R. Schroeder, Number Theory and the Real World, The Mathematical Intelligencer 1 (4), 18, 1985. Although the often alleged claim 
that Hilbert also said "physics is much too hard for the physicists" is questionable (see L. Corry, David Hilbert and the Axiomatization of Physics 
(1898-1918): From Grundlagen Der Geometrie to Grundlagen Der Physik (Archimedes), Vol. 10, softcover reprint of hardcover 1st edn, 2004, Springer, 
2010, footnote 11, p. 8), he might be forgiven had he done so. 
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or c = —a and d = —b. Setting a = 1 and b = 0 is the first choice, and it shall be called X x , with the choice a = 0 and 
b = 1 being the other to be called It is simple to show that the eigenvectors associated with X 2 = +1 correspond 
to c = a and d=b. Thus, 



' 1' 


Y 


' 0 ' 


o' 

I<P1> = 

0 

-1 

, W 2 ) = 

0 

i 

- I<?3> = 

1 

0 

, 1 (Pa) = 

1 

0 


^ 0 ^ 


0 


-1 


1 


Drawing a correspondence to the eigenvectors of H, it is seen that | q>{) = |yr 3 ), \<p 2 ) = Ivq}, | cpf) = W 4 ), and 
Wa) = IvqK 


It has been found that the eigenvectors of U are the same as those ofH, and that will have consequences. When that happens, 
the combination HU - lift (called the commutator of H and U and designated [H, U\) vanishes. Because the eigenvectors 
are associated with steady-state solutions, that means the value of the payoff matrix H applied to them will not change 
from one interaction to the next. Formally, when the commutator vanishes, 

(ys n+ 1 \H\xp n+l ) = (ys n \WHUW n ) = (ys n \u\[H, Ul+UEQ \xp n ) (8.17) 

= { V n \tfUH\ V n ) = {tp n \H\ V n ) (8.18) 

Below, an analogy will be drawn between the Hamiltonian operator and the payoff matrix, as well as between the 
propagation operator and the iteration matrix, and it will be demonstrated that the time variation of an operator 0 will 
be related to its commutator with the Hamiltonian. But not yet. Here, the point is to show the mechanics of an analogous 
but far simpler system. 


8.1.3 Quid pro quo and stable strategies 

Margaret: 1 cry you mercy, 'tis but quid for quo. 


- William Shakespeare 8 


Menelaos: Zeus, lord, grant me to punish the man who first did me injury 

- Homer 9 

Ahab.- To the last I grapple with thee; from hell's heart I stab at thee; for hate's sake I spit my last breath at thee! 

- Herman Melville 10 

Quid pro quo is latin for "something for something" or, more colloquially, "tit-for-tat", and indicates an exchange of 
comparably valued objects or actions. But there are differences in how tit-for-tat strategies play out. 

• If players believe they are to meet but once, the temptation of both to violate each other (or defect) is strong, and if 
acted upon then subsequent interactions, foreseen or not, continue the defections. This represents the "Ahab" initial 
condition of mutual defection (Ahab's harpooning of Moby Dick and the the whale's amputation of Ahab's leg). 

• Were continued interactions plausible, one player might opt to initially cooperate and be soured by the other player's 
betrayal. This represents the "Menelaos" initial condition of one-sided cooperation (hosting a royal visit to suffer Paris 
making off with Menelaos's wife Helen) followed by the so-called "death spiral" of unending and alternating defections. 

• Beneficent actions or those foreseen to be long term greatly and stably benefit if both actors do the right thing the first 
time. This represents the "Margaret" initial condition of mutual cooperation, a delivering on both sides of "mercy", 
and results in the most desirable long-term outcomes for both parties. 

Observe the average payouts over time: mutual defection produces an average gain of (1 unit/interaction), alternating 
defection a gain of (2 units/interaction), and mutual cooperation a gain of (3 units/interaction). All these strategies were 
manifest in the Axelrod computer tournaments, and they are mathematically represented by the unchanging actions of U 


8 Ref. [37]: The First Part of King Henry the Sixth, V.iii.109, p. 468. 

9 Homer, The Iliad (trans. Richmond Lattimore). Chicago: University of Chicago Press, 1962, Book III, line 351. 
10 Herman Melville, Moby Dick. New York: Bantam Books, 1986, Chapter 135. 
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on the eigenvectors \uu) (Margaret), \dd) (Ahab), and (| ud) + \du))/sfl (Menelaos) that results in the same unchanging 
payouts over subsequent interactions. Unless something changes. 

What can change is the act of forgiveness (or the act of selfishness, but it is more pleasant to focus on the positive), 
which would convert the never-ending betrayals of Ahab into the somewhat better alternating betrayals of Menelaos and, 
more importantly, converts the feuding of Menelaos into the mercy of Margaret. The phenomenon is not so academic, 
particularly in war-time settings, where the spontaneous withholding of fire (or directing it to where it does no damage) 
can be mutually reinforcing: instances abound of spontaneous benevolence between opposing forces, a poignant example 
being the Christmas truce of 1914 on the Western Front during World War I: 

As darkness shrouded the wire on Christmas Eve, coloured lights appeared on the German parapet in some sectors held by Bavarian and 
Saxon troops. Small trees, again decorated with lights, were seen above the sandbags. A few exuberant British soldiers took potshots at 
the novel targets and were cheered by the enemy when a hit was scored. In other parts of the line, impromptu carol services attracted the 
attention of the British, who applauded the singers and responded with their own efforts.. .British working parties, moving cautiously in 
frozen no-man 's-land, were greeted with friendly shouts from the enemy trenches and ended their jobs puffing on cigars proffered from 
the German positions ...An English lance-corporal observed, “the Germans had a band playing.. .several hymns, our National Anthem 
and Home Sweet Home. " l 1 

Compare the end of that description to an eerily similar event almost 50 years earlier during the American Civil War: 


By the evening of December 30 1862, the two armies confronted each other in battle formation straddling a shallow stream called Stones 
River, just north of town. That evening as the soldiers awaited the battle that was sure to open next morning, bands on both sides of the 
line took turns serenading the armies, who could clearly hear both the Rebel and the Union musicians. Finally, one of the band struck 
up “Home, Sweet Home " and all of the others on both sides joined in as the soldiers sang along, concluding the concert with a poignant 
reminder of the American culture the contending forces shared. 12 


The face-off between soldiers of opposing armies is a PD [88], but something like forgiveness in the form of restraint 
can continue and expand once expressed. How would it be modeled? It would look like the insertion of a matrix with 
an off-diagonal element in the block representing the player that is forgiving. If it were player 1, then the matrix might 
look like 


110 0 
p _ 0 0 0 0 

1 " 0 0 1 0 

0 0 0 1 


(8.19) 


which acts to shift player l's state to that of cooperation regardless of the acts of player 2. It is a different kind of matrix 
than the payoff matrix or the interaction matrix: it does not commute with them. In fact, it kind of looks like scattering. 
Keep that in mind. 

The discussion of the PD underscores two central matters in quantum mechanical problems. First, the initial conditions 
specify the subsequent dynamics of an interacting system, and although the equations can look similar, the outcomes 
can be unique. Second, knowing the future dynamics requires finding the complete set of commuting constants of the 
motion [44] for which eigenvectors, or combinations of them, are followed into future states. 


8.2 Matrices and wave functions 

For a short time it appeared that there were two new theories of the behavior of matter on the subatomic level, but it was soon discovered 
that Heisenberg's quantum mechanics and Schrodinger's wave mechanics were mathematically equivalent, even though they looked quite 
different. This seems to have pleased neither founder of quantum mechanics. Each intensely disliked the other's approach . ..Heisenberg 
remarked that Schrodinger's theory was “loathsome" and “crap." Schrodinger, on the other hand, charged that Heisenberg's method 
was “disgusting." 

- Richard Morris 13 


n D. Winter, Time Off From Conflict: Christmas 1914, The RUSI Journal, 115, 1970, 42-43. 

12 Steven E. Woodworth, This Great Struggle: America's Civil War. Lanham, MD: Rowman & Littlefield, 2011. p. 189. 

13 Richard Morris, Achilles in the Quantum Universe: The Definitive History of Infinity. New York: Henry Holt, 1997, p. 91. The tale of their competition 
is even stranger than suggested here: see Arthur I. Miller Erotica, Aesthetics and Schrodinger's Wave Equation, in ref. [89], p. 80. Physics is not without 
passion. 
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In the wave mechanics approach, the wave function yr(r, t) is acted upon by differential operators, and is therefore 
likely more familiar to those with exposure to differential equations. In that formulation, Schrodinger's equation takes 
on the familiar form 

ihd t¥ (r, t) = H\f/(r, t) (8.20) 

where the Hamiltonian H = H 0 + V is a differential operator for which the kinetic energy is H 0 = —(h 2 /2m)V 2 . However, 
as Dirac pointed out, the symbolic method using ket vectors like |i p) and operators like H on which the discussion of 
the PD was based is equivalent, possesses an undeniable beauty and compactness, and seems to, as Dirac opined, "go 
more deeply into the nature of things" [40]. The matrix analogy makes intuitive that the order of operators matters 
(unless their commutator vanishes) and benefits from an appreciation of terms like eigenvector and eigenvalue. Most 
importantly, it gives a powerful and compact method to go from one representation or basis to another. In the symbolic 
notation, Schrodinger's equation becomes 

ihd t W(t)) = HW(t)) (8.21) 

Although they look similar, switching to a momentum k representation using y/(k, t ) and H 0 = h 2 k 2 /lm is an ordeal using 
Eq. (8.20) and straightforward using Eq. (8.21). The form of Eq. (8.21) makes it clear that if MO) is an eigenvector of H 
with eigenvalue E, then MO) = exp(-iEt/h)\\//(Q)), and therefore, that 

U(t) = e- lkt/h (8.22) 

where the exponential of a matrix operator is a matrix of the exponential values of the operator's action on the 

wave function. Because the commutator of a function of an operator, or f(fi) with that operator vanishes, as can be 

proven by representing f(H) as a Taylor expansion in terms of H, it is clearly the case that ( U(t), H\ = 0, analogous 
to the PD discussion. Of greater importance here, the wave function, as per Eq. (8.21), is clearly seen to satisfy the 
relation 

MO) = 0(0 MO)) (8.23) 

from which several relations follow (recall MO) is an eigenstate of H) 

MO I # MO) = E= <vr(0)|H|vr(0)> (8.24) 

I + f 2 )> = U(h)\v(t 2 )) = Uih + t 2 )|vr(0)> (8.25) 

MO MO) = MOIIOCO^COMO)) = M0)M0)) (8.26) 

Observe that Eq. (8.24) implies the commutation relation [U, H] = 0. The connection between the symbolic method of Eq. 
(8.21) with the wave mechanics approach of Eq. (8.20) is through the basis states for position r and momentum p = hk, 
which are conjugate variables (loosely, related by Fourier transforms) such that they satisfy an uncertainty relation due 
to Heisenberg, for example AxA p x > Ti/2 for the x component, and another example being energy E and time t, more 
about which in Section 8.2.3. In d dimensions, the relations between the kets are given by 

(r\k) = (2nY A/2 exp (ik ■ r) (8.27) 

(r|r ') = S(r - r') (8.28) 

(k\k!) = 8(k-i<!) (8.29) 

The formalism is up to the task of finding the emission equations developed by the 1930s handily, but an understanding 
of why these relations hold has benefit in subsequent improvements of those theories. That treatment is not beholden 
to the three-dimensional formalism, though. In fact, the one-dimensional formalism is a lot more convenient. Thus, let 
x and k x -*■ k (i.e., suppress the x subscript on k x ) be the conjugate variables and restrict attention to one dimension. The 
generalization to higher dimensions is straightforward and can be stated as needed. 
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8.2.1 Identity and Monty Hall 


Tv&ei aavrov ("Know Thyself") 


- Inscription on the Temple of Apollo 


In Phaedrus, 14 Socrates asserted "I must first know myself, as the Delphian inscription says',' referring to the engraving 
on the Temple of Apollo at the famed Oracle of Delphi (a structure existing in antiquity but brought to ruin for sectarian 
motives by 390 AD). Sage advice, even though Socrates was likely taking liberties with the meanings of the words. 15 
Why is that important here? Identity operators are not as simple as multiplying by unity: they tell us something about 
the relation of probability estimates to the basis states. 

Intuitively, the propagator U(t) is the analog of the iteration matrix U of Eq. (8.12), and that suggests an analog of the 
identity matrix I: one might expect, based on Eq. (8.5), that I should resemble something like 

I = (8-30) 

What is so special about \y/j)? Or, put another way, couldn't another basis set be used? The answer is of course: the 
normalized orthogonal states characterized by |;) corresponding to a column vector of zeros except for the ;th entry, 
which is unity (e.g., (2| = (0,1,0,0 )/\[d for N = 4) is a different basis, and it is trivial to explicitly show 

YjWkHXjWi) = = hi (8-31) 

;= 1 ;=1 

where the \y/} are given by Eqs (8.1), (8.2), and (8.3). A modification of the basis states is still constrained by a require¬ 
ment analogous to that of Eq. (8.30). The choice of basis state is dictated by other concerns, such as diagonalizing the 
Hamiltonian. Relations much like this one shall be encountered when the basis states refer to position and momentum 
(see Eq. (8.73)). 

The relation between quantum mechanics and probability is such that intuition developed from tackling probability 
problems pays handsomely in understanding quantum mechanical notation and operations, and concepts. Apart from 
the pedagogical value, a perk is that the irritation of resolving probability paradoxes, where numerical calculation is often 
at odds with intuition, forces a deeper understanding. Consider, therefore, a seemingly routine example in which there 
are N containers such that Pj = (<Pj\<Pj) is the probability that a single randomly hidden object is in the ;th container, with 
all other containers empty. Let the probability that the object was hidden in any of the containers be equally probable 
(uniformly distributed). This is mathematically described by saying the weighting factors Wj associated with each basis 
state are given by unity for all;. Thus, by construction, 

(<U2) 

Let the containers be arranged in a circle, and let one container chosen at random correspond to |1), with |2) being 
the next one clockwise, and so on, as schematically shown in Figure 8.1, where the figure has been rotated so the 
selected container is at the top. The act of opening any one of the containers at random and peering inside to find it 
empty is equivalent to setting its weighting factor to zero and replacing the weighting factor associated with the unopened 
containers to 1/-0V - 1. Such a description is fairly routine, intuitive, and uncontroversial. 

But here is where it gets interesting: instead of choosing from all the containers, what changes if the person who 
hid the object in the first place opens a container selected from all except the first container ? That is, the randomly cho¬ 
sen container | k) with 2 < k < N is exposed as empty, but 11) is not among those selected. How then are the weighting 


14 Benjamin Jowett, The Dialogues of Plato, Great Books of the Western World, ed. Robert Maynard Hutchins, Vol. 7, Plato. Chicago: Encyclopaedia 
Britannica, 1952, p. 116. 

15 To sages before Plato the meaning may have been closer to "Know your measure" and therefore a caution against pride. See Eliza Gregory 
Wilkins, "Know Thyself", in Greek and Latin Literature. Chicago: The University of Chicago Libraries, 1917. 
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Figure 8.1 Four containers, one chosen at random to hold a prize. Containers are indexed clockwise with a container arbitrarily chosen by the 
contestant (in the center) rotated to the top position and labeled "1". The state vector \<p) is such that P, = (cp,\<p,) is the probability that the jth 
container holds the hidden object. 


factors redistributed over the remaining bases given that the /ah container was empty? 16 Basis states bear a strong resem¬ 
blance to vectors, and the fixing of one basis state for special treatment and eliminating the /ah state is equivalent to 
the rotation of a coordinate system about a chosen axis until a vector cp no longer has a component along the k axis 
(alternately, a projection of q> onto the k axis vanishes, or q> ■ k = 0). In the same way, \cp) no longer has a projection 
along | k) because w k = 0. The question is therefore rephrased as an inquiry into what changes when a rotation of the 
basis states with 11) held fixed is performed. Consideration of the case N = 4 and a liberal usage of Euler rotations for the 
three states up for rotation makes the consequences plain [60, 90]. A general rotation matrix M = R z (y) ■ R y (P) ■ R z («) is 
composed of 17 


R z (0) 


10 0 0 

0 cos # sin# 0 

0 -sin# cos# 0 

0 0 0 1 


R,(#) 


10 0 0 
0 cos# 0 sin# 
0 0 10 
0 -sin# 0 cos# 


(8.33) 


(8.34) 


Because the rotation matrices are such that ■ R 5 = 1 for 8 = y or z, it follows M : ■ M = J. The action of M on \cp) can 
be constructed so as to change the weighting factor of one of the bases to 0, as shown in the next example. 


16 The choice of N = 4 here was for continuity with earlier sections, but when N = 3, the problem is commonly known as the Monty Hall 
problem. The version here is simple and classical Monty Hall rendered in quantum parlance and one door shown rather than N -2 doors 
shown (see the second example in Section 8.2.1). Allowing the players to make quantum strategies [92] makes for a more interesting puzzle, 
and the notation will be familiar from the present treatment. 

17 The arrays are 4x4 because one basis state is held fixed: the remaining three correspond to the usual three-dimensional rotation of the axes 
in the Euler angle prescription. 
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EXAMPLE: Show the action of M on \cp) for the choices a = n/A, /? = n/2, and y = n is to set u 
also sets w 4 -*■ l/yjl. 

SOLUTION: By explicit substitution and sin0/4) = cosO/4) = V 1 / 2 , 


0. Show that it 




o 


o 


V 2 


It follows ( cp'\(p') = {cp\cp) = 1, as required. 


0 l 


Vi -Vi 
Vi Vi 


EXAMPLE: Find the rotations necessary to render w 2 = w 3 = 0, and find the resultant effect on w 4 . 

SOLUTION: The action of R ? 0/4) on \q >) gives \q>') = R z O/4)|<p) = (j) (l V 2 0 l) (where the trans¬ 
pose t is used for economy). This suggests that acting on | q>') by R y (arctan(v'2))|<p') = \q>") results in 
| <p") = 0 0 \/3^ , as can be shown by direct computation. Therefore, w 4 = -^3/4. 


There remain two matters to wrap up. The first is to investigate what bearing the previous examples have on the Monty 
Hall problem, and the second is to specify what any of this has to do with quantum mechanics in a way that relates to 
emission. 

The Monty Hall problem is based on a game show in which the contestant is presented with three curtains, behind one 
of which is a valuable prize, and behind the other two are winnings of little or no value. Once the contestant picks a curtain 
(say Curtain 1), the game show host reveals what is behind one of the two curtains that were not selected (say Curtain 
3). The contestant is then given the opportunity to change her selection to the remaining unchosen curtain (Curtain 2), 
or keep her original choice (Curtain 1). The question is then, what should she do? The problem has generated a large 
literature, and the phrasing of the problem proves deeply important, but for the simple paraphrasing of the problem given 
here, the optimal and counterintuitive strategy (the one most likely to secure the valuable prize) is for the contestant to 
abandon her choice of Curtain 1 for Curtain 2. It is counterintuitive in the sense that when presented with the problem, 
many people believe the actions of the game show host do not change the probabilities of the valuable prize being behind 
Curtains 1 and 2, which were originally equal, and which are assumed to remain equal. 

Insofar as the Monty Hall problem is an N = 3 version of the previously considered container problem, it is clear that 
the act of revealing what is behind one of the two unchosen curtains is tantamount to rotating the probability vector 
| (p) such that one of the weighting factors vanishes while the weighting factor for Curtain 1 is held fixed. But this is 
accompanied by a readjustment of the weights to make the sum over probabilities for the primed case equal unity just 
as for the unprimed case, or 

WW) = HR t R|<P> = {<P\<P) = 1 (8.35) 

Why is this a paradox to many people? Simply put, it is because P Y = 1/3 but P 2 now equals 2/3: after being shown an 
empty container, the contestant's best strategy is to switch her choice, as the probability she shall chose the prize doubles. 
This result is deeply unintuitive: most people (some vehemently) will maintain that the probability for each curtain is the 
same, or 1/2. The appreciation of why it is not is the gift the paradox provides, and it may help to reconsider the problem 
as a question of measurements. Before the host reveals an empty curtain, the probability that the prize is behind the 
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curtain chosen by the contestant is 1/3 and the probability that it is not is 2/3 (a factor of 1/3 for each of the other two 
curtains). A measurement reveals information, and having the host reveal an empty curtain is taken as a measurement. 
But is it really? The contestant knows that one of the two curtains that the host can chose from is empty. The host shows 
an empty curtain. Has his doing so changed her knowledge? No, it has not. How do we know that? The initial state 
vector is 

w=,1>+lT>= 7f(i) + ^(;) ,836) 


where the bar over |1) means "not |1)", or |1) = |2) + |3). Now the host reveals the empty curtain, that is, seems to make 
a measurement. If Curtain 1 has the prize, the probability is 1 /2 that the host will reveal Curtain 2 and the same for 
Curtain 3. If Curtain 2 has the prize, the probably is unity that the host will reveal Curtain 3, and if the prize is behind 
Curtain 3, likewise, the probability is unity that the host will reveal Curtain 2. Therefore, the alleged "measurement" of 
the host is represented by a matrix M given by (compare Eq. (8.5)) 

M = ~{|1><2| + 11 >(31} + |2)(3| + 13) < 21 (8.37) 


and is read as "If contestant chooses 1, host shows 2 half the time and 3 half the time (the first two states); if the 
contestant chooses 2, host shows 3 (third state); and if the contestant chooses 3, host shows 2 (fourth state)". The 
advantage of Eq. (8.37) is that it is coded as in the algorithm given in Section A3.4, and can show the evolution of the 
winning averages over time, as in Figure 8.2. The phrasing is a consequence of the curtains being labeled, whereas in the 
previous formulation the labeling was fixed by the selection process. The names have changed, but the problem is the 
same. So, what does the measurement change? Nothing. If one works it out, M| cp) = \cp), but to show it explicitly: 


(11 < ? , > = (1 |M|<p) 


0 

1/2 

1/2 

1 

0 

0 

1 

1 

0 

1 

0 

1 

0 

1/2 

1/2 

1 
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0 

1 

1 

0 

1 
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1 


(8.38) 


(8.39) 



Game 


Figure 8.2 Monty Hall average winnings versus played games. The average number of winnings for the stay and switch strategies in a Monty Hall game 
as calculated using the algorithm of Section A3.4. 
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where the first is the probability the prize is behind Curtain 1, and the second is the probability that the prize is not behind 
Curtain 1, following the actions of the host. Further, 

WW) = (<p|M t M|<*>) = {cp\(p) = 1 (8.40) 

Although M|<p) = | cp) makes M look like an identity operator, it is not. It is more like a rotation (something like L z not 
changing |i/r) as far as L 2 is concerned in Section 9.1.2), and a consequence of the fact that the \j) from which | cp) is built 
are equally weighted. 


8.2.2 Conjugate variables 

(the) uncertainty relation specifies the limits within which the particle picture can be applied. Any use of the words “position " and 
“velocity" with an accuracy exceeding that given by Eq. (1) (AxA p x > h) is just as meaningless as the use of words whose sense is not 
defined. (Footnote: In this connection one should particularly remember that the human language permits the construction of sentences 
which do not involve any consequences and therefore which have no content at all...) 

- Werner Heisenberg [93], p. 15 

Let \x n ) be a discrete basis such that the action of the position operator X on it produces the eigenvalue x n , the upper 
case notation to reinforce that it is not being identified with the position operator any more than x n is being identified 
with position (a rather duplicitous caveat, as they will clearly be so identified below). Let another operator K act to 
increment the index n < N, or 


X\x n )=x n \x n ) (8.41) 

K\X„) = \x n+l ) (8.42) 

If position is periodic or loops back on itself, much like the ring of curtains in Figure 8.1, so that x N+ j = Xj, then it is 
obvious (perhaps after a small bit of pondering and the realization that {Xj\x t ) = S kl ) that K is 

N 

*= I>«+i>< x »l (8-43) 


EXAMPLE: Prove K N = I. 

SOLUTION: Using mathematical induction, first show that K n = \x n+] -){Xj\. Clearly, it is true for n = 1. Assume 

it is true for n then prove true for n + 1: 

K n+1 = KK n = £ 2l* J+1 ><x,|x ;+ «X*/l 

i= i '=i 

= EiwiX*/i 

as required, where (xj\X[) = 8 jt is used. Setting n + 1 N and using 

?=Xl*„X*„l (8.44) 

as anticipated by Eq. (8.30), completes the proof. 


EXAMPLE: Prove K n K m = K n+m . 

SOLUTION: The previous example makes clear KK n = K n+l . Therefore, K n k m = I< n ~ 1 K m+1 . Repeating the process 
n times gives the desired conclusion. Alternately, using K m \Xj) = \x j+m ), it follows k n k m \Xj) = K n \x j+m ) = \x j+m+n ). 
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The operator K has its own eigenvectors and another operator X can be constructed that increments \k n ) in the same 
way, or 

K\k n ) = k n \k n ) (8.45) 

X\k n ) = IK+i) (8.46) 

although the second relation must be proven below rather than baldly asserted here, but as a consequence, an equation 
for X similar to that of Eq. (8.43) holds as well. The relation K N = 1 has consequences: it demands that = 1, which 
identifies 

k n = exp(i2nn/N) (8.47) 

Those familiar with Chekhov's gun 18 know full well the notational choice of k n and X is not happenstance, but for 
now treat them as general. Knowing what k n is allows for the identification of the C" in the relation 

\K) = Y, c ]K) (8-48) 

where the n superscript is for now an index but, as its placement suggests, will be identified as a power. The expression 
K\k n ) can now be expressed two ways: using the eigenvalue relation of Eq. (8.47) or using the definition of Eq. (8.43). 
Equating the two different forms shows that 

N N N 

e ann,N g ^ = £ £ C’^XxJx,) 

but knowing that {Xj\x t ) = 8j t entails C" <x exp(-i2xjn/N). The realization that with such a C", it follows that the kets \k n ) 
and \x n ) are Fourier transforms of each other, suggesting that the normalization should be the conventional 1/y/N. Thus, 

\k n ) = ^='Z e - ,2 * in/N \x n ) (8.49) 

VNj= i 

Identifying x n = na entails k n = 2nn/{Na). The IV -*■ oo, a -*■ 0 limits such that Na = L is finite, plus the generalization to 
three dimensions, recovers Eq. (8.27). More importantly, it is seen that 

K = e iai = ^^(-iaky (8.50) 

The designations x n and k n and the actions of K and X to increase the indices of x n and k n kets bring to mind position 
and momentum in a box of length L (and with sharp discharge, Chekhov's gun fires), but one also could have been 
discussing just as easily time and energy. Parameters which share a relation given by Eq. (8.49) are "conjugate variables." 
An eigenstate of one of them cannot simultaneously be an eigenstate of the other, and as a consequence Heisenberg's 
uncertainty relationship AxA k >1/2 constrains the magnitude of spread in the values they can simultaneously take on. 

The commutator of x with K can be used to deduce what the all-important commutator \x, k\ is. A straightforward 
manipulation shows that (using x n+l = x n + a) 

[x, K] \x n ) = (xk - Kx) \x n ) 

= ( x n + a)\x n + a) -x n \x n + a) = a\x n + a) (8.51) 

but for vanishingly small a, then a\x n + a) « a\x n ) and K « I + iak using the leading order approximation to the exponen¬ 
tial in Eq. (8.50). Taken together, this implies [x, K\\x n ) « ia[x, k\\x n ) = a\x n ) and so 

[x,k\ = i (8.52) 


18 "Chekhov's gun" is a rule in dramatic writing that if a gun is shown on stage or mentioned in the first chapter of a literary work, it must be 
fired in a later scene or chapter, otherwise it should not be shown or mentioned. And, yes, it is most certainly gun, not phaser. 
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That is a pretty convenient relation: the commutator of the operators x and k results in a number (albeit an imaginary 
one), not an operator, and this can be used to establish the relationship more firmly than relying on the smallness of a 
by showing that if Eq. (8.52) holds, then Eq. (8.51) is recovered for arbitrary a. Using the series expansion of K directly 
gives 

Vx,t\ = ^j [ {~iay[k,y} (8.53) 

Applying the commutator Eq. (8.52) results in xti = + k!x, easily proven using mathematical induction, which, 

when inserted into [x, V\ results in 


[x, k J ] = 0- 1 (8.54) 

[x, IC] = a £ ^l^i-iakr 1 = ak (8.55) 

a result seen to be identical to Eq. (8.51) because \x n + a) = K\x n ), confirming again that the commutator relation of Eq. 
(8.52) is valid. As a side benefit, Eq. (8.54) implies that the commutator of an operator 0(£) (defined in terms of its Taylor 
expansion in k) with x is equivalent to a derivative of that operator, or 

[x, 0(A)] = id- k O(k) (8.56) 

a finding important to determining the current density operator. Following the procedure outlined above, it is straight¬ 
forward to show also that 

|x ; , fc] = ijx’~ x = id k {x’) 

[d(x), a] = id k O{x) 

Lastly, from the Taylor expansion 

(8-59) 

then k -*■ —id x as seen when comparing this equation to Eq. (8.50). In the continuum limit (N -* oo, a -* 0, Na -* L ), this 
relation is important. 

Before leaving, consider the identity operator of Eq. (8.44) once more, but let the "box" holding the wave functions 
become so large as to be considered infinite (N -* oo) and consider the wave number bases, so that 

?=£lWJ (8.60) 

with an equivalent version using \x n ). Observe that the weighting factor for each \k n )(k n \ is unity (all the wave functions 
V k (x) are weighted equally in the wave function representation), but instead, what if the weights were chosen to reflect 
the probability of their actual occurrence? For example, if P n was the probability that the state | <p n ) occurs, then the 
"density operator in the <p n description" p is defined by 

P=p n WM (8.61) 

The expectation value gives the number density as a function of position x, or 

N 

p(x) = {x\p\x) = P n \cp n {x)\ 2 (8.62) 

such that the sum over all x n bases gives the total particle number N. The density matrix, however, is more general: 
off-diagonal matrix elements are obtained by 


(8.57) 

(8.58) 


Pm,n ~ ( X m\p\ X n > 


(8.63) 
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where a more matrix-friendly notation is adopted. The sandwiching between the same basis state has particular impor¬ 
tance in treating matrices: a sum over the diagonal elements of a matrix is known as taking the "trace", or Y* n P n ,„ = Tr P- 
Off-diagonal elements become interesting when quantum tunneling is present, and they serve as the starting point for 
the development of the Wigner distribution function of Section 9.3.2. In the continuum limit, an important form of the 
density obtained from Eq. (8.62) that applies to electrons in a metal is 

P(l f) = ^ f dkf PD (E)\ n (r, t)\ 2 (8.64) 

but note that a factor of g = 2 for spin has not been included whereas electrons require it: the factor shall be part of the 
emission equations when they are treated. 

Knowing that there is a density operator p makes one suspicious that there is likely a current density operator j, and 
so there is, but to find it requires a bit more groundwork in the form of understanding the commutation relations of the 
conjugate operators. 


8.2.3 Uncertainty 


As I listen, I am uncertain - see: Heisenberg's uncertainty principle - what to do to H... discussing math while Rome burns - if they knew 
what I'm thinking. 

- Morris "Moe" Berg 19 


Just under five months before the surrender of Germany after the Battle of Berlin in World War II, W. Heisenberg, 
an extraordinary physicist and one of the fathers of quantum mechanics, doctoral advisor to Edward Teller ("father" 
of the hydrogen bomb), and director of the German nuclear energy program, traveled to Zurich, Switzerland to give a 
seminar on S-matrix theory a day or two after the onset of the Battle of the Bulge when Nazi forces pushed more than 
20 kilometers into Allied-held territory. By modern sensibilities, travel for such purposes at such times by persons of such 
importance to projects of such consequence would already be considered unusual, but perhaps even more unusual was 
the former Chicago White Sox baseball icon Morris ("Moe") Berg in the audience with a pistol in his pocket. Berg, at the 
behest of the US Office of Strategic Services (OSS), attended the seminar and was authorized to assassinate Heisenberg 
then and there should Berg suspect Heisenberg was involved in efforts to build an atomic bomb for the Nazis, as it was 
uncertain whether Heisenberg was so engaged [94]. 

Uncertainty can be confusing, but coming to terms with it need not be so fraught with peril. 

Heisenberg's Uncertainty Principle is a central part of quantum mechanics, and concerns the precision to which conju¬ 
gate variables such as x and k (or E and t) can be measured. Arguments based on this principle afford a quick estimate of 
the electric fields required to extract electrons from metals via field emission. A simple means to understand the principle 
is afforded by the methods of the previous section. 

The expectation values (x) = (ip\x\gr) and Ti(k) = (y/\hk\y/) give the most probable position and momentum of an elec¬ 
tron, but there is a spread around these values: uncertainty is a constraint on how small the simultaneous spread of 
each is. "Spread" in statistics often makes use of the "root mean squared," or rms, value of a quantity, as encountered in 
Eq. (3.14). Applied here, the concept suggests that the square of the departure of a measurement from its mean value is 
the relation sought, or 

Ax 2 = (ys\(x- (x)) 2 \ip) = (ys\x 2 - 2x(x) + (x) 2 | yr) = (x 2 ) - (x) 2 (8.65) 


with an analogous equation holding for A k 2 . Such a formulation connects pleasingly with intuition because it accords 
with the usual notions concerning probability and statistics: using Eq. (8.73), 

(ip\x n \y/) = j x n \{x\\p)\ 2 dx= j x n P{x)dx (8.66) 

for n = 1,2, and P(x) = |yr(x)| 2 being the probability that x obtains. Heisenberg's Uncertainty Principle places a limit on 
how small both Ax and A k can be made. 


19 From notes scribbled by Morris Berg while attending a seminar by Heisenberg on S-matrix theory, December 18, 1944, University of Zurich, 
recounted in ref. [94], p. 399. 
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The following argument gives an intuitive idea of what that limit is. A simultaneous measurement of x' = x - (x) and 
k' = k - (k) is constrained by the fact that | xp) cannot simultaneously be an eigenstate of both. Such a measurement M is a 
sum of x and k (not a product, as k will change the x basis states, and likewise with x for the k basis states). A general form 
is M = x! + Xk!, where X is complex for which the real and imaginary components remain to be specified. The question 
is then, how small can both Ax and A k simultaneously be made? Doing this amounts to adjusting the dimensionless X so 
as to minimize both of them. 

By construction, (\p\M\yr) = 0, but (y/\M^M\xp) > 0 because \cp) = M\yr) can be treated as a new wave function and 
(cp\cp) > 0 (an intuitive observation if \cp) is thought of as a vector). Letting X = X r + iX t then 

{cp\cp) = (x' 2 ) + (X 2 + X 2 )(k' 2 ) + X r {x', %'} + iX,[x', &'] > 0 (8.67) 

where the anti-commutator {A, B) = AB + BA is introduced. The real and imaginary parts of X remain to be chosen. One 
constraint is that Ax and A k be minimized simultaneously, but the absence of another means that one of the components 
of X can be treated as arbitrary (the wave function can be visualized as "rotated" until this is so). It is convenient to take 
the real part X r = 0. Then, using Ax 2 = (x' 2 ), A k 2 = (k' 2 ), and [x', k!) = [x, k\ = i, Eq. (8.67) can be recast in a manner that 
allows X i to be specified: 

{cp\cp) = Ax 2 + X 2 Ak 2 — X t >0 (8.68) 

which is minimized when X t = 1/2 A k 2 . That choice entails Ax 2 A k 2 >1/4, or, as more often encountered, where p x = hk: 

AxA p x > | (8.69) 

with an analogous equation holding for energy (A E) and time (At). The next example uses the uncertainty relation for 
A E and At to estimate field strengths in field emission, and is analogous to a treatment by Gomer [95] that did the same 
using A p x and Ax. 


EXAMPLE: Use the energy-time version of Eq. (8.69) to estimate the fields required for field emission from a 
metal due to quantum mechanical tunneling processes. Assume the Fermi level and work function are those of 
copper (p = 7 eV, ® = 4.5 eV). Assume the energy spread of the emitted electrons is roughly 1 eV (see ref. [96]). 
SOLUTION: Very crudely, an electron can pass through a forbidden region (the tunneling barrier) when product 
of the uncertainty in its energy with the time spent tunneling is approximately h/2. Assume the energy uncertainty 
is comparable to the energy spread A E ~ 1 eV. The uncertainty in time is comparable to the transit time At ~ W/v p 
through the barrier, where W = <t>/F is the width of the barrier and v p = \flpjm is the Fermi velocity. Then, from 

AEAt » h/2, it follows _ 

Fw OAE /l^ =8 _ 7 ev 
h \ p nm 

a number comparable to a typical field characteristic of field emission. 


8.2.4 Representations 

...we have interpreted the orthogonal matrix as rotating the coordinate system... These two possibilities: (1) rotating the vector keeping 
the basis fixed and (2) rotating the basis (in the opposite sense) keeping the vector fixed have a direct analogy in quantum theory. Rotation 
(a time transformation) of the state vector gives the Schrodingier picture. Rotation of the basis keeping the state vector fixed yields the 
Heisenberg picture. 

- George Arfken [60], p. 201-202 

The expectation value of an operator H in a rotated basis |i p') = R\yr), or {ys'\H\xp'), can be thought of as the expectation 
value of a rotated operator R'HR in the unrotated basis \yr), or (y/\H'\y/). That is, one can put the "change" on the bases 
(Schrodinger) or on the operators (Heisenberg). The Heisenberg and Schrodinger representations, and the commutator 
relations, provide an elegant formulation of the quantum mechanical current density, and the power of the symbolic 
representation favored by Dirac becomes evident. 

Recalling the relations of Eq. (8.24) and its companions, finding expectation values for time-dependent processes is 
a matter of evaluating an operator 0 sandwiched between a wave function \\p). Thus a time-independent 0 between a 
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time-dependent |y/(f)} is the Schrodinger representation, whereas the Heisenberg representation would put the time depen¬ 
dence on the operator 0(f) and use time-independent wave functions \\p). Consequently, the expectation value can be 
given in one of two ways: 


(W0\v) = {{vmW) 6(&«k(0)» m iw(t)\d s \yr(t)) 

= (yr(0)| ( U\t)O s U(t )) |vr(0)> = <vr(0)|d H (£)M0)> 


(8.70) 

(8.71) 


Therefore, in the Schrodinger representation the wave functions are time dependent and the operators 0 = O s time 
independent, with the variation in time governed by Eq. (8.21). By contrast, in the Heisenberg representation, the wave 
functions are time independent and the operators 0 H (t) = U r (t)b s U(t) are time dependent, with the variation in time 
governed by 

-|D h (£) = (d t U') O s U+ lfO s (d t U) = ( i/h) [H s , b H (t)} (8.72) 

where H s = H 0 + V(x) and the last equality is a consequence of Eqs (8.21) and (8.22) (for which the proof is pedagogi- 
cally useful). This leads to the conclusion that the time variation of an operator in the Heisenberg representation is related to its 
commutator with the Schrodinger Hamiltonian . 20 

To show the connection to the wave mechanics formalism, the continuum versions of Eq. (8.44) for \x) but also | k) 
are useful. Recalling that conjugate variables are related by Fourier transforms, in the continuum limit, the identity 
operator(s) become 


1= J°° dx\x)(x\ = ± j°° dk\k)(k\ 
(xW) = J°° (x\k)(k\x')dk 

= — f e^-^dk = S(x - x’) 

J-o o 


(8.73) 


(8.74) 


and, similarly, ( k\k') = 8{Ji- k'), where 8(x) is the Dirac delta function. The identity operator allows for easy switching 
between x space, for which (x|yr) = y/(x), and k space, for which (k\y/) = ip(k). The commutator relation |x, k\ in the 
continuum case can be reestablished by noting that, first, if |x) is an eigenstate of x such that x|x) = x|x), then, using 
Eq. (8.73), 


k\x) = — [ k\k){k\x)dk= — [ k^\k)dk 

In J -oc 2^ J_ x 

= ^~ idx) h / eikx \ k ) dk =(~ id xM 


(8.75) 

(8.76) 


where ke lkx = -id x e lkx is used. A vanishing of the commutator indicates that the order of the operators is not important 
(they can be thought of as being able to "move past" each other), but an operator like (—id x ) is different because it acts 
upon what it precedes and thereby changes it, and so the expectation is that [x, k\ will not vanish. In fact, 

(x\xk\yr) = J dx , (x\x\x , ){x , \k\y/) = x(-id x yr(x, t)) (8.77) 

(x\kx\yr) = -id x (x\p(x)) = -iyr(x) + x(-id x ip(x, t)) (8.78) 


Subtracting Eq. (8.78) from Eq. (8.77) results in Eq. (8.52), or [%,k] = i. Another relation of consequence is that an 
operator of the form V{x) behaves as 21 

(y,\V{x)\ ¥ ) = J dx 1 J dx( ¥ \x\x')(x'\V(x)\x)(x\yf) (8.79) 

= H(x)Mx,f)| 2 (8.80) 


20 A third representation, the interaction representation, combines the two for Hamiltonians of the form H = H 0 + H t , where H 0 is the 
time-independent part of the Hamiltonian (see Rammer [97]), but that representation is beyond present needs. 

21 AIthough IvdO) instead of |yr) should be used, fastidiousness would clutter, hence the more economical notation. 
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The relation {x'\V(x)\x) = V(x)8(x' —x) has been used, where V(x) is an unspecified but well-behaved operator that, not 
surprisingly, will be identified with the potential energy, just as H 0 = h 2 k 2 /2m is identified with the kinetic energy, for 
which 2 

(x\H 0 \y(t)) = ^(x|&V(f)) = VfcO < 8 - 81 ) 

with d 2 V 2 in three dimensions. These considerations will be of great importance in finding the current density operator 
in Section 9.2.1. 



CHAPTER 9 

Quintessential problems 


... there is one physical element which makes up the system of the bodies that move in a circle, and besides this four bodies owing their 
existence to the four principles... These four bodies are fire, air, water, earth ... This world necessarily has a certain continuity with the 
upper motions: consequently all its power and order is derived from them. (For the originating principle of all motion is the first cause. 
Besides, that element is eternal and its motion has no limit in space, but is always complete; whereas all these other bodies have separate 
regions which limit one another.) So we must... assign causality in the sense of the originating principle of motion to the influence of the 
eternally moving bodies... men seem to have assumed that a body that was eternally in motion was also divine in nature; and, as such 
a body was different from any of the terrestrial elements, they determined to call it “ether". 

- Aristotle 1 

The "fifth element" of Aristotle was said to fill the celestial sphere in which the perfect motion of the planets and 
stars occurred. Its perfection was unlike the four terrestrial essences of earth, air, fire, and water, and was therefore 
known as "quintessence" ("quintus" meaning "fifth" in Latin and "essence" being the constitution of something; Hamlet 
was therefore speaking oxymoronically and ironically when he referred to man as a "quintessence of dust" 2 ). Aristotle 
referred to celestial sphere stuff as "aether", and as Aristotelian thought was the basis of much medieval physics, the term 
carried forward. Not until the famous experiments of Michelson and Morley showing definitively that the understanding 
of this tenet of medieval physics was mistaken did aether die out as an explanatory principle. What survived, though, is 
the word quintessential to refer to that which is the pure essence or most perfect embodiment of something, much like 
the perfect circles thought to describe the orbits of the planets. 

Three paradigmatic problems 3 in emission physics merit the honorific of being a "quintessential problem": (i) the 
hydrogen atom, (ii) tunneling through and transport over a barrier, and (iii) the harmonic oscillator. 

The reasons are easy to state. The quantum mechanical treatment of the hydrogen atom is quintessential in almost 
every way the word can be meant: it brings methods, concepts, and approaches (and the units of Section 2.2.2) to the 
physicists' lexicon, pleasingly does it by showing that there are more perfect motions than just circles, and in the process 
illuminating how probability in quantum mechanics compares to its conventional understanding. For tunneling and trans¬ 
port, by allowing electrons to be found where they are classically forbidden, exhibiting behavior that frustrates intuition 
by being just downright strange (e.g., good luck finding a consensus on what the tunneling time of an electron through a 
barrier is) the tunneling problem represents all that is infuriating about quantum mechanics apart from its making emis¬ 
sion physics interesting in the first place. And lastly, the harmonic oscillator, by virtue of being so well-studied classically 
and quantum mechanically, lurks behind just about everything from mathematical techniques to particle properties and 
so merits the revere it so often receives. Yet so much has been written on these iconic problems that trying to upstage 
earlier bards is hubris. 4 Rather, it is useful to treat the quintessential problems briefly, but in a way that shows the power 
of the techniques they enable and the concepts they exemplify in a manner that will illuminate the remainder of the 
present study. As all are well covered [41^14], some familiarity with each is assumed, or at least strongly encouraged. 


1 Aristotle, Meteorology (trans. E.W. Webster), Great Books of the Western World, ed. Robert Maynard Hutchins, Vol. 8, Aristotle I. Chicago: Ency¬ 
clopaedia Britannica, 1952, p. 445. 

2 Ref. [37]: Hamlet, II.ii.304-305, p. 947. 

3 The list is inclusive, not exhaustive. 

4 Hubris, or the offense of overweening pride so hated by the gods, figured prominently in Greek tragedy and more than once thrusts a hero into 
a bad (Odysseus) or fatal (Ajax) circumstance in the Odyssey of Homer. Here, it is not so dire, as the bad poetry of Christian is comical compared 
to that of Cyrano in the play Cyrano de Bergerac by Edmond Rostand, but the derisive laughter of an audience is disincentive enough to those 
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9.1 The hydrogen atom 

Thus water, besides the oxygen, which is one of its elements in common with many other substances, contains another element as its 
constituent base or radical, and for which we must find an appropriate term. None that we could think of seemed better adapted than the 
word hydrogen, which signifies the generative principle of water, from vSop aqua, and yetvopai gignor. We call the combination 
of this element with caloric hydrogen gas; and the term hydrogen expresses the base of that gas, or the radical of water. 

- Antoine Lavoisier 5 

Hydrogen atoms, the unit of the gas named by the great French scientist Lavoisier, present a puzzle of such unique 
simplicity and complex realization that it cannot but help serve as a paradigmatic example of a physics problem: Bohr's 
quantized planetary model of a single hydrogen atom successfully explained the spectral line dependence of the empirical 
formula behind the Balmer wavelength series for hydrogen of (1/2) = R H (2~ 2 — n~ 2 ) with n an integer and R H the Rydberg 
constant for hydrogen. The Coulomb potential is spherically symmetric and so a standard account of the hydrogen 
atom will often start with the Laplacian in spherical coordinates, as in Section Al.3.1, use a separation of variables in 
Schrodinger's equation, and then slough through a tedious solution of differential equations. But the analogies between 
the Coulomb and gravitational potential of Newtonian physics offers pleasing similarities to identify constants of the 
motion that form the basis of a quantum mechanical treatment. 


9.1.1 Constants of the motion 

The number of commuting constants of the motion is easy to infer: three dimensions indicate three of them, but care 
is required because operators in quantum mechanics do not always commute and that unique feature will underlie the 
discrete energy spectrum of bound states. Such an outlook enables a great deal to be inferred even when it is not directly 
calculated, apart from its utility in providing the atomic orbitals used in Section 21.4, and it shows the power of an 
inspired guess. One constant of the motion is already known, energy, and so 6 

H\nlm) = E nlm \nlm) (9.1) 

Two more constants of the motion are left. Drawing inspiration from gravitational motion, when orbits (as for a satellite) 
are characterized by a constant radius r and velocity magnitude v, the product rv is expected to be constant. To see why, 
from elementary mechanics, the magnitude of the angular momentum about the axis passing through the plane of the 
orbit is mrv; the vector itself is L = rxp where p = mv. For convenience, work in a coordinate system such that the 
symmetry axis is z. Rotations in time about that axis are characterized by cp = cot. That is, for a satellite in a circular orbit, 
the radius vector is 

f = r (cos cpx + sin cpy) (9.2) 

whereas the velocity vector v = dr/dt is 

v = v (- sin cpx + cos cpy) (9.3) 


with v = roo. Because the magnitudes r and v are constant, rxv = rvz is also constant, as intuited. The physics will not 
change if the coordinate system is rotated itself: if rotated about z by an angle cp' (using the embedded 3x3 rotation 
matrix in R z (<p) of Eq. (8.33)), the magnitude of L is unchanged because 


1 cos cp' 

sin cp' O'! j 

f r sin 0 cos tp\ 

- sin cp' 

cos cp' 0 

r sin 9 sin <p 

{ o 

0 ljl 

y r cos 6 ) 


5 Antoine Lavoisier (trans. Robert Kerr), Elements of Chemistry, Great Books of the Western World, ed. Robert Maynard Hutchins, Vol. 45, Lavoisier/ 
Fourier/Faraday. Chicago: Encyclopaedia Britannica, 1952, p. 31. Lavoisier, the father of chemistry and member of the French Academy of 
Science, gave hydrogen its name and showed that hydrogen gas resulted when water was decomposed. To fund his famous laboratory, he 
associated with Ferme, financiers who collected taxes for the Crown. That led to his condemnation by the leaders of the French Revolution, 
particularly Jean-Paul Marat (a doctor by training). Lavoisier was tried and condemned, then guillotined on May 8, 1794. A magnificent portrait 
of him and his wife Marie-Anne Paulze is in the Metropolitan Museum of Art in New York. 

6 The notation kowtows to convention, leaving the hapless reader to know when m is the electron mass and when, like here, it is not. 
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f r sin 0 cos (<p - cp') 
r sin 0 sin (<p — <p') 
k r cos 9 



= r' 


(9.4) 


where, although 6 = n /2 for a satellite with an equatorial orbit, 6 was left in to show the general case. The rotation on 
v proceeds analogously, and so P x v' = rvz again. The z • L = L z component of angular momentum appears to be one of 
the sought constants of the motion (of course it is, but the narrative phrasing is for dramatic effect). How might that be 
shown? 

From the classical relation L = rxp, it follows that L z = xp y —yp x . Thus, the quantum mechanical operator is 
L z = -iti (xd y —yd x ) in the coordinate representation, or 

L z = fi {xky-yk^j (9.5) 

more generally. This is the "canonical procedure" for going from the Poisson brackets of classical mechanics to the 
commutator relations of quantum mechanics; in gauge field theories, the translation is more elegantly captured in the 
path-integral formalism with its alluring relations to statistical physics. Although far afield from present concerns, the 
deep relation between symmetries, invariances, and conservation laws deserves a brief digression. 

Both the gravitational and the coulomb potentials are 1/r potentials, meaning that the relevant term is the magnitude 
of r, not its direction: rotating the coordinate system will not change the potential. Said another way, V(r') = V(r) if r' 
and r are related by Eq. (9.4). Consider, therefore, an infinitesimal rotation about the z axis by an amount Sip, and using 
Eq. (9.4), implies 


V(r') = V(x + S<p y,y — 8<p x, z) « V(f ) + y8cpd x V — xSipd y V 

= (i + l Y L z) = ex P i}^/K)l z ) V(r) (9.6) 

The angular momentum operator is therefore seen as the generator of rotations, and the form of this equation looks familiar 
having seen the unitary operators of Eq. (8.22): just as invariance under translation results in conservation of energy and 
momentum, so too does invariance under rotation result in conservation of angular momentum [98]. And with that, the 
"canonical procedure" seems to make sense. 

So, does L z commute with the Hamiltonian H = h 2 k 2 /2m + V(r), where k 2 =k x +k 2 + k 2 and similarly for r 2 ? It should 
be possible to show so in any coordinate system, but taking our cue from Lear ("seek thine own ease" 7 ), cartesian 
coordinates are particularly easy, and so they shall be used. For even greater ease, number the axes labels rather than 
use ( xyz ), that is, in the (ijk) notation, the cartesian axes are such that (x,y,z) = (x 1 ,x 2 ,x i ) and similarly with k. Matters 
will be made still easier if some properties of commutators, introduced in Section 8.1.2, are generalized, particularly Eq. 
(8.52). To that end, introduce the Kronecker delta function that equals 1 when its arguments are equal (i = j) but 0 
otherwise, and the Levi-Civita symbol e ijk , defined to be +1 if (ijk) is a cyclic permutation of (123) (e.g., (312)), to be 
-1 if (ijk) is anti-cyclic (e.g., (321)), and 0 if two or more of the indices are equal. Defining e ijk in terms of its cyclic and 
anti-cyclic behavior allows it to be generalized to more dimensions, such as e™, with advantages that will nevertheless 
not be capitalized upon here. With the new and improved notation, the relations 

[Xi,pj\ = ihS t] 

4 = Z e pM- 

are easily expressed. Also, use the p = hk notation so as to not provoke confusion between index k and wave operator 
k. So, for example, L 3 = e 312 x,p 2 + e 321 x 2 p, = x l p 2 - x 2 p Y = L z (remember that Xj and p k commute for j + k). A rela¬ 
tion for when products of operators appear in the commutator is needed: for operators A, B, and C, explicit evaluation 
shows 

[AB,C\=A[B,C] + [A,C\B=-[C,AB] (9.9) 


(9.7) 

(9.8) 


7 Ref. [37]: King Lear, m.iv.24, p. 1084. 
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Now to find the commutator of H with L z . Using Eqs (9.7) and (9.9), consider the kinetic energy operator p 2 /2m first. Its 
commutator with L z will contain 

X [Pk’ e m X iPj\ = [fl’Xlfo I - [p 2 2’ X 2Pl\ 
m 

= [Px’*Py] ~ [Py » yPx\ ( 9 - 10 ) 

with just the surviving parts being shown, where the second term on the right-hand side just has the roles of x and y 
reversed from the first. Making use of Eq. (8.54) and working it out, [p 2 ,xp y ] = -2ihp x p y , and so the two terms on the 
right-hand side of Eq. ((9.10)) are the same but for their sign, and so their sum vanishes. 

The harder commutator is [y,L z ], but it, too, cracks quickly in the coordinate representation for which Pj = -ihdj. 
Taking refuge behind clever notation is not going to help now: it is better to confront what is under consideration in its 
full detail. Terms of the form 

-in [V(r),xd y ] y/(r) = -ihV{r)x ( d y y/(r )) + ihxd y ( V(r) V (r)) 

= ihx (d y V(f)) \p(j) (9.11) 

appear, but the second term just switches the roles of x and y, and so, after the cancellations, 

[V(r),L z \ = ih (xd y V - yd x v) = in (xd y r - yd x r) d r V(r) 

= in (x(y/r) - y(x/r)) d r V(r) = 0 (9.12) 

As fast as that was, it can be made faster still if the results of Section Al.3.1 are invoked, in particular, Eqs (A1.53) and 
(A1.54): when substituted into L z , then in polar coordinates, 

L z = -ihd lp (9.13) 

meaning that L z depends on variations in the axial angle cp alone. Because V(r) depends only on the magnitude of |r| = r, 
the commutator of L z with V(r) must vanish, as will all commutators of operators that depend on different coordinates. 
Thus, \\p) = \nlni) is an eigenstate of L z , and so 

L z \nlm) — mh\nlm) -*■ ( r9cp\nlm) cx e' mq ‘ (r0\nl) (9.14) 

So is it seen that L z commutes with the Hamiltonian H and is a constant of the motion. The values that m may take up 
will be considered after the remaining constant of the motion is identified. 

Flush with the success of finding L z , it might be naively expected that L x or L y might be the last term sought. Alas, a 
brief calculation shows that neither of them commute with L z because 

= ihl k (9.15) 

with the indices a cyclic permutation of (123). 


EXAMPLE: Explicitly prove Eq. (9.15) for the case [L y , Lj. 

SOLUTION: Retain only non-zero terms, observing that [x^Pj] = 0 for i ± j, and use Eq. (9.9) to find 
[L r L z \ = [~xp z + zp x ,xp y - yp x \ 

= [zPx’ X Py] ~ [xPz^Px] 

= z[p x ,x]p y -y[x,p x ]p z 
= ih{zp y -yPz) 

: iht x 
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But notice something interesting: if L ± is defined as 

L ± = 4 ± &y 


then 


[L ± ,L Z \ = +TiL ± 

but that in turn means (and one should work it out) 

[L + t_,L z ] =0 


Suddenly the quest for the final constant of the motion comes in view: if L 2 = L 2 + L 2 + L 2 , then (work this 

l 2 = L + L_ + l 2 — hL z 


(9.16) 

(9.17) 

(9.18) 
out, too) 

(9.19) 


and so L 2 is the last commuting constant of the motion (it commutes with L z , and L z commutes with H, so therefore 
L 2 commutes with H). But that is great: if one has the stamina to work it out using Section A 1.3.1, the Laplacian of 
Eq. (A1.63) is the same as 


V 2 


J__d 
r 2 dr 


(9.20) 


although one might have expected this. 8 Eq. (9.20) reveals something perhaps surprising: the eigenvalues of H do not 
depend on the eigenvalue m, that is, the energy is degenerate with respect to L z , so that states of differing m but with the same 
L 2 and eigenstates for r have the same energy. In short, H\nlm) = E nl \nlm). When spin is included in the description of 
the electron, then the degeneracy is removed, but that is a problem for another time. 


9.1.2 Ladder operators 

The operators L ± have a utility more than just being aesthetically tantalizing: they are known as "ladder operators" 
because of the effect they have on \\p) = \nlm), and from those effects, the nature of the eigenvalues can be inferred even 
without solving the differential equations which underlie the Laplacian. To see how, first observe that L f ± = L :f . Letting 
L_\y/) = \d>), then clearly 

(v\ L + L_\y/) = (yfL + \L_v) = ^ 0 (9.21) 

But what effect does L_ have on \y/)? Look at 

L z L ± \nlm) = ([ L Z ,L ± ] + L ± L Z ) \nlm) 

= ( ±hL ± + mhL ± ) | nlm) = Ti(m± \)L ± \nlm) (9.22) 

In other words, L ± \nlm) acts just like \nl(m ±1)). This conclusion was come to by simply considering the commutation 
relations, and it is the non-vanishing of the commutation relations that give quantum mechanics its particular panache. It 
is possible, though harder, to come to the same conclusion in the coordinate representation. From Eqs (A1.53), (A1.54), 
and (A1.55) and after a good amount of tinkering, 

L ± = -i(x- iy)p z ± iz(p x + ip y ) (9.23) 

= he ±iv (±d g + i cot0dJ (9.24) 

which, when coupled with {cp\m) = e imq> shows that e mup -*■ e' (m±v>,t ’ when L ± acts upon | m). The ladder operators, which act 
to raise or lower m by one step in | nlm), cannot do so forever: the eigenvalues of Z, 2 are bounded in size by the eigenvalues 
of L 2 , after all, so there must come a point where the last application kills |i/r). Let |y/_) be such that = 0, and 

analogously, let L + \ys + ) = 0. But this means, using Eq. (9.19), that 

L 2 |i/0 = h 2 m(m- l)|yr_) (9.25) 

L 2 |i/r + > = h 2 m(m + l)|i/r + > (9.26) 


s Actually, given the form of the Laplacian or the gravitational analog, one should have expected this: the angular momentum operator concerns 
rotations, that is, changes orthogonal to t. 
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One sees that if m = +1 is the maximum value, then m = —l is the minimum value. For this reason, the eigenvalues of 
L 2 are taken to be the positive values h 2 l(l + 1). Similarly, l cannot increase without bound: it, too, is bounded below by 
/ = 0 (the eigenvalues of L 2 cannot be negative) and above by the values of energy, or rather, n. The last step to finding 
E nl is to consider the radial equation, but most of the work in dealing with the angular part of the problem has been 
accomplished without having to explicitly find the form of the eigenstates of L 2 , typically referred to as Y lm (0,<p), 9 simply by 
exploiting the consequences of symmetry, arguments about what the commuting constants of the motion have to look 
like (something that will be advantageous in Section 21.4), and the introduction of ladder operators. 10 

That such feats are possible is a reflection of the deep relation between symmetry groups and conservation laws, and 
works well beyond the rotation groups that appear from solid state to particle physics [101]. Mathematical beauty has a 
role to play in how physics unfolds [89]. But there is one more point that can be made before considering the last (radial) 
equation: angular momentum has both magnitude and direction, it is a vector. However, as seen in the first run-in with 
the hydrogen atom in Section 2.2.2, the ground state was symmetrical (rotation-invariant), and unless something breaks 
that symmetry, be it an electric or magnetic field or the coupling of electron spin to angular momentum, why should the 
energy levels care about what orientation the atom has (put another way, L 2 has magnitude but not direction, so does 
direction not figure in the Hamiltonian)? In fact, they do not: the energy levels depend only on the quantum number n, 
or H\nlm) = E n \nlm), at least in the simple model. Such will be shown in Secton 9.1.3. When the symmetry is broken or 
coupling to other effects taken into account, then the sub-orbitals characterized by / acquire an energy difference, and 
the parlance of s, p, d, and / to refer to them (taken from their historical designation of sharp, principal, diffuse and fine for 
/ = 0,1,2,3, respectively [41]) arises. 

The ladder operator of Eq. (9.23) and the fact that there is a highest (or lowest) angular momentum state for which 
L + \nll) = 0 allows the Y lm {9, <p) states to be determined, along with the normalization condition given by 

1 = (lm\lm) = 2n f sinOdO \Y lm (9,cp)\ 2 = 1 (9.27) 

Jo 

where advantage has been taken of the fact that ( q>\m ) <x e lm,p . Letting Y lm (9, q>) = (2 n)~ ll2 e im,p ® lm {9) then [41] 


d 6 ® u {0) = l cot(0)0 (9.28) 

with solutions of ® n (9) <x (sin0) ; . Then repeated applications of the ladder operator can generate the remainder, or 


x —V- ( -———'j (sin0) 2/ 

sin 0 \ d(cos 6)) 


(9.29) 


but this is very close to Rodrigues's formula (Section A2.5, Eq. (A2.14)) for the generation of Legendre polynomials 
P;(cos0). In fact, \/2® m (9) = \jfll + l)P ; (cos 9), and so, apart from normalization. 


®lm 


sin m 6»| 


d(cos9)) 


Pfi cos 9) 


(9.30) 


where the lack of concern with the normalization constants, which complicate the narrative, is because the calculation 
of average quantities or expectation values finds those constants in both numerator and denominator of the ratios that 
arise, and so cancel (apart from the fact that these normalization constants are both extensively documented or easily 
calculated numerically). 

Standard accounts of {9cp \ Im) = Y lm {9, cp) hew rather closely to their representation in terms of spherical coordinates and 
include their proper normalizations. Yet, when probability is treated, there is advantage in their cartesian representation 
given the coordinate transformations of Eqs (A1.48)-(A1.50) (it aids in the visualization of chemical bonding [102]). 
Moreover, for the purposes of calculating averages, the normalization is far less important than the relative magnitudes. 
To reinforce that the real and wnnormalized angular functions are under discussion, refer to them as A [m (in contrast 
to the radial functions R nl below). The s sub-orbital A 00 = 1 is easy: there is no angular dependence (s sub-orbitals are 
spherically symmetric). The normalization associated with A 00 is 1/ \[Ak. The next sub-orbitals are composed of Y l 0 -*■ A 10 


9 The form of the Y lm , and their normalization, are fodder for quantum mechanics [42-44] or electrodynamics [99, 100] textbooks, but the 
needs here are different, apart from concerns about the derisive laughter of audiences again. 

10 The term "ladder operator'' will be eclipsed by the terms "creation and annihilation operators" which, apart from being a more riveting 
terminology, point to states being occupied and unoccupied as a result of transitions, as for the harmonic oscillator in Section 9.3. 
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and Fj ! ± Y 1 4 1±l . Keep in mind that the magnitude of the m subscript on A jm relates to L z eigenvalue, but its sign 
has been adapted to refer to how the Y lm functions are combined. Thus, 

A i,o = z/r -* p z 
A 1+l = x l r Px 

K-\=y/r^Py (9.31) 

where r 2 = x 2 + y 2 + z 2 and the —► symbol designates the association with the spdf notation convention favored in dis¬ 
cussions of bonding. The normalization associated with A lm is \/3/4;r. The last orbitals to be of interest here are A 2m , and 
they are also made up of combinations of the Y lm to make real functions: 

A 2 0 = Xz/r) 2 - 1 -> d zz 
A 2+l = V3 (ixz/r 2 ) -» d a 
A 2,-i = V3 (2 yz/r) -» d yz 
a 2,+2 = V^((x 2 - y 2 )/r 2 ) -> d x2 _yz 

A 2 _2 = V3 (2 xy/r 2 ) ->• (9.32) 

The normalization associated with A 20 is i/5/ l 6k, as can be shown by numerical or analytic integration. 


9.1.3 The radial equation 

The contribution of angular momentum may now be included back into the radial equation for the hydrogen atom 
introduced in Eq. (2.6), but, with the introduction k 0 = mac/h = ay 1 , and the more convenient representation of 

V nlm (r,e,(p)= l ^p-Y lm {6,(p) (9.33) 


then Schrodinger's equation becomes (where the nl subscripts are suppressed on u(r) until they are needed) 


dr 2 


- — | u(j) = -k 2 u{r) 


(9.34) 


valid only for r > 0. The form of this equation has the look and feel of the one-dimensional Schrodinger's equations that 
will dominate the tunneling calculations in store, and so experience with this one is good preparation. The intent here, 
as with the angular part, is not to rigorously solve Eq. (9.34) (particularly because solutions to the equation have been 
very well treated elsewhere, see, for example. Chapter 14 in ref. [103]), but rather to highlight generally useful methods 
on a problem for which there is ample documentation as to its exact solution. 

By rendering Eq. (9.34) to have the look and feel of a typical one-dimensional problem allows one-dimensional termi¬ 
nology to be employed. In particular, to prevent concerns about r < 0 solutions creeping in that make no sense, visualize 
the potential barrier as going to oo at r = 0, so the wave function is not found there. Now an effective potential of the 
form 

h 2 (1(1 + 1) 2* e \ 




(9.35) 


exists, where the minimum of the potential is at r min = 1(1 + 1 )a 0 , and has a depth h 2 k 2 / (2ml(l + 1)). The effective potential 
is shown in Figure 9.1. 

In the limit of large r, becomes negligible, and so the equation for u(r ) goes over into d 2 u(r ) - k 2 u(r) « 0, sug¬ 
gesting solutions of the form u(r) ~ e -foc , a relation already encountered in the treatment of Eq. (2.6). Pull out this part 
of the solution by letting u(r) = g(r)e~ kr to obtain 

f , 2fc„r-/(/+in 

< d 2 - 2 kd r + 0 ^ -- | g(r) = 0 (9.36) 

If g(r) were a polynomial, the action of d r would be to reduce the power of each term in the polynomial by one, and 
similarly d 2 would reduce the power by two. But the factors of (1/r) and (1/r 2 ) do just that as well, and so the urge to try 
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Figure 9.1 The effective potential V eff (r) associated with u n ,(r) as a function of r/a 0 of Eq. (9.35), where a 0 is the Bohr radius (0.052918 nm) for the 
angular momentum states / = 0, 1, 2, 3, and 4. The gray lines labeled with n values correspond to E n = -h 2 /2ma 2 n 2 . 


a power series solution for gif) seems reasonable - until one appreciates that an infinite series expansion will wrongly 
suggest that g(r) ~ e 2kr when r is large, as it is the solution to {d 2 r - 2kd r } g(r) = 0, and that grows faster than the e~ kr 
of Eq. (2.6) declines. Something therefore has to truncate the infinite series expansion, and that, in turn, will lead to 
particular values of energy which are allowed, that is, the bound states. This can be seen by formally inserting the power 
series, finding the recursion relation between the coefficients, and then finding what constraints exist on k to terminate 
the series. That would be the careful approach. Instead, rashly consider what happens when g(r) <x r n (where n is an as 
yet unspecified power) and see what can be said about the values of n and k. Inserting g(r) = r n into Eq. (9.36) results 
in 11 

[«(« - 1) - /(/ + 1)] r” -2 + 2(k 0 - nk)r"~ l = 0 (9.37) 

The value of r is arbitrary, and therefore the coefficients of r" -2 and r” -1 must separately equal 0. 

The second term vanishes when nk = k 0 , and so only particular values of the energy E(k) are allowed that satisfy the 
relation ^ 

fc=~ (9.38) 

which implies that the energy levels are quantized according to E n = h 2 /{2ma 2 0 n 2 ), where a 0 = l/k 0 is the Bohr radius. 

The first term on the left-hand side of Eq. (9.37) vanishes if / = n — 1, which additionally confirms that n is an integer 
because l is: the relation 1 = n - 1 specifies the maximum angular momentum state. But what about the lower angular 
momentum states? Setting 1 = n-2 leaves terms behind in Eq. (9.37). Because the eigenstates of u nl {r)/r are orthogonal, 
gir) must therefore contain other powers of r, the coefficients of which can be found by enforcing the condition of 
orthogonality between the u nl ir)/r. But there is a simpler way. 

When l changes from l=n—1 to l=n — 2, a term C n _ 1 r n ~ ] can be added to g(r) with C n _ x adjusted until Eq. (9.37) 
holds again. That is, g(r) is modified by changing r" to r" - C n _ l r n ~ l . Similarly, as; increases, every time then a 

term C n _-r n is to be appended to g(r) and the resulting Eq. (9.36) set equal to zero. Setting the coefficients of the different 
powers of r to zero (r is arbitrary, and doing so is the only way to achieve Eq. (9.37)) will determine the C n _j values. After 
the unnormalized u nl (r ) are found, their normalization can be found at leisure by the orthogonality relation 

(n'l'm'\nlm) = S nn ,S u ,S mm , -*■ [ u n ,,,(r)u nI (r)dr = 6 nn ,6 w (9.39) 

Jo 

Does this prescription work? Yes, but that is better seen by example, although a reduction of Eq. (9.37) to dimensionless 
form in terms of s = k 0 r proves useful. Preloading the relation k = k 0 /n results in 


11 Technically, letting #(r) = Ar” is more exact, but as the wave function can be specified up to an arbitrary coefficient, it is simply easier to take 
A= 1. 
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Observe that this dimensionless equation can be rewritten as 

where the grouping of terms more directly suggests the relationship of k to k 0 (second parentheses) and then of / to n 
(first parentheses) for the maximum angular momentum case for which g(s) oc s n . 


EXAMPLE: Find u nl (r) for n = 1. Express in terms of s = k 0 r. 

SOLUTION: For n = 1, g(s) = s n = s, l = n- 1=0, and the left-hand side of Eq. (9.41) identically vanishes. 
Therefore 

u l0 (r)/r = N l0 e~ s (9.42) 

where N l0 is the normalization. 


EXAMPLE: Find u nl (f) for n = 2. Express in terms of s = k 0 r. 

SOLUTION: 

• For n = 2 and 1= n - 1 = 1, g(s) = s n = s 2 , and the left-hand side of Eq. (9.41) identically vanishes. Therefore 

u 2l (r)/r = N 21 se~ s (9.43) 

• For n = 2 and 1= n - 2 = 0, g(s) = s n - = s 2 - C x s, and the left-hand side of Eq. (9.41) becomes 2 - Q = 0. 

Therefore, C x = 2 and 

M 20 (r)/r = N 20 (s - 2)e- s/2 (9.44) 


EXAMPLE: Find u nl (f) for n = 3. Express in terms of s = k 0 r. 

SOLUTION: 

• For n = 3 and 1= n - 1=2, g(s) = s n = and the left-hand side of Eq. (9.41) identically vanishes. Therefore 

u i2 (r)/r = N 32 s 2 e~ s/3 (9.45) 

• For n = 3 and / = n - 2 = 1, g(s) = s n - C^s"- 1 = s 3 - C 2 s 2 , and Eq. (9.41) results in 6 - C 2 = 0. Therefore, C 2 = 6 
and 

u il (r)/ r = N 31 (s-6)e- s/3 (9.46) 

• For n = 3 and 1= n - 3 = 0, g(s) = s 3 - C x s 2 - C 2 s, and Eq. (9.41) becomes (9 - C 2 )s + (2C l - 3C 2 ) = 0. Requiring the 
coefficient of each power of s equal 0 results in Q = 9 and C 2 = 27/2. Therefore, 

u i0 (r)/r = 1 V 30 (s 2 - 9s + (27/2))<T s / 3 (9.47) 


The requirement that the u nl (r) functions be normalized as per Eq. (9.39) is the last step to determining N nl . As an 
example, consider 1V 30 . Expressed in terms of s = k 0 r, the normalization is determined by 

1 =J u 30 (s) 2 ds = N 2 0 (s 2 -9s+y)Y 2j / 3 <fe (9.48) 

Employing Eq. (A2.3) shows that JV 30 = 4\/3/243 (see the denominator of Eq. (9.50) in the example below for what is 
involved), and so for ( r6cp\nlm ) = y/ nlm with n = 3 and 1 = m = 0, 


V3oo( r ’ S ’<P) = “ 9 V+ Y, 


where the last factor is (1/4^) 1/2 = Y o o (0, ip). 



(9.49) 
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Fixation on the normalization factors obscures that they can often be sidestepped: there are easy ways to use the wave 
functions without them, and such methods will be seen to be useful in finding current density or quantum efficiency. As 
a straightforward example, the expectation value of 1/r for the |300) can be calculated without an explicit form for the 
normalization by 


(300| (k 0 r) 1 1300) = 


/ 0 °° x(2s 2 - 18s + 27 ) 2 e~ 2s ^ds 
/ 0 °° s 2 (2s 2 - 18s + 27 ) 2 e~ 2s ^ds 


/2\ (5!) - 12(4!) + 48(3!) - 72(2!) + 36(1!) 
V 3 / (6!) - 12(5!) + 48(4!) - 72(3!) + 36(2!) 


9 


(9.50) 


where, after a change of integration variables to s = 3ic/2, the Gamma function (Eq. (A2.4)) of Section A2.2 has been 
used. Should the hydrogentic wave functions yr nto! (r, 0, cp) = ( r0cp\nlm ) = R nl (r)Y lm {6 , cp) actually be needed, there are many 
tabulations of them (see, for example, refs [41, 102, 104]): it is sufficient to know that they are available (or how in 
principle they may be evaluated) rather than what they are. 


9.1.4 Orbital shape and probability 

Orbital shapes can be expressed in any convenient coordinate system, with cartesian and polar frequently encountered. 
In polar coordinates (300|300) is ^ 

(300|300) = ^ \R 30 (r)\ 2 4xr 2 dr (9.51) 

where the independence of the s orbitals on 9 and cp allows the angular integrations to be performed and results in the 
factor of 4k (a factor offset by the normalization factor of 1 / \J~4k, being the last term in Eq. (9.49)). Apart from the r 2 dr, 
which has units of volume, the integrand is the probability per unit volume that a measurement will reveal an electron 
at the position r. An indication of its behavior is given in Figure 9.2 for the first three levels of the related function u nlm (r ) 
for the s states (l = m= 0), a function more pleasant to plot because of the absence of the 1/r dependence of R nl (r ) for 
small r. 

Visualizations of the orbitals in cartesian coordinates that portray them as solids taking the shape of spheres, weather 
balloons, and dumbbells show the probability density surface of a constant value (say, where \y/\ 2 dv = 1/2) but an image 
is potentially misleading if it suggests the electron is only inside the solid figure or uniformly smeared throughout it. The 
distributions represent the probability of finding an electron in the small volume dv = dxdydz at the location (x,y,z). A 
plot of where repeated measurements of position found an electron will produce a fuzzy cloud that gives a better intuitive 
feeling of what the distribution is. 

The hydrogen atom problem gives full expression to the subtle notion of probability that permeates quantum mechan¬ 
ics. To model the probability distribution, a uniformly distributed random number r n can be generated and compared to 
\y/ nlm (x,y,z)\ 2 : if r„ is smaller, an electron has been found inside the small volume (conversely, if r„ is greater, no elec¬ 
tron was found). After many such "measurements", a graphical representation of the locations where the electron was 
found gives a fair picture of what the probability distribution looks like. The process of comparing a random number to 
a probability distribution is commonly called an "acceptance/rejection method" (alternately, Neyman's method [105] or 



s = k 0 r 


on u nlm (s) for / = m = 0 and n= 1,2,3 (the s orbitals). The curves 






have been scaled by their 








Quintessential problems 101 



10 15 20 

Figure 9.3 A representative distribution of measurements that find an electron for the wave function y/ 300 (x,y, z) evaluated in the x-z plane (y = 0) 
using the algorithm of Section A3.6. 

"hit-or-miss" [106]) as when applied to evaluate integrals in Section Al.2.5 (as opposed to modeling carrier transport 
[81, 107, 108], where Monte Carlo entails a great deal more). A simple method to generate those fuzzy images is given 
by Section A3.6, used to generate a representative slice of sightings in the x-z plane associated with the wave function 
y/300 (an s orbital). The results are shown in Figure 9.3. Compare those findings to the n = 3 line of Figure 9.2, where the 
gaps are anticipated by the behavior of the u nl (k 0 r) function. 

As with human anatomy, a slice looks different than the full body. By evaluating all the slices in the y direction, 
not just the x-z plane, and stacking them like a loaf of bread, an idea of the three-dimensional shape of the orbital 
emerges and resembles the probability distribution used to select the points. Visualization can still be a challenge, and 
the usage of projections simplifies the visualization. A projection is obtained by integrating the distribution along one of 
the coordinates, say y, and is therefore given by 


p{x,z) = J \yr nlm (x,y,z)\ 2 dy 


(9.52) 


which then is a measure of the probability that an electron is found with the coordinates x and z for any y. The shape 
of the orbital is then related to the projection, and an example is shown in Figure 9.4 for the more visually interesting 



Figure 9.4 A large number of observations 
p 320 (x, z): the sum over the y coordinate is 
n = 3, or \nlm) = 1320). 


—15 -10 -5 


in be represented in a plot of how often a 
eant to mimic Eq. (9.52). The algorithm is 


electron is found with particular coordinates. Shown here is 
in Section A3.6. The orbital shown is for d B of Eq. (9.32), and 
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Figure 9.5 Using Eq. (9.52), a contour plot (left) of p 210 (x,z ) compared to the "measurement" scatter plot (right). 



Figure 9.6 Same as Figure 9.5, but for p 320 (x,z). 


d a orbital of yr 320 (x,y, z), which agrees (as it should) with the more conventional plot of the contours of the integrand of 
Pt, 2 o( x > ? ) shown in Figure 9.6 (for chemical bonding in the larger atoms, the p orbitals are of particular importance, for 
which the p z orbital is also shown in Figure 9.5). 

The orbitals of many electron atoms differ from the simple orbitals of the hydrogen atom. More importantly, when 
atoms are brought into proximity with each other, their outermost electrons are shared and lead to chemical bonding, 
but molecular orbitals lay on even greater complexity. It is common to assume that the molecular orbitals are linear 
combinations of the atomic orbitals, or LCAO for short [53, 102], In contrast to the Bloch waves favored in solid state 
physics, the LCAO methods are favored in treating chemical interactions [104], They shall be encountered again in 
Section 21.4 when the band gap is considered. 


9.2 Transport past barriers 

If one separates the wave equation for a hydrogen atom in an homogeneous electric field in parabolic coordinates, one finds that one of 
the equations has a potential energy which becomes negatively infinite for infinite values of the coordinate ...There are thus no stable 
stationary states possible for a hydrogen atom in such a field... The values of the field for which the dissociation becomes appreciable 
are... about a tenth of the field which makes the classical Bohr orbit unstable. The effect increases very rapidly with a drop in the ionization 
potential of the atom, so that we should expect it to be particularly marked for certain metallic atoms at the surface of a conductor. The 
aeona effect, or pulling of electrons out of metal by fields of the order of 10 5 e.s.u., is probably to be accounted for in this way. This is 
confirmed by the fact, that when one uses the data ofR.A. Millikan and C. Eyring to plot the reciprocal of the field against the logarithm 
of the current minus one fourth the logarithm of the field, the points so obtained lie on a straight line. 

- J. Robert Oppenheimer [109] 
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In a pleasing parallel to Lavoisier's studies of hydrogen, consideration of the hydrogen atom in strong fields allowed 
Oppenheimer 12 to anticipate that tunneling phenomena, wherein an electron can pass through a region where it is 
classically forbidden, may explain the pulling out of electrons from a metal surface subject to intense electric fields. As 
Oppenheimer sagely observed, there was not a constant of the motion as normally required in the modus operandi of 
quantum mechanics: eventually, a hydrogen atom under field would lose its electron, even if it took (as Oppenheimer 
calculated) lO 10 ’ 0 seconds to do so for a field of 1 Volt/cm. Therefore, the consideration of hydrogen helped motivate the 
next quintessential problem that will be of even greater importance to the understanding of electron emission: emission 
over, and under, a barrier. 

9.2.1 Density matrix 

And there's nothin ' sure in this world 
And there's nothin' pure in this world 
Look for something left in this world 
Start again 

- Billy Idol 13 

Pure states are convenient to work with, but a description of probability density p and current density ] is more 
complicated and requires a mixed-state representation to obtain a quantum mechanical current density j(x, t), for which 
three essential components are required: (z) the continuity equation (Eq. (7.19)) relates the time-derivative of density 
to the spatial derivative of current density, (ii) the time derivative of an operator is related to the commutator of that 
operator with the Schrodinger representation Hamiltonian H S = H 0 + V(x) (Eq. (8.72)), and (Hi) the commutator of an 
operator with k is related to the spatial derivative of that operator (Eq. (8.58)). Once in hand, these three tools make 
quick work of finding a means to evaluate j(x, t). 

From the eigenstates defined by \yrf(f)) = U(t)\y/j(0)) such that H s \y/j) = Ej\y/j), the density operator pit) is 

Ph( 0 = 'Zf(E j )\v j mvm (9.53) 

where/(E,) is a measure of the probability that an electron has an energy Ef. in three dimensions, for example, it would 
be the Fermi-Dirac distribution as in Eq. (8.64), but in the one-dimensional case it is termed the “supply function',' as 
in Section 11.1. Letting the operator 0(f) in Eq. (8.72) be the density operator, then it follows 

ihd t p H (i) = [H s , p H (t)] (9.54) 

Observe that (x\ p{t)\x) = p(x, t). Because of Eq. (8.79), it follows that 

{x\[V(x),m]\x) = 0 (9.55) 

and so the commutator becomes 

[*■•»)]=£[ p -«'>]=-£l {*•«)} ' 9 ' 56 ' 

where the anti-commutator notation j k, p j =kp + pk was introduced after Eq. (8.67). If; is defined such that 

|957) 

J H (t) = ^m{p s ,nk} u(t) (9.58) 


12 Oppenheimer's fate, though far less grim than Lavoisier's, might be seen as symbolically parallel: On June 29, 1954 (9 years and 13 days 
after the Trinity test ushering in the atomic age, which Oppenheimer's leadership made possible), the Atomic Energy Commission (AEC) 
announced the revocation of his security clearance, deeming him a risk due to "fundamental defects in his character" and "his disregard for the 
obligations of security." The testimony of Edward Teller, who championed the hydrogen bomb that Oppenheimer publicly advocated against, 
was instrumental. P.J. McMillan summarized the incident thus:" ...by taking away the clearance of the man who had replaced Albert Einstein 
as the public face of scientific genius, the government told the scientists: We want your work, but we don't want you." (P.J. McMillan, The Ruin 
of J. Robert Oppenheimer and the Birth of the Modem Arms Race. New York: Viking Penguin, 2005, p. 262). 

13 Billy Idol, lyrics to White Wedding, 1982. 
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then the quantum version of the continuity equation of Eq. (7.i9) is 

f/M0+J^(f) = ° (9.59) 

which looks much like its classical analog, but appearances are not the same as properties. Expressed in terms of its wave 
function representation, ] is 

(y/\j H (x,t)\y/) = ^ (i/r(f) | j/3 5 , A:| M0> 

= {VV, t)d x y/(x, t ) - yr(x, t)d x y/\x, t)} (9.60) 


as is verified by expanding the anti-commutator and using Eq. (8.60) with the replacement \x n ) -* |x). 


9.2.2 Transmission probability 


Insistence on clarity at all costs is based on sheer superstition as to the mode in which human intelligence functions. Our reasonings grasp 
at straws for premises and float on gossamers for deduction. 

- Alfred North Whitehead 14 


The mathematics behind calculating a transmission probability are fairly straightforward to work out. The understand¬ 
ing of the implications, though, is difficult because they tend to contradict expectations. Begin with the methods first, 
anyway. Basis states that are chosen with the constants of the motion in mind, much like finding the stable strategies in 
Section 8.1.3, are particularly important. In the absence of potential variation (R(x)is constant), then H s = H 0 = h 2 k 2 /2m, 
for which the basis states are \k). For them, the current density operator is 

j k (x) = | vl (x)d x \p k (x) - \p k (x)d x \pl (x)} (9.61) 


where ip k (x) = (k\x) = e lkx , and where the normalization for now is assumed to be unity. A simple substitution into 
Eq. (9.61) results mj k (x) = Tik/m, which looks like a velocity alone, whereas the current density encountered in Eq. (3.5) 
was the product of charge, velocity, and number density. Remember that j k is a number current density, so the charge 
current density is a product of j k with q, and that, because the momentum was specified to a particular \ k) rather than 
a distribution J^ k f(k)e ,kx , the constant number density is subsumed in the normalization: that the number density is 
constant everywhere for \k) is a consequence of Heisenberg's uncertainty relation and the unique specification of the 
momentum k. Therefore, the new viewpoint does accord with the old framework. 

Currents flow, and it is commonsense borne of classical mechanics that the current flowing towards the barrier from 
the left should be equal to the current that gets transmitted plus that which is reflected, or j inc =j re f +jtr am ■ How that 
intuition is preserved is edifying. Let the normalization of y/ inc be unity, and choose ys tram = t(k)e ,k ' x and \p re f = r(Jc)e~ lkx . 
The values of k and k! are determined by the height (in energy) of the particle above the local "ground": if the height in 
Region 1 is h 2 k 2 /2m then in Region 2 it is h 2 k' 2 /2m + V 0 . That this is so is straightforward to show: from Schrodinger's 
equation in the absence of a potential, 

~bih ¥k(x)=Ek¥k(x) (9 - 62) 


Substituting E k = Ti 2 k 2 /2m and ip k (x) = N k e ±lkx satisfy that equation. Having a non-zero, but constant, potential V 0 is 
tantamount to replacing E k ^*E k - V 0 and k 2 -*■ (k 1 ) 2 = k 2 - k 2 , where k 2 — ImVjti 2 . Accompanying this, one expects 
that the number density and the current density should be continuous at the boundary (taken to be at x = 0), but 
that requires the wave function i p k (x) and its first derivative d x \p k (x) to be, likewise, continuous at x = 0. To make both 
requirements so, the normalization factors r(k) and t(k) have to satisfy 


WjSD + VV(°~) = VW(° + ) ( 9 - 63 ) 

d x y mc {(r) + d x vq e /( 0“) = d x i// [ram (0 + ) (9.64) 

where the left-hand side refers to Region 1, and the right-hand side to Region 2 of Figure 9.7, a form that looks delightfully 
like a matrix equation: in fact, putting in the forms for y/ inc , \p ref and ys tram , and treating the coefficients as unknown 


14 Alfred North Whitehead, Adventures of Ideas, 1st Free Press Pbk. edn. The Free Press, 1967, p. 72. 
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Figure 9.7 Wave function incident on a barrier. An incident wave (inc) of the form y/ lnc = e ,kx hits a barrier of height V 0 and separates into two waves, 
one reflected (ref) of the form yr Tef = r(,k)e~ ,kx and one transmitted (trans) of the form w tram = Kk)e tk,x . The relation of k! to k is h 2 k 2 /2m = h 2 k! 2 /2m + V 0 . 


components of the coefficients vector, but the exp(±ikx) as the components of the multiplying matrix (and evaluating 
them at x = 0), results in 

(i i)U)-(i -:,)(?) 

The incident wave is normalized for convenience to unity (top component of the left-hand side coefficients vector) and 
no waves are incident from the right (bottom component of the right-hand side coefficients vector). In the matrices, the 
top row is for the wave functions, and the bottom row for their derivatives. 2x2 matrix equations are not much of a 
challenge: multiplying both sides by the inverse of the left-hand side matrix and expanding results in 


/ 1 \ m(k + k'\ 

\m) 2 k\k-k') 

Identifying the components of the column vector on the left with those on the right shows that 



(9.66) 


(9.67) 

(9.68) 


Recall that Eq. (9.60) refers to a probability current and therefore, because the basis states | k) correspond to stationary 
states, the current density;^ is spatially uniform. The probability of being transmitted over the barrier is therefore a ratio 
of the transmitted probability to the incident probability current densities, that is, the transmission probability D(k) is 15 


j inc (k) 


4 kk' 

(k + k'f 


(9.69) 


where the first relation is general and the second form is specific to step function potentials. The reflection probability 
is 1 - D(k). For k > k 0 (an ominous caveat that portends complications for energies below the barrier height), D(k) 
increases monotonically from D(k 0 ) = 0 to D(k » k 0 ) 1, shown in Figure 9.8 for k = k 0 \[x, a rather odd represen¬ 

tation, but one that will be useful below. About that k > k 0 caveat: what does happen when k < k 0 ? First, k' —> ik! 
where now k' = J\k 2 - k%\. Second, y/ tram -*■ t{k)e^ k!x and is real, so that j tram = 0 (all particles are reflected). Third, 
r(k) -*■ (k - ik!)/{k + ik') so that \r(k)\ 2 = 1 , yet another way of saying that all particles are reflected. 


15 The usage of D for the transmission probability follows the hallowed tradition established by Fowler and Nordheim [36], Murphy and Good 
[110], and Modinos [111], and the treatment here shall follow that precedent. Alas, the literature is not uniform, as, for example, T is used by 
Kemble [112] (who reserves D for the current density!) and Kane [113], among others. In short, cave lector, "let the reader beware" (more or 
less). 
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Figure 9.8 Transmission probability for a step function. The factor x is the ratio between the barrier height V 0 and the kinetic energy E of the incident 
particle, or x = VjE. The solid circles correspond to the usual D(k) = Dkkjtfi) for a step barrier. The dashed lines correspond to transmission over a cliff 
for progressively higher cliffs: as the cliff height becomes infinite, the transmission probability approaches 0, meaning all particles are reflected, a 
paradoxical condition which has no classical analog. 


The quantum result confirms the classical intuition that the incident current has to be the sum of the reflected and 
transmitted currents, and that step barriers reflect all particles whose energy is insufficient to surmount them. But right 
about here is where quantum mechanics begins getting peculiar in several ways. First, even when the kinetic energy 
is greater than the barrier height for a potential step, or h 2 k 2 /2m > V 0 , the transmission probability is less than unity: 
some reflection always occurs, and that defies a classical expectation. Second, instead of a step barrier, what about a step 
cliff, where V 0 is negative instead of positive? Following through the treatment, one sees that this amounts to replacing 
—k 2 —<■ +k 2 in Eq. (9.68), and so some reflection always occurs even when going over a cliff. Astonishingly, the reflection 
becomes greater the higher the cliff is, resulting in complete reflection when the height of the cliff becomes asymptotically 
large. That has no classical analog. Such an outcome is coherent (follows from the mathematics) but to expectations 
borne of macroscopic physics it is unintelligible, and so there is little or no clarity. Lastly, in the case where the barrier is 
not a step, but rather a rectangular bump, the transmitted wave function need not be driven to zero within the barrier, 
such that the particle has a non-zero probability of appearing on the other side. The particle is said to have "tunneled 
through" the barrier, and that is where matters get very interesting. 


9.2.3 Tunneling 

I have dwelt longer upon the history of the Samians than I should otherwise have done, because they are responsible for three of the 
greatest building and engineering feats in the Greek world: the first is a tunnel nearly a mile long, eight feet wide and eight feet high, 
driven clean through the base of a hill nine hundred feet in height. The whole length of it carries a second cutting thirty feet deep and 
three broad, along which water from an abundant source is lead through pipes into the town. This was the work of a Megarian named 
Eupalinus, son of Naustrophus... 

- Herodotus 16 

Herodotus, the ancient Greek historian often referred to as the father of history, wrote a massive record of the rise 
of the Persian empire and its subsequent clash with the Greek city-states during the Greco-Persian wars (of which the 
Battle of Marathon mentioned in Section 7.1 was a part) culminating in the failed campaign of Xerxes in 480 BC to 
conquer the Greeks. But his The Histories was more than that: although it did chronicle the battles and intrigues, it also 
entertainingly included fantastic tales and a fairly detailed account of the ancient world, from customs and manners and 
beliefs to engineering accomplishments. Herodotus's identifying of the tunnel of Eupalinus as a marvel of the ancient 
world (a kilometer-long tunnel under a mountain to carry water to the capitol city) may have been instrumental 
to its discovery around 1884 on the island of Samos, birthplace of the mathematician Pythagoras. The tunnel was a 


5 Herodotus, Herodotus: The Histories (Aubrey De Selincourt). Middlesex: Penguin Books, 1954, p. 228. 
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masterpiece of the application of geometry and calculation to a difficult problem of excavating two holes on either side 
of a mountain such that they intersect in the middle. That it was executed at the behest of the tyrant Polycrates (to hide 
the water supply so as to prevent an invading army from denying that essential resource to the city during sieges) does 
not diminish the engineering accomplishment. 

Tunneling calculations are difficult, and always have been. Quantum mechanics makes them complicated in a com¬ 
pletely different way. In quantum mechanics, tunneling refers to electrons in a forbidden region, such that the probability 
of finding an electron where the potential energy associated with the barrier exceeds the energy of the electron is 
non-zero. Two exercises emphasize different aspects of the tunneling problem, and both make use of the formalism 
developed above: (t) the wave packet problem describing a single electron can be time evolved and its interaction with 
the step function barrier or (ii) the stationary problem describing the variation of electron density near the potential 
barrier. 


9.2.3.1 Wave packet 

The density associated with a single state \k) is constant everywhere, a consequence of Heisenberg's Uncertainty Principle. 
A localized electron therefore corresponds to a spread of momentum values, as in the commonly considered Gaussian 
wave packet, otherwise known as the minimum uncertainty wave packet, for which AxA k =1/2. More generally, though, 
the wave function is given by 


V(x, 0 = 




e*i* + r(k_j)e ik -i x 

tikyv 


(x < 0) 

(x^O) 


(9.70) 


where v 2 = kj - k 2 g , k 0 = \j2mV 0 /ti, and k_j = -kj. If k 0 > kj, then k is imaginary. The negative index k_j notation corre¬ 
sponds to states that move away from the barrier (to the -x direction) instead of toward it (to the +x direction) so that 
the summation need not be further divided into positive and negative momentum states. The coefficients C ; - = C(kj) are 
found from the orthonormality of the basis states ys k (x) and so 

C(kj) oc J i^.(x)V(x, 0 )dx (9.71) 

Choosing Cj to correspond to Gaussian wave packets is often treated (e.g., [44]), but their evaluation can be a challenge, 
especially when first encountered. To have a computationally easier time of it, consider the simpler circumstance where 
at t = 0, the wave function is given by yr(x, 0) = 0(x + /)©(—x), for which 0(x - x 0 ) = 1 if its argument is greater than 0, 
but 0 otherwise (the Heaviside step function). The Cj for such a circumstance can be evaluated easily when using plane 
wave basis states, but the wave packet that results is no longer a minimum uncertainty wave packet as it would be in 
the Gaussian case. That is a fairly minor price to pay for the ease of calculation which results. 


EXAMPLE: Find the C ; - for i//(x, 0) 

= 0(x + /)0(-x). 


SOLUTION: From Eq. (9.71), 



C, ‘ C °J 

[ e ik i x @(l - x)@(x)dx 


= C»J 

ry'vjx^X(t-e-'V) 

(9.72) 

where C 0 is a global normalization that is taken to be unity. 



With the choice of Eq. (9.72), simulating the time evolution of a wave packet is gentle. Starting the clock a bit earlier 
at time t = —t 0 allows for the visualization of the wave packet as it comes in to collide with a barrier. It is convenient 
to work in a system of units such that kj = (j/N)k min with N being the number of states in the packet, with k min being 
conveniently chosen (e.g., n/A), and with lengths and energies such that Ej = kj. Such a choice of units is friendly to 
calculation. For the choices of t 0 = 20 and N = 48, then focusing on a region of space -40 < x < 20 allows the wave packet 
to be seen colliding with the barrier (a larger visualization region would show more than one hump in the wave packet. 
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the location and spacing of which are a consequence of finite N and the spacing between the various momentum states 
chosen). A small N has the additional advantage of rendering the packet, such as it is, to look more like a wave (i.e., a 
smoothly varying crest) than a rectangular pulse. Simulations of that collision using the algorithm given in Section A3.5 
are shown in Figure 9.9. The first square (upper left) waterfall plot in that figure represents the "no barrier" configuration: 
a top-down view of it would nicely show the spreading of the wave packet that often occupies discussions of wave packet 
evolution, and so it is shown as the left-hand image in Figure 9.10. The right-hand image shows the other peculiar case 
of the impact of the wave packet with the cliff potential characterized by V 0 = -1 (lower right block of Figure 9.9), in 
which only a portion of the packet is transmitted. 



Figure 9. 

upward vertical direction. Units are chosen such that: t = -20 to t = +20 in 24 steps (lowest line to highest). The left boundary is at x = -40.0 and the 
right boundary is at x = +20.0. The potential barrier is of height V 0 = k 2 0 , and S = k? where k, = jx/AN and N = 48. The step potential barrier is to the 
right, beginning at the dashed line. The lower right-hand plot is for a negative potential V 0 = -1 such that k a = i. 



Figure 9.10 Top-down views of wave packet spreading. Left: in the absence of a barrier. Right: the wave packet encounters a cliff potential. The 
horizontal axis is position, the vertical axis is time. Observe in the former the elongation of the ellipse after t = 0 (the high point). Observe in the latter 
the reflection of much of the wave packet and the transmission of a portion, a behavior without a classical analog. 
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9.2.3.2 Density variation 

Consider now not one electron with different probabilities for various energies in the packet, but rather a collection of 
electrons such that the probability of an energy state being filled is governed by the Fermi-Dirac distribution, or 

Wit)) = e-^WiO)) = X e^CjWj) (9.73) 

where \Cj\ 2 is the probability that the state characterized by Ej is occupied and 

p = (\p(t)W(t)) = <vr(0)M0)> = ^ I Cj(Ej)y/j(x) \ 2 (9.74) 


where yfx) = (xW(kj)) and is as previously defined in Eq. (9.70). The Cj, though related to f PD {E), are not the same 
as for the one-dimensional problem. Because the goal is to find a density variation that has some resemblance to the 
one-dimensional variation near the surface of a metal, take a look at how Eq. (3.26) is written if the distribution function 
corresponds to a Fermi-Dirac distribution. It becomes 

J < = T, / (£)'^ /«W*,} <” 5 > 

The quantity in the brackets is the Fermi-Dirac distribution with the transverse velocity components integrated over: in 
the future, it will be known more generally as the supply function, but for now treat it simply as an interesting choice. 
For simplicity, consider the zero temperature limit, for then the Fermi-Dirac distribution function becomes a Heaviside 
step function itself, and the quantity in curly brackets in Eq. (9.75) becomes 


i/ V 

2^ Jo 


* k L dk x = -{k 2 p - k%) 


(9.76) 


An integration of Eq. (9.76) over dk x from —k F to +k F (including the factor of \/(2n), plus the spin of the electron g = 2) 
will, like the density integration of Eq. (6.8) before it, result in the number density of electrons at zero temperature of 
p = kl/ln 2 , as in Eq. (6.24). 


EXAMPLE: Find the Cj that mimic Eq. (9.76). 

SOLUTION: In numerically convenient units leading to Eq. (9.72), the maximum momentum k p occurs for 
j = N + 1. Therefore, let k x = (jk p )/(N + 1), which makes C w+1 = 0 (the supply function vanishes at k x = k p ). 
Inserting k x -*■ (jk p )/(N + 1) into Eq. (9.76) and ignoring overall factors such as l/4;r (the ratio of the density to its 
boundary value will eliminate common factors), it is found that 


.2 _ (N+ l) 2 -j 2 
Jj (N + l) 2 


(9.77) 


With Eq. (9.77), the density is given by 


p(x) ex Cf 


||(e'V + r(k_ ; yV)| 2 

|f(k ; yV| 2 


(x < 0) 
(x^O) 


(9.78) 


the numerical evaluation of which, using the algorithms of Section A3.5, gives the densities normalized by their values 
on the left boundary (at x = -3), as shown in Figure 9.11. When the barrier is lower than the maximum momentum 
present, or k 0 < k F , then the density in the barrier region is non-zero (pix > 0)) and over the barrier emission is seen to 
occur. However, when k 0 > k p , it is clear that pix » 1) -► 0, but that near x « 0 the density under the barrier exponentially 
tapers off but is non-zero for regions close to x = 0. For an infinitely high barrier, the reflection probability would be unity, 
and the electron density would vanish at the barrier. But it is also seen that if, instead of a step function, the barrier had 
finite thickness, then for thin enough barriers, the density of electrons at the right-hand boundary need not vanish: if 
the density does not vanish, then electrons are present on the right-hand side of the barrier, and tunneling is said to have 
occurred. 
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Figure 9.11 Electron density i 
k 0 > 1.0, all of the wave vectors in p(x) are smaller than k 0 . The barrier 
area). Non-zero density is seen even in those cases for x > 0, and there 
region, that is, a region where they are not classically allowed. 



rting at x = 0 (dashed line) and continuing to the right (g 
ons quantum mechanically penetrating into forbidden 


9.3 The harmonic oscillator 

What has happened will happen again, and what has been done will he done again, and there is nothing new under the sun. 

-Ecclesiastes 1.9 17 

The harmonic oscillator problem is the remaining quintessential problem of emission physics, for which the quantum 
mechanical treatment demonstrates techniques and subtitles useful to understand. Although three-dimensional formu¬ 
lations can be undertaken, a one-dimensional approach is sufficient. Two approaches to solving the harmonic oscillator 
problem are considered: first, using a wave function approach, and second, by a less conventional but useful quantum 
distribution function approach. 


9.3.1 Wave function approach 

New things are made familiar, and familiar things are made new. 

- Samuel Johnson 18 

The harmonic oscillator is characterized by the cyclic transfer between kinetic energy KE = p 2 /2m to potential energy 
PE = Kx 2 / 2 (where K is a constant). The position x(t) and the momentum p(t) = m{dx/dt ) are therefore [90] 


x(t) = x 0 cos loot + (/)) 

pit ) = -x 0 moo sin loot + <j>) (9.79) 

where x g is the maximum excursion and (jo is a phase, so that the total energy E = KE + PE = mx 2 oo 2 / 2 is a constant of 
the motion, and Schrodinger's equation is then, by virtue of the canonical procedure for transitioning between classical 
and quantum mechanics, 

(“££ + t^ (W2 )*' w=e *' w <9 - 80) 

where a new term, k 0 , has been introduced and defined by K/2 = h 2 k g /2m, so as to make the energy-like terms obvious 
and k 0 x dimensionless. As with the ladder operators for the hydrogen atom, Schrodinger's equation can be solved without 
having to endure differential equations by making use of the commutator relations between k = p/Ti and x given in 


11 The New English Bible With the Apocrypha. New York: Oxford University Press, 1971, p. 789. 

18 Samuel Johnson, Lives of the English Poets: Prior, Congreve, Blackmore, Pope. Cassell & Company, London, 1891. From the section on Pope. 
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Eq. (8.52). The first step is to write matters in a more agreeable way as 
(& + k 4 0 x 2 ) \y,) = k 2 n \y,) 


(9.81) 


where E = Ti 2 k 2 /2m. The subscript n appended to k n anticipates that the energy E = Ti 2 k 2 /2m will be quantized and that 
the corresponding energy levels E n will be labeled by n. The harmonic oscillator Hamiltonian offers a very direct and 
simple method to show the correspondence to the more familiar classical relations using Eq. (8.72): the velocity and 
force are (where only the commutators that survive are shown) 


dt h [ 2m \ 2m L J m 


d i \h 2 ld , J ih 2 Id r , 

—p = - —— -x 2 , hk\ = \x M = -Kx = F 

dt h [ 2m \ 2m l J 


(9.82) 

(9.83) 


which, apart from the hats, have the same forms as their classical counterparts. 

The left-hand side of Eq. (9.81) is suggestive: it almost begs to be represented as (~itc + k 2 0 x) (ik + k 2 x) as it would if k 
and x commuted, but because they do not. 


{-ik + k 2 xj {ik + k^xj =k 2 + k A 0 x 2 + itf {x, k] 
= k 2 + k%x 2 - k 2 0 


(9.84) 


Therefore, analogous to the ladder operators of Section 9.1.2, introduce what will be called the creation a' and annihi¬ 
lation 2 operators 

o' = kg 1 {-ik+k 2 0 x) (9.85) 

a = k~ l {+ik+ k^x\ (9.86) 


EXAMPLE: Show that la, a f ] = 2. 

SOLUTION: 

[a, a'\ = k~ 2 |ik + k 2 x, -ik + k 2 %J 
= k~ 2 {ik 2 0 \k,x\ - ik 2 \x, k\) 
= k~ 2 (2 ik 2 0 l-i)) = 2 


From the last example and Eq. (9.84), it then follows that 

{tfcl+l)W)=(^ S j W) (9.87) 

As for the apocalyptic terminology, and as was done for the angular momentum ladder operators in Section 9.1.2, 
consider what H = {h 2 k 2 /2m) (a^a + 1) acting on d\y/) and a*\y/) give, and do so for the annihilation operator first, that 
is, find Ha\x//). Apart from a coefficient of h 2 k 2 /2m, it involves finding 

(fl f a + = (aa t - [a,2 f ] + i)a|i^> 

= d(d'd-i)\ ¥ ) = (^-2 S )d\ v ) (9.88) 

In other words, the state a\y/) is an eigenstate of H but with an energy h 2 k 2 /m lower than the state \cp) = d\ ¥ ) (one finds, 
if one cares to prove it, that the action of a* increases the energy by the same amount). Observe that this amount is twice 
the characteristic energy fi 2 k 2 0 /2m. But just as ( ¥ \y/) > 0, so to is {(p\cp) = (i// |a t 2| y) = ( kjk 0 ) 2 - 1 > 0 and equals 0 (for 
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the ground state) when k 0 = k 0 (call that case n = 0). Succinctly, the ground state energy is h 2 k 2 /2m = tioo/l, and the 
energy E n of the higher states, as a consequence of the arguments leading to Eq. (9.88), goes as 

E n = (2n+ 1) ^ (9.89) 

and correspond to an application of (S t )" on the ground state. Instead of referring to the eigenstates as \y/ n ), it is simpler 
to call them | n). And so, 

(n' ]i t tj nj = 2n5 nn , (9.90) 

Normalization is imposed by demanding (n\n) = N 2 (0 \a n {a r ) n \ o) = 1. In principle, this could be accomplished by 
patiently commuting all the annihilation operators to the right, and the creation operators to the left. Or, one can cheat 
and notice 


(n\ n) = {N n /N n _ l f{n-\\atf\n-\) 

(9.91) 

= (N n /N n _ x ) 2 {n-\\a<a + 2\n-\) 

(9.92) 

= (N n /N n _ x ) 2 (n- l\2(n- l) + 2\n- 1) 

(9.93) 

ll 

af 

ii 

(9.94) 

from which it is concluded by mathematical induction that 

N n 1 

(9.95) 

N o 


Determination of the ground state is accomplished by observing that 


(x \a\ 0} = (d x + k^x) y/ 0 (x) = 0 (9.96) 


EXAMPLE: From Eq. (9.96), find the normalized ground state wave function for the harmonic oscillator. 
SOLUTION: The equation can be rewritten as dy//y/ = kjxdx, which has the solution yr 0 Qc) <x exp (~{k 0 x) 2 /2). 
Demanding that its integration over all x gives unity shows that 


Vr 0 (x) = N 0 exp 



H^) 


(9.97) 


Use of the creation operators and the normalization relations allows for finding the higher energy states. The expo¬ 
nential factor of ys 0 (x) persists in i/r„(x), so try a solution of the form 

(x\ n) = y/„(x) = N n H n (k 0 x) exp {-k 2 0 x 2 / 2) (9.98) 

where H n (k 0 x) is a function determined by the action of the creation and annihilation operators. From the normalization 
of the wave function, it follows ^ 

J e-^ x)1 H n {k 0 x)H n ,{k 0 x)k 0 dx = y/n2 n n\8 m , (9.99) 

Changing the variables to s = k 0 x, the actions of a f and a give recursion relations 

ay/Jx) = V2ny/„_ , (x) -> ^HJs) = 2 nH n _, (s) (9.100) 

as 

a i y n (x) = \j2(n + l)i/r „ +1 (s) ^ jH n (s) = 2sH n (s) - H n+l (s) (9.101) 


the combination of which shows 


H n+l (s) = 2sH n (s) - 2nH n _ x {s) 


(9.102) 
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These are the relations of the Hermite polynomials, one of a number of well-characterized orthogonal polynomials 
[60, 114]. Given the relations between the H n (s) just deduced, the question is then whether an efficient method of 
generating them can be found. The recursion relations of Eqs (9.100) and (9.101), and awareness of how the creation 
operators give rise to the higher n states, results in 

H n (s) = /l 2 {-d s + s) n e-^l 2 (9.103) 

but a simpler relation is possible. For y = cs p for constant c and p, 

d s y+ cpx p ~ ] = 2cpx p ~ l (9.104) 


and so 

& (~d s + cpx p ~ 1 ) e~ y = d® (-d s ) e~ ry 

With p -* 2 and c -*■ 1/2, then Rodrigue's formula for the Hermite polynomials results and is given by 
H n (s) = /{-d s ) n e- sl 

The first few are 


(9.105) 


(9.106) 


H 0 (s) = 1 
H l (s) = 2s 

H 2 (s) = 4s 2 — 2 (9.107) 

H 3 (s) = 8s 3 - 12 s 

H 4 (s ) = 16s 4 - 48s 2 + 12 

For the harmonic oscillator, the potential energy is symmetric in x, that is, V(x) = Kx 2 = V{—x). The wave functions 
though come in two types: those which do not change sign as x -»• —x and are called the even parity solutions \p 2n (-x) = 
+yr 2 „(x), and those that do and are called the odd parity solutions i/Q n +i( -x ) = ~Wm+ iC*)> as can be anticipated from the 
behavior of {-d s ) 2n and (-d s ) 2n+1 in Eq. (9.106). Parity is one of those interesting quantum mechanical properties that 
is physically measurable, but which has no large-scale classical analog [41]. Represented as P, it behaves as P\x) = |—x). 
Therefore, P 2 = I is another identity operator, making identity (Section 8.2.1) more nuanced. Additionally, for potentials 
that depend on the relative separation of the interacting particles (as in the Coulomb potential), then parity commutes 
with the Hamiltonian, [H, P\ = 0, so that parity is a constant of the motion, meaning that states prepared with a definite 
parity preserve that parity as they evolve in time. 

9.3.2 Quantum phase space approach 

In classical statistical mechanics the relative probability ...is given for statistical equilibrium by the Gibbs-Boltzmann formula...In 
quantum theory there does not exist any similar simple expression for the probability, because one cannot ask for the simultaneous 
probability of the coordinates and the momenta. 

-E. Wigner [115] 

...the use of negative numbers as an abstract calculation permits us freedom to do our mathematical calculations in any order, simpli¬ 
fying the analysis enormously and permitting us to disregard inessential details. The idea of negative numbers is an exceedingly fruitful 
mathematical invention. Today a person who balks at making a calculation in this way is considered backward or ignorant, or to have 
some kind of mental block ...we have a similar strong block against negative probabilities. 

- Richard P. Feynman [116], p. 236 

The quantum distribution function approach updates classical phase space ideas to the quantum realm. Moments 
of a distribution are again related to the density, current density, and energy spread encountered in Section 7.4, but 
not without some effort. The Wigner function is a well-known example [117], and its consideration is immeasurably 
improved when it is applied to a well-understood problem like the harmonic oscillator, even though it can be applied 
to electron emission. For the harmonic oscillator, the quadratic nature of the harmonic potential energy V(x) = Kx 2 /2 
makes the the time evolution equation of the Wigner function particularly straightforward. 
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The usage of distributions to calculate number density and current density are familiar from Section 7.3, but their 
relation to the operators p H and j H of Section 9.2.1 requires sandwiching those operators between suitably chosen bras 
and kets to insure that number and current density are given by 19 

p(x,t)=—f fix, k, t) dk (9.108) 

2^ J -oo 

J(x, t) = — [ —f (x, k, t) dk (9.109) 

2^ 7-oo m 

The Wigner distribution fix, k, t) contains all the quantum mechanical effects yet satisfies relations much like Boltzmann's 
equation of Section 7.4. This is not as simple as it may appear. Without loosing too much, the discussion can be confined 
to one dimension in space and similarly for momentum. Phase space points move around with time, but the number 
of particles, or points, in a small region has to be conserved (a concept that will reappear in Section 33.3.1, treating 
Liouville's theorem), or 

fix,k,t)dxdk = fix', k', t')dx'dk' (9.110) 


For infinitesimal times dt, the phase space points move as x' = x + tikdt/m and k! = k + Fdt/ti. The volume of phase space 
therefore changes: treating the probability as an incompressible fluid [65] (which entails that the number of particles in 
the phase space volume element does not change as it evolves), then 

dx'dk! = \* X dxdk = | \dx dk = dx dk (9.111) 

\d k x' d k k'\ | hdt l| y ' 

and so consideration can be focused on what/(x, k, t) is doing in Eq. (9.110). Before doing so, though, a small aside is 
instructive: observe that the linear transformations under discussion can be represented as [118] 


? = M = I o i £llkl=T(<a)-| 


(9.112) 


where T is a translation operator, a = itikdt/m ) and b = iFdt/h). Translation operators commute with each other, or 
[T ia', b'), T(a, 1;)] = 0, but they do not commute with the rotations R(0) (analogous to the lower right 3x3 matrix of Eq. 
(8.33) or Eq. (9.4)), that is, |T(«, b), R(0)] f 0. Non-commutation of rotations and translations is easily appreciated from 
experience: turning right then going forward a step lands one in a different place than the reverse order of going forward 
one step then turning right. The translation and rotation transformations constitute the Euclidean group and are known 
as canonical transformations [97, 118], 

Substituting Eqs (9.108) and (9.109) into the continuity equation (Eq. (7.19)) for a particle in a linear field F results 
djix,k,t ) = —d/ix, k,t) + |d/(x, k,t) (9.113) 

for which the solutions are 

fix, k,t + dt)=f(x- ^ dt, k - | dt, tj (9.114) 

Compared to Eq. (7.16), x = hk/m and k = F/h. If F is constant, then for longer times [97] 

(9.115) 

For a steady-state distribution (df/dt = 0), the paths or trajectories of physical particles must therefore follow the contour 
lines of a surface plot of the distribution function fix, k) in phase space. The Wigner function itself is therefore constant 
along paths of constant energy. That will be useful. 

So far, nothing in the discussion has departed from the classical outlook. The difficulties start when the form of fix, k) 
in Eq. (9.108) is sought. Naively, it might be thought that/(f?)|yq.(x)| 2 , as suggested by Eq. (9.53), is the form, but then 
how Eq. (9.109) would follow from Eq. (9.56) becomes a problem. Something a good deal more involved is required. 


the current density J explicitly has a q in the coefficient, so that it is a charge current density with units of amp/cm 2 . 
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Wigner proposed the simplest of the many possible candidates, and it is (leaving the H subscript off of the operators for 
convenience) [115, 117] 

f(x,k,t) = 2 J e 2 ^ {x + y\p{t)\x-y)dy (9.116) 

This is pretty good: the density p(x, t) falls right out when an integration over dk is performed, as the integral over e 2,ky 
gives a <5(y) function (see the n = 0 form of Eq. (9.123) below). After the dy integration, the warm result of Eq. (9.108) 
is regained with f(E)\(x\ip(t))\ 2 as the integrand, in keeping with Eq. (9.53). The time evolution of the Wigner function, 
though, is a good deal more challenging. Given Eqs (9.54) and (9.56), 

|/(x, k,t) =-2 J°° e 2iky dy i y {x + y\^-d,{pit), k)\x-y) 

+ <* + y| l - [V(x),p(t)] |x-y>} (9.117) 

where the curly brackets denote the anti-commuator [A, B\ = AB + BA, compared to the commutator [A, S] = AB — BA. 
Take each expression in turn. Recalling Eq. (8.50), in general, a function G(x) that can be Taylor expanded is therefore 
such that G(x)\x±y) = G(x ±y)\x±y). That indicates 

(x + y| d- x | pit), £ j | x-y) = k ( d x+y + d x _ y ) (x + y\ pit)\x - y) 

= 2kd x (x + y\p{f)\x-y) (9.118) 


and likewise, 

(x + y\ [V(x),p(t)} \x-y) = (y(x + y)-V(x-y)){x + y\p(t)\x-y) (9.119) 

Introducing the shorthand 

V(x,k-k') = J e 2l(k - k '» { V(x +y)~ V(x - y)} dy (9.120) 

results in the time evolution equation for the Wigner function 

±f(x,k,t) = ~^f(x,k,t) + f V(x,k-k')f{x,k’,t) dk' (9.121) 

The intrusion of quantum effects is now evident in the non-local nature of the function V(x, k - k'): it depends on contri¬ 
butions at a distance y from x. 

There is much that makes a study of the Wigner function fascinating in its own right. A few examples conveys that. First, 
a trajectory interpretation can be had by constructing a Wigner quantum potential under which the electrons move [119, 
120]. Wigner trajectories have much in common with an alternate formulation of quantum mechanics leading to Bohm 
trajectories [121-124] through the introduction of a quantum potential, which is introduced in Eq. (11.9). The Wigner 
trajectory concept sheds interesting light on the nature of tunneling in general and resonant tunneling (see Section 
19.2.4) in particular [125-129], Second, the Wigner function provides a satisfying take on the perennial problem of the 
spreading of the wave packet that dominates introductory treatments of quantum mechanics: the spread increases with 
time, 20 but to see it, the usual treatment based on finding the time-dependent wave function ip(x, t ) is rather tortuous. The 
elegance of the Wigner function analysis for the same problem [44, 85, 97, 118] is by comparison aesthetic. In particular, 
and as observed by Kim and Noz [118], whereas the uncertainty product AxA k for a Gaussian wave packet increases 
with time, the uncertainty associated with the Wigner representation can be formulated in a time invariant manner so 
that the uncertainty is represented by a sheering ellipsoid whose area remains constant over time and dictated by h yet 
is enclosed in a box of dimensions (Ax)(hAk) that keeps increasing. Such concepts are tremendously interesting but, alas, 
not to the point here. 

Instead, the concern here is with how quantum effects modify the classical description of the phase space. The answer 
is illuminated by Taylor expanding the terms V(x + y) - V(x - y) to obtain 

V( X + ,)-V(x-,) = f i ^GL.(l) 1 "v( X ) (9,122) 


20 See Figure 9.9, where for the high barrier case (k 0 =4), as the wave packet moves away from the barrier, a noticeable widening of it 
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along with 


to obtain 




lf(x,k,t) = -^d x f+^(d x v)d k f 


n 2 2 ” 


(-if 


{d 2n +l y) (af+1/) 


(9.123) 


(9.124) 


For treating electron emission barriers, V{x) is such that all of the higher order gradients must be retained [130, 131], 
making a simulation based on Eq. (9.121) the preferred approach. But the harmonic oscillator is different: for it, all 
the higher order derivatives of the potential vanish. This has consequences. For the time-independent harmonic oscillator 
distribution function, Eq. (9.124) becomes df(x,k) = 0 where 


0 


hk d 1 , v , d 
~mTx + h^Tk 


(9.125) 


EXAMPLE: Show that for a smooth function qr(E) that depends only oi 
OcpfE) = 0 where 0 is defined by Eq. (9.125). 

SOLUTION: Expand <p(E) as a Taylor series in E of the form 

r the total energy E = h 2 k 2 /2m + V(x), then 

cp(E) = £ CjE j 



With OB' = jE> 1 OE, then the vanishing of OE is enough to achieve the demonstration. Explicitly, 


'h 2 k 2 
^ 2m 

+ V(pc)j 


|=0 


as required. 




EXAMPLE: Find the n = 0 Wigner function f 0 (x, k). Show that it is / 0 (e) along a classical trajectory defined by 
Eq. (9.79). Finde. 

SOLUTION: From Eq. (9.98), 

Vo(x + y )¥ 0 (* “ y) = -p exp {~k 2 0 (x 2 + y 2 )) 

Then, from Eqs (9.116) and (5.11) and using the relation 


it is found that 


Ux,k)= 2k ^e-W [ e -t x2+2iky dy 

= 2 exp k 2 0 x 2 ~(jfj ^ = 2e ~ e "*• /o( e ) 


where e — 2 E/Tko = k 2 x 2 + (k/k 0 ) 2 . 
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EXAMPLE: Find the n = 1 Wigner function f } (x, k). Show that it is /j(e) along a classical trajectory defined by 
Eq. (9.79). Finde (again). 

SOLUTION: From Eq. (9.98), 


ViOc + y)Vi(x-y)= -pHJk'Qc + y))H^k„{x - y))e-^+^ 
\ n 

= 4 = (** -y 2 ) e-W+rt 


Then, using the same relations as the previous example, and using Eqs (5.11) and (5.12), 


where e — 2E/hco = k 2 0 x 2 + (k/k 0 ) 2 . 


fi(x, k) = 2 (2e — 1) *■» /i(e) 


(9.127) 


The last two examples explicitly show the harmonic oscillator, f{x,k) depends only on the combination 
e = k 2 0 x 2 + (k/k 0 ) 2 , a rather interesting result that generalizes to higher n. Along the classical trajectories, 

f (k~ l cost),k 0 sin6) =f(k~ l , 0) = f(0,k 0 ) (9.128) 

for 2n > 9 > 0. The second example shows further that for x in units of 1 /k 0 and k in units of k 0 , the Wigner function 
for the harmonic oscillator appears as rotationally symmetric in phase space, reinforcing that the classical trajectories in 
those particular units are circles. But here, it means more: it means there is an easier way to find the Wigner function 
for larger n than 0 or 1 because f n (x, k) need only be evaluated at k = 0, after which e = k 2 x 2 ->■ k 2 x 2 + (k/k 0 ) 2 with 
x -* k~ l cos (0) and k-> k 0 sin(0). 

This means that there exists a (simpler) way of fi nding/„(x, k) by simply fmding/„(x, 0) first, and then replacing k 2 x 2 —> e 
at the end. As practice, one can show this easily for f 0 (x, k) and/,(x, k). For general n, a bit more is required. Let 

f n (x, 0 ) = 2N 2 n J H n (u + s)H n (u - s)e~ u2 ~ s2 ds (9.129) 

where s = k 0 y and u = k 0 x. The functions H„{x) are polynomials, meaning that H n {u ± s) can be Taylor expanded about s 
(not u, because the use of the orthogonality relation of Eq. (9.99) will be needed) with a finite number of terms, and 
because of the astonishingly convenient relation of Eq. (9.100), it is found 


H n (u ±s) = (±1)” V nl (±2 uy'H {s) 

“ 0 jKn -])l nj 

This form allows the orthogonality relations to be used, and it follows that 
f„(x, 0) = 2 J yf n (x + y)ip„(x - y)dy 

= 2N 2 j H n (u + s)H n (u - s)ds 
2 " (-2 u 2 ) j 

= 2(—l)"e _ " £ ^- I— 


which, after the replacement u 2 -* e, 
equation, but also that the next few 


O '!) 2 («-/)! ' 

gives the final form of/„(e). One can check that/ 0 (e) and /j(e) are covered by this 


f 2 (e) = 2 (l — 4e + 2e 2 ) e~ e (9.132) 

/ 3 (e) = | (-3+ 18e — 18e 2 +4e 3 )e“ E (9.133) 

/ 4 (e) = | (3 - 24e + 36e 2 - 16e 3 + 2e 4 ) <T e (9.134) 
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Figure 9.12 The Wigner function f n (e), where e = tfx 2 + (k/k 0 ) 2 for the harmonic oscillator (see Eqs (9.132)-(9.134)). The energy E = {h 2 k 2 /2m)e. 
Observe that / 2 „ +1 (e -*■ 0) = -1 and f 2n (e -* 0) = +1. Observe also that only / 0 (e) is non-negative. 



k 0 x 


Figure 9.13 The density p n (x) for n = 2 (Eq. (9.135)), normalized to its maximum value. Observe that p 2 (x) > 0. 

and are shown in Figure 9.12: the lines rotated about the vertical axis would give a surface plot of the phase space f n (x, k). 
Lastly, observe that although f n (x, k) can be less than 0, the density p n (x) given by Eq. (9.13) is not. As an example, the 
case p 2 (x) is shown in Figure 9.13, for which 



(9.135) 


as can be shown by substitution and straightforward integration after the replacement of e ^ (k 0 x) 2 + (k/k 0 ) 2 in 
Eq. (9.132). 






PART II 

The canonical equations 


And he that calls on thee, let him bring forth 
Eternal numbers to outlive long date. 

- William Shakespeare 1 

The word canonical, particularly in mathematics and the natural sciences, refers to an authoritative or orthodox for¬ 
mula or rule. The "canonical equations" here are the four equations of electron emission (thermal, field, photo, and 
secondary), plus one other imposing limits on that emission (space charge), all of which were developed by the 1930s 
even if subsequent developments generalized or modified the equations. The canonical forms are pervasively used in the 
literature: before examining how they have changed, understanding from whence they arose is profitable. 

The enduring popularity of the canonical equations is in no small measure derived from their reasonably good account 
of the phenomena they treat, the simplicity of the equations, and the distillation of the phenomena into a small number 
of empirical (albeit not always well-understood) parameters. But reverence is not the same as uncritical acceptance, 
so the equations will be brought forth in a manner not necessarily the same as the way they were first developed: as 
Thomas Jefferson advised, "...laws...must go hand in hand with the progress of the human mind. As that becomes more 
developed, more enlightened, as new discoveries are made, new truths disclosed, and manners and opinions change with 
the change of circumstances, institutions must advance also, and keep pace with the times." 2 


^ef. [37]: Sonnet 38, p. 1459. 

2 Jefferson to H. Tompkinson (aka Samuel Kercheval), July 12, 1816, in The Writings of Thomas Jefferson, ed. P. L. Ford. New York: G.P. Putnam's 
Sons, 1892. The quote appears on the Jefferson Memorial in Washington, DC. 
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A brief history 


Det er ganske sandt, hvad Philosophien siger, at Livet maaforstas bagliends. 

Men derover glemmer man den anden Sietning, at det maa leves forlxnds. 

- S 0 ren Kierkegaard 1 

The theoretical equations schematically illustrated in Figure 10.1 are often designated by the names of those who 
first developed them. Visually, the emission mechanisms are distinguished by where the emitted electrons originate and 
by how those electrons are able to move past a finite thickness barrier. Most metals have a reasonably hefty barrier, 
characterized by the work function ® that is several electronvolts above their Fermi energy p: for copper, O = 4.5 eV, 
whereas for cesium, O = 2.14 eV. For emission over the barrier, regardless of thickness, the electrons have to be provided 
with sufficient energy so that they may overcome it, which can be accomplished by heating, absorption of photons, or 
acquiring energy via collisions with other electrons. For emission under the barrier, the barrier must be made thin enough 
that electrons have some probability of tunneling through it. Most tunneling electrons originate near the Fermi level, 
and the insensitivity of p to the temperature T is reflected in the insensitivity of the field emission current to T as well 
for metals. Following the numbering established in Figure 10.1, the canonical equations are described by the following 
terms (along with references to the articles introducing them): 

• field emission (Chapter 13) for electron emission by the application of high electric fields that enable quantum 
mechanical tunneling, leading to the Fowler-Nordheim equation [36] 

• thermal emission (Chapter 12) for electron emission by heating the cathode so that a thermal tail of the distribution 
function has sufficient energy to overcome the surface barrier, leading to the Richardson-Laue-Dushman equation [33] 

• photoemission (Chapter 14) for electron emission by the absorption of photons such that the electrons have sufficient 
energy to pass over the barrier, leading to the Fowler-DuBridge equation [34] 

• secondary emission (Chapter 15) for electron emission by bombardment with high-energy primary electrons that 
transfer energy to secondary electrons such that they have sufficient energy to pass over the surface barrier, leading 
to the Baroody equation [35] 

• space-charge limited emission (Chapter 16) for the limiting of current due to charge in the anode-cathode (AK) gap 
that acts to suppress the field at the surface of the cathode or limit the amount of charge crossing the gap [132], 


10.1 Thermal emission 

The development of an equation describing thermal emission by Richardson in 1901 predicted that ln(J/ \fr) was a linear 
function of \ /T. Some 12 years later after an experimental demonstration that (i) the effect was not the consequence of 
chemical processes and (ii) that electric current in metals was carried by electrons, Richardson modified that to ln(J/T 2 ) 
being linear [133], saying 

Taken together these experiments prove that the emission of electrons does not arise from any interaction between the hot filament and 
surrounding gases or vapors nor from any process involving consumption of the material of the filament. It thus follows that the emission 
of electrons from hot tungsten, which there is no reason for not regarding as exhibiting this phenomenon in a typical form, is not a 
chemical but a physical process. 

Motivated by the difficulties faced by that theory in the explanation of conductivity and specific heat, Richardson 
(in 1912) and von Laue (in 1918) ascertained the correct T 2 dependence of the coefficient. In 1923, Dushman's quantum 


1 "It is quite true what Philosophy says, that life must be understood backwards. But do not forget the other Phrase, that it must be lived 
forwards." Soren Kierkegaard, Joumalen JJ, 167 (1843), Soren Kierkegaards Skrifter, Soren Kierkegaard Research Center, Copenhagen, 1997, 
vol. 18, p. 306. 
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Figure 10.1 Electron emission schema: (1^1) material is to the left, the vacuum region is to the right; (5) cathode to the left, anode to the right, vacuum 
in the anode-cathode (AK) gap. Field (1) and thermal (2) shown for metals (no band gap); photo (3) and secondary (4) for semiconductors (band gap). 
Field emission relies on tunneling through the surface barrier, the others on emission over the barrier. Space charge (5) is not an emission mechanism but 
rather a current limitation mechanism. 


mechanical analysis related A RLD to fundamental constants. Fowler in 1929 modified A RJD to include a factor of 2 for elec¬ 
tron spin. By the 1930s, the so-called "Edison effect" was dubbed "thermionic emission" and, given its preeminence in 
vacuum tubes used in radio communications, experiments were directed to exploring low work function coatings such 
as barium, cesium, and thorium to improve electron emission at lower temperatures. It rapidly became clear that the 
surface work functions were not uniform nor characterized by a single work function <t>: rather, different crystal faces of 
polycrystalline refractory metal surfaces (often tungsten) exhibited different values of ®. In fact, most early thermionic 
cathodes were tungsten because they were dependable, easy to clean by heating, and hard to kill. By comparison, more 
exotic cathodes that relied on other materials were poisoned by unknown processes. The high work function, however, 
was a limitation to achieving high current density, and so thorium (which was unfortunately radioactive) and submono¬ 
layer coverages of alkali (e.g., cesium) and alkali earth (e.g., barium) metals were investigated and found to substantially 
reduce the work function and greatly improve thermal emission current. 

In one of the more emblematic instances of the impact of good luck on the fortunes of science, Arthur Wehnelt in 
1904 in Germany was working on cathode ray tubes using thermionic emitters made of platinum which, like other 
metals, had a fairly high work function. Vacuum technique being what it was in those days, stopcock grease was used 
in the vacuum systems, and some of it had contaminated the cathode surface, curiously leading to much higher emitted 
current. Wehnelt, who was fluent in the technology behind vacuum tubes, electron guns, and emission from metal 
oxides, surmised that the metal oxides were behind the superior emission, and thus undertook an exploration of what 
the best metal oxide was for thermal emission. He concluded that it was barium oxide (BaO). Thus was born the metal 
oxide thermionic cathode, used in radio, radar, TV, displays, microwave ovens, and all manner of electron beam devices. 
As Sommer observed [134] 2 

The BaO cathode is a truly amazing case of a researcher interpreting an accidental observation and putting it to practical use. It is ironic 

that this observation could not have been made in a modern high vacuum system! 

The need for radar in World War II drove the development of semiconductor, oxide, and matrix emitter thermionic 
cathodes. A pivotal improvement was the invention of dispenser cathodes that continually rejuvenated the low work 
function barium coatings on the surface by diffusion through a porous sintered tungsten matrix, to replace the barium 
lost to evaporation [135], Increasing the temperature of a dispenser cathode increases the current density but does so at 
the cost of increasing evaporation rates (Section 29.3). As barium exhaustion signals the lifetime of the dispenser cathode 
has been reached, keeping the current density low by using beam compression gets the needed current densities in the 
beam tunnel (Section 33.2). 


10.2 Field emission 

Subjecting surfaces to very strong electric fields results in the emission of electrons, as first observed by Wood in 
1897 [135], Later in Germany, Lilienfield (University of Leipzig) was laboring to improve his X-ray tubes in the early 


2 Lest fans of other emission mechanisms knowingly smirk, the article containing the quote is entitled The Element Of Luck In Research 
- Photocathodes 1930 to 1980, humbling for its implications. 
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1900s, when he found that the measured current between two metals in vacuum was larger than he had anticipated, 
with a fluorescence that was observed in the "anti-cathode" region of his tube. Lilienfeld described the phenomena 
thus [136]: 

The tubes at present in general use depend for the production of the electrons upon one of two effects, known for many decades: Either the 
ionization of residual gases (gas-filled tubes) or the heating of the cathode to a bright glow (thermionic or Edison-Richardson effect). The 
tube to be described, however, utilizes the electrons released by a phenomenon which l have only recently discovered. The main feature 
of the phenomenon is the releasing of the electrons on an unheated and cold cathode in the absence of any kind of ionization, either the 
ionization of the residual gases or an ionization through a hot spark discharge (electric arc) which pulls molecules from the electrodes and 
converts them into ions. 

and coined the phrase "Autoelektronische Entladung" (or "auto-electron discharge" which became "field emission"), 
attributing it to electrons being liberated from his metal tips [137], In 1923, Schottky attempted to give a non-quantum 
mechanical account of both thermal and field emission [138] by proposing a lowering of the emission barrier whereby 
(j> = ® - sjAQF, which also contained his clairvoyant conjecture about field enhancement effects for small bumps on 
larger bumps being multiplicative (more about this in Section 30.4). Yet it was the successful prediction of the behavior 
of current density as a function of field that gave an early success to the new field of quantum mechanics. In Jan¬ 
uary of 1928, Oppenheimer wrote an elegant manuscript (see Section 9.2) in which, using a method hauntingly similar 
to those of Eqs (11.7) and (11.8), he predicted that atoms subjected to fields would, in a process unknown to clas¬ 
sical physics, eventually lose their electron. He conjectured that the "pulling of electrons out of a metal by fields on 
the order of 10 5 e.s.u." 3 explained the data of Millikan and Eyring [49], Oppenheimer predicted that the current J 
observed by Millikan and Eyring was such that Inwas linear in 1/F. On May 1 of the same year, Fowler and 
Nordheim [36] argued instead that ln( J/F 2 ) was linear in 1/F: as a practical matter, Millikan's data supported both 
formulations fairly well (the first cautionary moral in Section 4.1). Fowler and Nordheim's theory was amended by Nord¬ 
heim afterwards to include the Schottky factor for barrier lowering that led to ® ->• <f> = ® - \/aQF in the exponential 
argument. 


10.3 Photoemission 

In 1887, Heinrich Hertz was conducting experiments to demonstrate the existence of electromagnetic waves, in which 
he used a Tesla transformer to produce sparks in a primary gap that caused sparking in a secondary gap a distance 
away. His investigations led him to conclude that premature sparking at the secondary gap was a consequence of the 
ultraviolet light from the primary spark [139], Hertz's assistant, Philipp Lenard, continued the investigations and observed 
that light striking certain metals induced the emission of electrons, for which Lenard received the Nobel Prize in 1905. 
The difficulties in explaining both the promptness of photoemission and the insensitivity of the energy of photoemitted 
electrons to the intensity of light (particularly, red light of any intensity did not lead to photoemission) paved the way for 
Einstein's explanation using Planck's corpuscular theory of light (Section 3.3), in which Einstein boldly proposed [57]: 

The usual conception that the energy of light is continuously distributed over the space through which it propagates, encounters very 
serious difficulties when one attempts to explain the photoelectric phenomena, as has been pointed out in Herr Lenard's pioneering paper. 
According to the concept that the incident light consists of energy quanta ...one can conceive of the ejection of electrons by light in the 
following way. Energy quanta penetrate into the surface layer of the body, and their energy is transformed, at least in part, into kinetic 
energy of electrons. The simplest way to imagine this is that a light quantum delivers its entire energy to a single electron: we shall assume 
that this is what happens. 

By 1928, Fowler [68] was able to exploit Sommerfeld's electron theory of metals in much the same way as he did in 
his treatment of field emission to show that the photoelectric threshold frequency was sharp (at a particular frequency 
a> 0 ), and that the corresponding energy was equal to the thermionic work function, or hoj g = <t>. DuBridge [140] mod¬ 
ified Fowler's formulation and consequently accounted for the effects of temperature on the energy distribution of the 
electrons, as well as the total yield, the metric of which is the quantum efficiency. Photoemission from semiconductors, 
particularly from compound photoemitters, such as the I-V material ("I" referring to a column 1 element of the Periodic 
Table and "V" to a column 5 element) cesium antimonide (Cs 3 Sb), was important in the development of semiconductor 
band structure theory. High quantum efficiency materials are important, likewise, for photodetectors. 


hn CGS 


is related to MKSA units by 10V 1 statV/cm = 1 V/m. 
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10.4 Secondary emission 

Secondary electron emission was discovered in 1902 by Austin and Starke, but its theoretical description was complicated 
by the fact that the energy loss mechanism of the primaries and the generation of the secondaries demanded a quantum 
mechanical treatment that was difficult [141], one of the first being by Wooldridge [142]. Baroody [35] lamented this 
fact, observing 

There would seem to be qualitative value, however, in a much simpler theory in which the primary is considered to interact in a classical 
manner with a free electron gas, the possibility of momentum transfer to the lattice being introduced by assuming a finite mean free path 
for elastic collision.. .Although the use of classical principles greatly simplifies the calculations, the problem remains complex and it is not 
clear that all the mathematical difficulties have been adequately met. The final results are therefore probably more limited in significance 
than would have been judged from the starting assumptions. It is partly because of this circumstance, and partly because it is felt that 
more attention should be given the Sommerfeld model, that the present paper is offered. 

and so unlike the other canonical equations, Baroody's formulation came after the madness of World War n rather 
than before it. His formulation makes use of concepts concerning penetration into the metal, generation of the excited 
electrons, and their transport back to and emission from the surface, for which analogs exist in Spicer's semi-empirical 
three-step model of photoemission. Secondary emission is utilized in photomultipliers, magnetrons, crossed-field ampli¬ 
fiers [12], and diamond current amplifiers [143, 144]), but secondary emission processes also figure in damage and 
degradation, as when some of a high-energy beam in a particle accelerator strikes cavity walls: a rough distinction between 
the secondaries utilized by electron sources and those contributing to wall damage is that the secondary yield of cath¬ 
odes is desired to be high, but the secondary yield of walls and cavities is purposely suppressed by exploiting low-yield 
materials to suppress emission. 


10.5 Space-charge limited emission 

Space-charge limited emission theory is often associated with thermal emission: Langmuir's ground-breaking treatment 
observed that an understanding of space charge was crucial in showing that current due to thermal emission was not, as 
argued in Germany at the time, due to chemical reactions. Because current from a thermal cathode rose as the temper¬ 
ature rose, departures were observed from Richardson's equation for a given anode voltage. Langmuir wrote [132] 

This apparent discrepancy between the results of calculation by Richardson's equation and the facts observed with a tungsten lamp seemed 
at first to confirm the growing opinion that in a very high vacuum the thermionic current is very small, if not entirely absent. Experiments 
on the Edision [sic] effect in tungsten lamps, made some time ago by the writer, throw a great deal of light on the cause of the apparent 
failure of Richardson’s equation at high temperatures. The observations therefore seem of sufficient interest to warrant their publication. 

Langmuir mentioned in a footnote that C.D. Child found the same relation for the maximum current pulled across 
a planar gap for positive ions in 1911. For electrons, the relation therefore goes by the name of the Child-Langmuir 
equation (or often more compactly as Child's law), and it has since become " ...one of the best-known limits in the fields 
of non-neutral plasma physics, accelerator physics, sheath physics, vacuum electronics, and high power microwaves..." 
[145] Space charge is inextricably linked with geometry: 10 years after Langmuir's first paper, he and Blodgett [146] 
found space-charge limited current between coaxial cylinders, and a year later between concentric spheres [147]. Later 
refinements include initial velocities, voltage differences of such a magnitude that electrons traversing the gap became 
relativistic [148], and, ultimately, quantum mechanical effects for very small AK gaps [149], Space-charge effects addi¬ 
tionally appear for electron transport in semiconductors: when scattering dominates, the Child-Langmuir transforms into 
the Mott-Gurney law [81]. Within thin films, space-charge effects during transport can lead to self-quenching behavior, 
as shown in simulations by Mumford and Cahay of Metal/CdS/LaS cold cathodes [150], 


10.6 Resources and further reading 

Those who cannot remember the past are condemned to repeat it. 


- George Santayana 4 


4 George Santayana, Reason in Common Sense, The Life of Reason, Vol. 1. New York: Dover Publications, 1980. 
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Learning from others is second nature to humans: we do it more readily and precisely than any other animal. 

- Franz de Waal 5 

A rich history is available for the various electron emission mechanisms. A sampling of useful resources is suggested 
here, grouped according to kind: full-length books, book chapters, review articles, and general articles that have emerged 
as standard references. Of the books, a few describe more than one emission mechanism in depth, while others focus on 
one emission mechanism. A quasi-chronological account is: 

• De Boer, Electron Emission and Adsorption Phenomena [151]: Extensive and historically useful overview of emission 
physics, phenomena, and literature up to the time of publication (1935). 

• Birdsall and Bridges, Electron Dynamics Of Diode Regions [148]: Standard resource on space-charge limited flow and the 
Child-Langmuir law. 

• van der Ziel, Solid State Physical Electronics [152]: Concise chapters on all emission mechanisms, solid state physics, and 
semiconductor device concepts. 

• Dobretsov and Gomounova, Emission Electronics [153]: Comprehensive and economical treatment of all emission mech¬ 
anisms and space charge. 

• Modinos, Field, Thermionic, and Secondary Electron Emission Spectroscopy [111]: An excellent and thorough resource. 

• Gomer, Field Emission and Field Ionization [95]: A classic treatment of both theoretical and experimental matters with a 
historical background. 

• Fransen et al., On the Electron-Optical Properties of the ZrO/W Schottky Electron Emitter [154]: Extensive analysis and data of 
thermal-field emission (or the Schottky emitter), particularly useful for treating the transition from thermal emission 
to field emission. 

The series Advances in Imaging and Electron Physics (ed. P.W. Hawkes) and the earlier Advances in Electronics and Electron 
Physics series contain many invaluable books and book chapters, in particular 

• Dyke and Dolan, Field Emission [155] 

• Swanson, and Bell, Recent Advances in Field Electron Microscopy of Metals [156] 

• Murata and Kyser, Monte-Carlo Methods and Microlithography Simulation for Electron and X-ray-beams [157] 

• Hawkes, Electron Optics [158] 

• Binh, Garcia, and Purcell, Electron Field Emission from Atom-Sources: Fabrication, Properties, and Applications of 
Nanotips [159] 

• Fransen, Van Rooy, Tiemeijer, Overwijk, Faber, and Kruit [160], On the electron-optical properties of the ZrO/W Schottky 
electron emitter [160] 

• Binh and Semet, Electron Field Emission from Atom-Sources: Fabrication, Properties, and Applications of Nanotips [161] 

• Jensen, Electron Emission Physics [85] 

• Swanson and Schwind, A Review of the Cold-Field Electron Cathode [162]. 

Book chapters and reviews that focus on one of the canonical mechanisms abound, with particularly good treatments 
being the following: 

• Thermal emission 

1. Dushman, Thermionic Emission [33] 

2. Becker, Thermionic Electron Emission and Adsorption Part I. Thermionic Emission [163] 

3. Herring and Nichols, Thermionic Emission [164] 

4. Haas and Thomas, Thermionic Emission and Work Function, Chapter 2 [135] 

• Field emission 

1. Dyke and Dolan, Field Emission [155] 

2. Gadzuk and Plummer, Field Emission Energy Distribution [165] 

3. Zhu, Vacuum Microelectronics (all chapters) [166] 

4. Swanson and Schwind, A Review of the Cold-Field Electron Cathode [162] 

• Photoemission 

1. Gorlich, Recent Advances in Photoemission [167] 

2. Sommer and Spicer, Photoelectric Emission [168] 

3. Rougeot and Baud, Negative Electron Affinity Photoemitters [169] 

• Secondary emission 

1. McKay, Secondary Electron Emission [170] 


’Franz B.M. de Waal, The Ape and the Sushi Master: Cultural Reflections By a Primatologist, 1st edn. New York: Basic Books, 2001, p. 20. 
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2. Dekker, Secondary Electron Emission [141] 

3. Hachenberg and Brauer, Secondary Electron Emission From Solids [171] 

Reviews or extensive treatments have been listed in the bibliography, but there are particularly notable treatments 
for thermal emission [110, 172, 173], field emission [79, 96, 110, 173-177], photoemission [134, 178-181], secondary 
emission [35, 157], and space charge [145, 182], 
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CHAPTER 11 

Anatomy of current density 


“Beauty is truth, truth beauty," - that is all 
Ye know on earth, and all ye need to know. 

- John Keats 1 


Truth, n. An ingenious compound of desirability and appearance. 


- Ambrose Bierce 2 


Current density as encountered in Eq. (3.5) is complicated by several considerations: (i) particles have a distribution of 
energies, leading to the distribution approach of Eq. (7.9), ( ii) current is directed into a barrier such that the transverse 
components of the distribution can be summed over, leading to the notion of a "supply function" suggested by Eq. (9.75), 
and ( Hi) a transmission probability governs the distribution of particles transported past the barrier. A formulation of the 
current density which incorporates these wrinkles and bears a resemblance to the current density of the distribution 
function is ^ 

J(F,T) = ^ ™D(k)f(k)dk (11.1) 

where Tik/m is the velocity and D(k)f(k) is the distribution of particles making their way past the barrier. Eq. (11.1) looks 
and feels right at a gut level, even though a compelling derivation of it is lacking so far: it has truthiness , 3 It also requires 
unpacking: 

• The equation is one-dimensional, and so k x -*■ k. 

• The lower limit of integration, normally — oo, is 0 because no electrons are incident from the vacuum boundary (right) 
side. 

• The supply function f(k) is the integral of the distribution over the transverse momentum as in the brackets {■ ■ ■ } of 
Eq. (9.75), and includes spin. 

• The factor D(k)f(k) is the distribution of emitted particles. 

• The arguments of J(F, T ) reflect that the supply function is temperature dependent (T) and the transmission probability 
is field dependent (F). 

Were the vacuum to be replaced by a material and the barrier by an interface barrier characteristic of 
metal-semiconductor (and metal-insulator-semiconductor, MIS) boundaries as typical of Schottky barriers, then 
electrons would be incident from the right, the integration region extended to -oo, and the distribution function/(k) 
would have to be replaced by something accounting for the rightmost boundary, most simply by including a right-hand 
side supply function. Indeed, Eq. (11.1) would generalize to the Tsu-Esaki formula [183] 

J TE (V b ) = 2n[ ^ _/(fe,) ) dk (1L2> 

where E k , = E k + qV b and V b is the voltage bias across the barrier, as will be used in treating resonant tunneling diodes 
(RTDs; two barriers) of Section 19.2.4 to understand negative differential resistance. For his work with RTDs, Esaki shared 
the Nobel Prize in 1973. 

The genesis of all four of the emission equations can be traced back to limiting forms of Eq. (11.1) after ( i) the integral 
is recast in terms of energy, (ii) the supply function is specified, and (tit) the transmission probability is approximated 


1 John Keats, "Ode on a Grecian Urn", in Poetry: An Introductory Anthology, ed. Hazard Adams. Boston: Little, Brown and Company, 1968, p. 166. 
2 Ambrose Bierce, The Devil's Dictionary. New York: Dover Publications, 1993, p. 136. 

3 The word truthiness to suggest something held to be true not because of evidence but because of gut feeling was coined by the comedian 
Stephen Colbert in 2005 and, owing to its popularity, has made it into some dictionaries a decade later, for example http://dictionary.reference 
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based on an image charge potential. Although far simpler than their original genesis, such an approach reproduces the 
canonical emission equations quickly and compactly. Performing the first step, switch from a momentum integration 
over dk to an energy integration over dE under the assumption that energy is parabolic in momentum, that is, using 
h~ l dE = ( hk/m)dk so that ^ 

J(F,T)= f D{E)f{E)dE (11.3) 

Inti J 0 

Using this equation as the starting point, consideration of the supply function f(E) and the transmission probability D(E) 
can be undertaken. 


11.1 Supply function 


... two social and economic forces ...do indeed play a fundamental role in determining wage inequality, even in more sophisticated theories: 
the supply and demand of skills. In practice, the supply of skills depends on, among other things, the state of the educational system ... The 
demand for skills depends on, among other things, the state of the technologies available to produce goods and services ...At a minimum, 
they influence the relative power of different social groups. 

- Thomas Piketty 4 


The relation of demand to supply is a general and powerful notion appearing across disciplines (albeit that physics 
benefits from a less convoluted relation than economics). The demand, to push the analogy, is current density in the 
form of electrons making their way past the barrier, and it is dependent upon the (temperature dependent) supply of 
electrons from the bulk material. The supply function is a boundary condition, specifying the coefficients attached to the 
incoming wave functions in Eq. (9.78), and was implicit in Eq. (9.75). To make f(k) explicit. 


{2k) 2 Jo 1 


2nk L dk L 


+ exp{/7 (E x + E x - p )} 


(11.4) 


where /? = 1 /k B T, E x = h 2 k 2 /2m and E ± = Ti 2 k 2 j2m are used, and the x suffix has been temporarily reinstated to reduce 
ambiguity. Pleasantly enough, the integral is straightforward: letting u = p{E ± + E x - p), then 


f{E ) = _J?L_ [ du = _2?1_ i n (\ + ^{u-e x )\ 

A x) Kph? J m 1 + e“ Kph 2 V ) 


(11.5) 


The dropping of the x subscript can now be done because all vestiges of three-dimensionality have been subsumed in 
Eq. (11.5), and so let E X ^E below. To emphasize an important point, past discussions of f(k) will now be replaced by 
present discussions using/(E), as both refer to f {E{k)). Mathematicians would look askance, but it is convenient to do so. 


EXAMPLE: Verify that an integration over the zero-temperature supply function reproduces the density 
p = kp/3K 2 . Let E x -y E = h 2 k 2 /2m. 

SOLUTION: For p= l/k B T^oo, Eq. (11.5) is non-zero only for E < p, for which f{E) = (m/Kh 2 )(p- E) -*■ 
(k 2 - k 2 ) /2 k. Performing the integration over k from —k p to k F (the range over which the integrand is non-zero) 
results in 

k ? k 2 F - k 2 _ k\ 


l f k * ki - k 2 k 2 

p = — / - dk = -A: 

2k J_k 2 k 3k 2 


11.2 Gamow factor 

Cry woe, destruction, ruin, and decay 

The worst is death, and death will have his day. 

- William Shakespeare 5 


4 Thomas Piketty, Capital in the Twenty-First Century (trans. Arthur Goldhammer). Cambridge, MA: Belknap Press of Harvard University Press, 
2014, p. 305. 

5 Ref. [37]: King RichardII, in.ii. 102-103, p. 653. 
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e ikx 



r(k) e- ikx 

x v ° i 1 

t(k) eikx 







Region 1 

Region 2 

Region 3 


Figure 11.1 A barrier of finite thickness L and the decay of the transmitted wave function passing through it when E < V 0 . 

When an electron of energy E encounters a one-dimemsional step function potential of height V„ such that E < V 0 , 
the wave function in the barrier region is no longer oscillatory as with the incident wave function, but decays as in 
Section 9.2.3. Setting h 2 K 2 /2m = | E— V 0 \, then the wave function dies out as y/ trans (x) = t(k )exp (-k 0 x) , 6 If instead of a 
step function potential, a potential barrier of width L were encountered, then the wave function would have decayed to 
e~ K ° L before emerging from the other side to continue its oscillatory motion, as in Figure 11.1. The transmission probably 
should then go as something like D{E) « exp (-2 k 0 L). Why? A hand-waving explanation is because the current density 
relation contains terms of the form ( h/m)yi^d x yi ~ ( hx 0 /m)p which resemble the product of velocity and density, and 
as transmission probabilities are ratios of transmitted current to incident current, the expectation of how D(E) varies is 
anticipated to be D(E) ~ e~ 2K ° L . The Gamow factor is contained in the argument of the exponent, and for the rectangular 
barrier is 0 rec (E) = 2 k 0 (E)L, the subscript designating "rectangular". Such a hand-waving argument can be put on firmer 
footing. 

Gamow introduced 0(E) to describe the decay of the emission probability of an alpha particle attempting to escape 
the nucleus of an atom [184], In dealing with potentials that arise from the consideration of real surfaces from which 
electrons long to escape, the decay of a nucleus via alpha particle emission serves as a good template of how to proceed. 
Normally, the evaluation of the decay factor would require a wave function matching method commonly known as 
WKB [185], although the acronym JWKB, referring to Jeffries-Wentzel-Kramers-Brillouin, would be more accurate. 
The JWKB method is an approach to match wave functions at points where the wave function passes from above to 
below, or vice versa, when encountering a smoothly varying potential. The Gamow factor is often used in place of 
a proper JWKB treatment because it captures the most important physics with minimal effort. How the transmission 
probability D(E) is obtained from 6(E) can be done several ways [186], and will become justified when the full mechanics 
of treating general barrier problems is undertaken in Chapter 18. 

A simple argument can render the Gamow factor plausible here (similar to an approach by Miller and Good [187]). 
Start with Schrodinger's equation 

ihd t y/(x, t) + ^ d 2 — y(x)^ ys(x, t) = 0 (11.6) 

and assume y/ takes the form 

yr(x) = R exp (iS) (11.7) 

for two real functions R(x, t) and S(x,t). Clearly, R(x,t) 2 = p(x,t) is the number density. When Eq. (11.7) is inserted into 
Eq. (11.6), a pot pourri of real and imaginary terms spill out. Collecting the real terms together, and doing the same with 
the imaginary terms, both have to be separately set equal to zero, and so 

0 = — ((d S) 2 - + V(x) + hd.S 

2m l R J 

° =i {h R+ ^ 2 M ^ + ^ } (11 - 8) 


6 The term k 0 is used instead of the simpler k because for general and awkwardly shaped potentials a spatially dependent k(x) will useful, as in 
Eq. (11.21). Hence, using k 0 reinforces that it is constant. 
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Comparing to the Hamilton-Jacobi equation of classical mechanics [90, 122], the term ( hd x S ) /2m resembles kinetic 
energy. Similarly, the quantum potential introduced by Bohm [121] is 

(11.9) 

and is added to V(x) for Bohm's prescription to avoid a conflict between macroscopic and microscopic realism in quantum 
mechanics through the introduction of a trajectory concept analogous to that in Section 9.3.2. If the electron density is 
smoothly varying, then £2(x) will be small and ignorable. 

The point to focus on is that the kinetic energy term identifies Tid x S/m as a velocity: letting the (number) current 
density J be the product of a velocity and a density, as in Eq. (3.5), results in 

J(x, t) = R(x, tf ( ^d x S(x, o) (11.10) 

The same form is obtained using Eq. (11.7) in Eq. (9.60) (try it). Inserting J(x, 0 into Eq. (11.8) results in the imaginary 
equation becoming (try this too) 

0 (d t p + d x j) = 0 (11-11) 

for which the quantity in parentheses is the familiar continuity equation of Eq. (7.19). But there is more: J(x) inside 
Eq. (11.8) in place of d x S gives 

d x J(x) = i (E - V(x) - Q(x)) - ^J(x) 2 ] (11.12) 

For steady-state problems (d t p = 0), the current density is spatially uniform (d x J = 0) and so Eq. (11.10) shows that the 
phase S(x) evolves according to 

d x S = i-\j^ \E - V(x) - Q(x)| (11.13) 

Therefore, for E larger than the maximum of the potential and £2(x) small, then d x S behaves much like k 0 , but it is 
when E < V max that k 0 -* hc 0 and exponential decay results because e ,K ° x -*■ e~ K ° x . Thus, a reasonable expectation is that 
the prototypical Gamow factor of 2 k 0 L generalizes for an arbitrary potential to the form 


0(E) i 


(11.14) 


where V(x ± ) = E so that the integrand vanishes at the integration limits, and the factor of 2 out front is because the 
density behaves as 11//1 2 . Eq. (11.14) is the usual formulation of the Gamow factor. 


EXAMPLE: Find the Gamow factor 0 rec (E) for a rectangular barrier of height V 0 = h 2 k]/2m and width L 0 , that is, 
for V(x) = V 0 ®(x)®(L 0 -x), using Eq. (11.14). 

SOLUTION: Define L rec = L 0 . Then, 

0 rec (E) = 2 f ° dxJ k 2 0 - k 2 = 2\l k 2 0 - k 2 L 0 (11-15) 

Jo v v 

= 2 x 0 (E)L rec (11.16) 

where E = Ti 2 k 2 /2m and V 0 = Ti z k 2 0 /2m in Eq. (11.14). 


The quantity 0 rec = 2 k 0 L is the prototypical or constant barrier Gamow factor containing three important factors [188]: 
a numerical coefficient characteristic of the overall barrier shape (2), a wave number characteristic of the barrier height 
(k 0 ), and a length characteristic of the barrier width (L). Other simple barriers can be treated analogously. 
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EXAMPLE: Find the Gamow factor 0 M (E) for a triangular barrier V(x > 0) = V 0 - Fx using Eq. (11.14). 
SOLUTION: Define the length L^E) =(V 0 -E) /F. Then, 

\[2m _ 

6 tri (E) = 22L— / dxy/V g -E-Fx (11-17) 

= ~K„(E)L tri (E) (11.18) 


EXAMPLE: Find the Gamow factor 0 quad (E) for a parabolic barrier V(x) = $ ( 1 - (2 x/L 0 ) 2 ) + n, where L 0 is the width 
of the barrier at E = n, using Eq. (11.14). Observe that V 0 = F(0) = n + 4>- 

SOLUTION: Define the length L qua (E) = + 4> ~ E)/4> as the positive root of the equation V(L qua /2) - E = 0. 

Then 


\plm /-V®/ 2 I $ ~ 

9 qua (E) = 2 2-— / dxJ2--(L 2 0 -Ax 2 )+^- 

n J-L qm (E)/2 V L„ 


= —K 0 (E)L qua (E) 


(11.19) 

(11.20) 


In all three cases, the factor k 0 (E) is as defined in Eq. (11.15), or (Hk 0 (E)) 2 /2m =V 0 — E. Observe that the shape factors 
(2 for rectangular, 4/3 for triangular, and n/2 for quadratic) are dimensionless, different for each case, and in proportion 
to how much of the area of height x(E) and width L(E) is taken up by the integrand (completely in the case of the 
rectangular barrier). The shape factor approach will be the focus of a more general discussion in Section 11.3, and make 
a brief appearance in Section 25.2. Returning to the wave function y/(x), for general potentials V{x) it is useful to define 
a generalization of k 0 that is spatially dependent, or 



and the zeros x ± such that k-(x ± ) = 0. With the neglect of the quantum potential Q(x), then from Eq. (11.13) d x S = k(x), 
and for time-independent situations where the current density J is constant, R(x) 2 <x 1 /jc(x). Thus, the leading order 
approximation is (said this way because the same result is obtained using a perturbation-like approach as done by Miller 
and Good [187] and Schiff [42]) 

V(x) « -^= exp j Kix'W^ (11.22) 

when the energy of the incident electron is below the barrier maximum. As density is proportional to |vr(x)| 2 it follows 
that D(E) <x exp (-2kL) to this order. A careful reader might have a disquieting feeling that something is being obfuscated 
(it is), that shifting back and forth between a real k(x) and imaginary ix(x) in the definition of d x S is cavalier (it is), and 
that a proper account of the barrier might reveal weirdness that the present analysis hides (it will), but 0(x) is adequate 
to the needs of developing the canonical equations. Those discussions will wait. 

11.3 Image charge potential 

...I feel it,/I know my image now ...I start the fire I suffer. 

-Ovid 7 


7 R. Humphries, Ovid Metamorphoses, Narcissus. Indiana University Press, 1983, lines 462-465. 
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The method of images is a remarkably convenient tool to solve boundary value problems. The method is based on the 
simple observation that if the boundary conditions imposed by conductor surfaces can be mimicked by a distribution of 
charges (used with abandon in Section 30.3), the replacement of the metal surface by those charges is also a solution to 
the boundary value problem, and by the uniqueness theorem (if two solutions agree at the boundaries, they are the same 
solution) the resulting potential is the only solution [99]. For a flat metal plane over which a charge q is suspended, the 
metal surface is held at zero potential and all the electric field lines are normal to the surface, 8 but the same boundary 
conditions can be had if a charge of equal and opposite sign is equidistant from the surface but inside the metal. As every 
point on the metal surface is equidistant between the charge and its image, the potential vanishes, and so the electrostatic 
problem is solved. Representing a metal surface by an image charge is preferable to the horror of trying to find the surface 
charge density directly. 

The form of the image charge potential is revealed by considering the force between the external charge and its image: 
the charges are equal and opposite in magnitude, and equidistant from the surface boundary. If x is the distance of the 
external charge to the surface, then the distance between the charge and its image is 2x. Consequently, the force on a 
charge due to its image is 


m = - 


r 

4ne(2x) 2 


Q 


(11.23) 


The image charge potential is then determined by integrating over the force, or 

V image (x) = 


o=r( _«L_W—= - 

Jx \ 4jce 0 (2xf) 16 JIE'PC x 


(11.24) 


For an electron emitted from a metal surface, the consequence is that in addition to a barrier described by V 0 subject to 
an electric field giving rise to —Fx, there is a Coulomb potential of the form —Q/x (see Eq. (2.8)) due to the image charge 
inside the metal and serving in its stead. The barrier for metals is characterized by its height above the Fermi level q, or 
V B = q + ®, where <t> is the work function, so that 


V lmage (x) = F + « 


(11.25) 


for x > x min where x min is the smaller root of the equation V image (x) = 0. The potential V image (x) is shown alongside the 
triangular barrier (Eq. (11.25)) with Q -*■ 0) in Figure 11.2. There is much to notice. First, the image charge potential 
does not encounter the x = 0 origin, and the location of the maximum of the barrier will clearly move around with field. 
Second, the barrier maximum is reduced by comparison to the triangular barrier by an amount commonly called the 


Figure 11.2 T 

parameters, t 



0) for copper-like 


8 Were the electric field lines not normal to the surface, electrons at the surface would move in response until a distribution of charge was 
established such that the field lines became normal. 
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Schottky barrier-lowering factor. Third, the zeros of the 9(E) integrand, or x ± (E), change depending on the location of E, 
but arguably the most important instance is at E = n- Consider each in turn. 

The combination \i + ® - E appears with such regularity that it is convenient to define A (E) = + O - E. The zeros of 

the 9(E) integrand are then solutions to —Fx%. + Ax ± - Q = 0, whereas the location of the barrier maximum is the solution 
to d x V(x 0 ) = 0, or 


r ± (E)=^{A±VA 2 -4QE} 

(11.26) 

O' | fc, 

II 

w 

■v 

(11.27) 

L(E) =x + -x_= ^ \JiX 2 - 4QF 

(11.28) 

V max = F + ® - \Z4QF 

(11.29) 


Observe in Eq. (11.29) that the height of the barrier above the Fermi level is O - y/4QF, where the Schottky lowering 
factor y/4QF is the amount by which the barrier is lowered due to the image charge compared to the triangular barrier. 
Giving the image-reduced work function its own symbol is desirable: since the barrier is reduced, use <p, or 

</> = $- y/4QF = (l-y)^ (11.30) 

where y 2 = 4QF/4> 2 represents the fractional height of the image charge barrier compared to the image-free barrier. In 
terms of r/> and x 0 , an alternate method of expressing the image charge potential of Eq. (11.25) is 

+ f(*-*o) 2 < 1L31 > 

which is suggestive in that near x « x 0 the potential resembles a parabola, yet for large x and small F, when x 0 is negligible, 
the potential more closely resembles a triangle. Observe that (f> is the height of the barrier to an electron at the Fermi 
level E = n, for which A(jF) = <J>. 


EXAMPLE: For an electron at the Fermi level E = u (alternately, A = <t>) find the width of the barrier, the zeros 
x ± (p), the location x 0 and y 2 . Assume copper parameters of = 7 eV and ® = 4.5 eV. Let F = 8 eV/nm. 

SOLUTION: 

L(n) = i V® 2 - 4QF = 0.36934 nm 
x + (jd) = ^ (® + LF) = 4.6592 nm 
x_ (n) = ^ (® — LF) = 0.96581 nm 
x min = x_(0) = 0.03202 nm 

x=- = 0.21213 nm 

o F 

y 2 = 0.56887 
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Richardson-Laue-Dushman equation 


If the theory ...is correct and if all of the usual disturbing factors are eliminated in the design, the resulting retarding-potential 
current should have the same exponential dependence on the voltage characteristic of the temperature extending to the point of 
saturation ...Equally significant is the flatness of the saturated region. This is rarely seen in retarding-potential experiments, since the 
accelerating voltage usually manages to collect more current by influencing trajectories, reducing space charge, altering the average 
potential outside of patchy surfaces, and by other devious means. 

- Haywood Shelton [189] 


12.1 Approximations 

...in science, we can make successive approximations to the truth, in which each new stage results from an improvement, not a rejection, 
of what has gone before. 

- Bertrand Russell 1 

The Richardson-Laue-Dushman (RLD) equation predicts current density due to thermal emission, generally from 
metals that, through the application of coatings such as barium to a porous tungsten base, have a low work function 
(e.g., dispenser cathodes. Figure 12.1). Typical operational parameters are T = 1173 K, F = 0.1 eV/pm, and <t> = 2.0 eV. 
Standard accounts of the RLD equation are given by Guth and Mullin [190] and Murphy and Good [110]. 

For the transmission probability, observe that fields for typical thermal emission measurements are comparatively 
weak, 2 so weak in fact that electrons even a wee bit below the barrier maximum p + <p are prevented from escaping. Next 
observe that even at temperatures that thermal cathodes endure, only electrons in the thermal, or Maxwell-Boltzmann, 
tail of the supply function /(E) contribute to the current density. Using the approximation ln( 1 + x) « x in Eq. (11.5) pulls 
out the Maxwell-Boltzmann term. Therefore, and using the convenient term p = l/k B T, 

D m; (/i)«0[/f-(^ + 0)] (12.1) 

^ exp[ -p(ji - JS)] (12.2) 

When these approximations are inserted into Eq. (11.3) then 



where the coefficient A RLD = is as encountered in Eq. (5.50) and, importantly, <j> is the Schottky reduced 

work function of Eq. (11.30). When cp -*■ ®- \JAQF in Eq. (12.3), the equation is often referred to as the Schottky 
emission equation (sans a coefficient of (1 - r) that accounts for the small reduction due to quantum mechanical reflection, 
and will be encountered again in Section 19.2.3). 


Bertrand Russell, A History of Western Philosophy, and Its Connection With Political and Social Circumstances From the Earliest Times to the Present Day. 
New York: Simon and Schuster, 1945, p. 836. 

2 Calling such fields "weak" is most certainly a relative statement: compared to typical macroscopic fields, they are rather strong, for example the 
fields of 0.1 MV/m characteristic of gradients leading to lightening strikes are similar to a dispenser cathode grid at 100 V and a cathode-anode 
gap of 250 pm for a field of 0.4 MV/m [191], 
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Figure 12.1 Dispenser cathode. A small thermionic dispenser cathode compared to a US dime (diameter 17.9 mm). The white arrow points to the 
emission surface. Image courtesy of Bernard Vancil (e-Beam). 


12.2 Analysis of thermal emission data 

Using Eq. (12.3) to analyze the data of Shelton (Figure 6 of ref. [189] and Figure 12.2, a resource worthwhile for its expla¬ 
nation of the complexities in thermal emission measurements) is undertaken with the data given in Table 12.1. Retarding 
potential (RET) and saturation current (SAT) measurements are two methods of measuring the current density from a 
hot source. They are different in that the RET measurements assess the temperature-independent thermal constants, but 
the SAT experiments give an A coefficient that must be reduced to account for changes in the work function due to a 
temperature variation given by (see Eq. (28.1)) 

®(T) = ®„ + aT (12.4) 

Both data sets are fitted by y(x) = A exp(-Bx), but because the RET measurements give the actual thermal constants, 
A rec = A rld . A fit to the logarithmic data gives B rec = 53.0 for which <t> 0 = B/1000k B = 4.57 eV. For the SAT measurements, 
A sat = 974.37 A/cm 2 K 2 and B sat = 55.14. Because of the temperature dependence of® in Eq. (12.4), A sat = A RLD exp(a/k B ), 


10-9 
10 _1 ° 

* 

N E 10- 11 

I 

^ 10- 12 
10 -13 
10- 14 

0.52 0.56 0.6 0.64 0.68 

1000/T[K] 

Figure 12.2 Thermal emission data taken from two [211] tantalum crystals using a saturation current (SAT) method and a retarding-potential (RET) 
method (based on Figure 6, ref. [189], data from Table 12.1). The fits to y(x) = A exp(-Bx) give A = A^ for RET; A and B were determined by fitting for 
SAT. The difference B sat - B m exhibits the linear temperature dependence of 0(T) = 4> 0 + aT. 









Richardson-Laue-Dushman equation 


Table 12.1 Thermal emission data of Shelton: the retarding potential and saturation 
current methods of measuring thermal current density as a function of temperature. The 
columns labeled RET (retarded potential) and SAT (saturation current') correspond to J/T 2 
and are therefore in units of A/cm 2 K 2 . Data digitally extracted from Figure 6 of ref. [189]. 


looo/r (k) 


RET 


SAT 


0.66531 

0.64416 

0.62314 

0.60566 

0.58625 

0.56664 

0.54546 

0.52692 


6.0567 xIO- 14 
1.7273x10‘ 13 
5.3231 xIO" 13 
1.4199X10" 12 
3.8574 xIO" 12 
1.1364x10-” 
3.3281 XIO’ 11 


1.1969X10- 13 
3.5677 xIO - ’ 3 
1.1295X10-' 2 
3.0421 xIO -12 
9.0186X10- 12 
2.6164x10-" 
8.5922x10-" 
2.3619x10-’° 


so that 


a Ta = k B In 


'±l) 

\ a ridJ 


= 2.061fc B = 0.1776 eV/K 


for tantalum, the material under investigation. 


(12.5) 


www.ebook3000.com 






CHAPTER 13 


Fowler-Nordheim equation 


The main features of the phenomenon of the extraction of electrons from cold metals by intense electric fields are well known, and an 
approximate theory of the effect was first developed by Schottky. More recently the experimental data have been much improved, notably by 
Millikan and Eyring, and Millikan and Lauritsen. The theory has been considered afresh by O.W. Richardson and by Houston working 
with Sommerfeld. It seems to us, however, that there is still room for improvement in the theoretical exposition and its correlation with 
the experiments. Neither O.W. Richardson nor Houston really treat the theory in the simple straightforward way which is now possible 
in the new mechanics, using the revived electron theory of metals which we owe to Sommerfeld. Again, while Millikan and Lauritsen 
seem to have established quite definitely the laws of dependence of the emission on the field strength F, they speak of the implications of 
their result in a way which is hard to justify and might in certain circumstances prove to be definitely misleading. 

- Ralph H. Fowler and Lothar W. Nordheim [36] 1 

The Fowler-Nordheim (FN) equation predicts current density due to field emission, generally from metals that, due to 
closely spaced extraction grids or anodes, experience very high surface fields over small emission areas (e.g., Spindt-type 
field emitter arrays in Figure 13.1). Typical operational parameters are T = 300 IC, F = 8 eV/nm, and ® = 4.5 eV, as 
characteristic of tungsten wires [182] or conical molybdenum field emitters [175]. Standard accounts are given by Dyke 
and Dolan [192] and Murphy and Good [110], with an updated reformulation by Forbes and Deane [193], 

For the transmission probability, typical field emission gradients are quite strong, so strong in fact that those electrons 
with a longitudinal energy close to the Fermi level are most likely to tunnel, even though the supply function is small 
there. The supply function itself is well approximated by its zero temperature limit. Therefore, 

D FJV (E)«P(£)exp(-|-|o<-E)) (13.1) 

f PN (E) * - E)&(p ~ B) (13.2) 

where the argument of the exponential in D(E) is approximated by the linear expansion in energy E about the Fermi 
level p, the supply function has made use of the approximation ln(l + x) « ln(x), and a pre-factor P(E) has been attached 
to DfE) to accommodate the consequences of the coefficient of ip(x) inEq. (11.22), its exact form will be determined using 
the Airy function approach in Eq. (18.59). Because the transmission probability DfE) increases exponentially with energy 
but the supply function only linearly below the Fermi energy, their product will be peaked near E ~ p: other functions 
(such as P(E)) vary slowly with energy, and can therefore be evaluated at E = p and withdrawn from the integral. Let 
(C/F)(p - E) = x, then 

(13 ' 3 > 

For large Cp, the upper limit of the integration may be taken to +co and so the integral is equal to unity (the form when 
Cp is not large, as for semiconductors, will be considered in Section 24.3.2). As a result the non-F-dependent terms can 
be grouped, and J PN written as 

J PN (F)*AF 2 ex p(-|) (13.4) 

with A defined by comparison with Eq. (13.3). An immediate consequence of the largeness or smallness of Cp is thereby 
revealed: in the case of metals, Cp » 1, and the resulting J fn (F ) is independent of the chemical potential p. 

Finding the forms of A, B, and C takes additional work, and depends on the nature of the Gamow factor for the deter¬ 
mination of B and C via 6(E) « -( B/F) - (C/F)(p - E). The influential derivation of Murphy and Good [110] upgraded 
an earlier treatment by Nordheim [194] in which elliptical integral functions were recast as v(y) and t(y) to account for 
the barrier being given by an image charge potential. The present approach is intended to be more intuitive, and will 
be undertaken in three steps: (i) find A,B, and C in Eq. (13.3) using the simple triangular barrier approximation that 


lr The quote is the abstract in whole of Fowler and Nordheim's seminal 1928 manuscript. 
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Figure 13.1 Close-up of a single Spindt-type gated field emitter (in an array) that is 3.5 pm in height; gate is 3 pm in diameter with a 1.6 pm silicon 
nitride shield. White arrow points to the emitter apex, where field emission occurs. Pulsed emission has yielded > 3 mA from individual tips. Image 
courtesy of Christopher E. Holland, Capp A. Spindt (SRI). 


constituted the original FN equation, (it) invoke scaling arguments based on how the integrand in the Gamow factor 
behaves in changing its appearance from triangular-barrier-like for small fields to quadratic-barrier-like for large fields, 
and (Hi) consider the image charge potential directly and show that the behavior anticipated from the scaling arguments 
is what in fact occurs. 


13.1 Triangular barrier approximation 


Whatw 


'e doing here, that is the question. And we are blessed in this, that we happen to know the answer. 


- Samuel Beckett 2 


The evaluation of 0(E) using the triangular barrier V tri (x) = p + O-Fx in 6(E) is known from Eq. (11.17) and is 
given by 

4t/2m ,,, 

- v <•„ m3/2 ( 13 .5) 


where the second relation follows from the definitions k(E) = y/2mA(E)/h and L(E) = A (E)/F with A (E) = p + ® - E as 
in Eq. (11.26). Taylor expanding 6(E) to first order in (p. - E) is sufficient to evaluate B and C. Doing so gives 


B = 0(h)E= ^tV 2 m® 3 

(13.6) 

C = 0'(n)F= %^2m$> 
h 

(13.7) 


where the prime indicates derivative with respect to argument d'(E) = d E 6(E), and d'(p) is 6'(E) evaluated at E = p. Both 
B and C as defined do not depend on field (but only work function ®) and so are constant. 3 Comparing Eq. (13.3) to 
Eq. (13.4) reveals that 


A = 


(13.8) 


2 Samuel Beckett, Waiting for Godot; Tragicomedy in 2 Acts. New York: Grove Press, 1954, Act II, p. 52. 

3 Including ® inside B and C is temporary: by Eq. (13.25) the <D dependence will be pulled out and shown explicitly. 
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Nothing in the analysis so far reveals the nature of P(p), but in Section 18.2.4 it will be evaluated and shown to be 
given by 

/t + ® 

and thus is on the order of unity (e.g., 1.9522, if p = 7 eV and <t> = 4.5 eV). All together, the oft-cited 4 FN equation 
introduced in ref. [36] is 


P(n) = - 


(13.9) 


J m (F) = 


I6n 2 <bh y \i + ® J \ 3 hF ) 


(13.10) 


The triangular barrier model contains enough to anticipate through scaling arguments what the effects of the image 
charge potential should be. From Eq. (13.5) for 0(E), tc(E) is height of the integrand and L(E) is the width at the energy 
E. The effect of taking the derivative of 0(E), or dO/dE, is to bring k(E) to the denominator of the integrand without changing 
the length of the integration region. Therefore, 9(p) will scale as K(p)L(p) but 0'(p) will scale as L(p)/x(p). What effect will 
that have? 

For low fields, the image charge barrier remains more or less triangular in appearance, with the height of the barrier 
above p being ® -»■ ® - \JAQF = </>. Introduce y = \/4QF/® 2 , so that ® -*■ (1 - y)®. If v(y) is introduced such that B -> Bv(y) 
when ® -*■ (1 -y)®, then the question is how does v(y) vary with y based on how 9(p) scales? As k(p) oc ® 1/2 , then 
k -* *r-y/1 -y for ® ->• (1 -y)®. Similarly, L(p) oc i/® 2 - 4QF = ®\/1 -y 2 , and so v(y) is a decreasing function of y that 
behaves more or less as 

v(y) ~ (1 — y)(l — y 2 ) 1 / 2 


the performance of which, if compared to its accurate numerical evaluation via Eq. (13.16) below, would be seen to be in 
the right direction but not particularly praiseworthy: the fault lies in the neglect of how the shape factor coefficient scales. 
Nevertheless, scaling arguments suggest that v(y) is a decreasing function of y and therefore F, showing that the image 
charge current density improves over the simple triangular barrier form of Eq. (13.10). Such a conclusion is intuitively 
obvious due to the barrier being both lower and thinner when the image charge is included. In a similar manner, for 
C -*■ Ct(y) as ® -*■ (1 -y)® means 0'(p) scales as L(p)/k(p) or 


(1 - y 2 )!/2 


(l-y)l/2 


= V 1 +T 


which, like the expression above for v(y), is indicative of the behavior of t(y) (increases as field increases) without being 
a particularly good approximate form of it if compared to Eq. (13.19) below. 


13.2 Image charge approximation 

. ..if thou gaze long into an abyss, the abyss will also gaze into thee. 

- Friedrich Nietzsche 5 

The Murphy and Good (MG) equation for J MG (F) must take on the accurate evaluation of the elliptical integral func¬ 
tions 6 v(y) and tty), so-called here because they can be expressed in terms of elliptical integrals such as Eq. (30.104). In 
an earlier era, tabulating functions was the modus operandi of dealing with them, and so Abramowitz and Stegun [103] 
was never far away. More recently, though, very good approximations to v(y) and t(y) have become available [193], 
accurate to within 0.3% over the range 0 <y < 1. Even so, for those of a more computational disposition, evaluating 
the one-dimensional integral for R(w) below used to construct v(y) and tty) is no longer taxing and can be performed 
accurately and rapidly. 

The image charge represents the metal looking back (as it were) at a charge outside its surface. The degree to which it 
looks back is governed by the elliptical integral functions v(y) and t(y), so-called here because they can be expressed in 


4 Be forewarned: at times in the literature the Murphy and Good formulation of J m (F) is given and ref. [36] incorrectly cited. The equation of 
Fowler and Nordheim in their 1928 article is Eq. (13.10) and is based on a triangular barrier without a Schottky factor. 

5 Friedrich Wilhelm Nietzsche, Beyond Good and Evil (trans. Helen Zimmern), Kindle edition, 2011, Chapter IV: Apophthegms and Interludes, 
Aphorism 146. 

Alternately, R. Forbes has advocated calling them Schottky-Nordheim barrier functions. 
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terms of elliptical integrals such as Eq. (30.104). Remember that v(y) modifies 0(p) and t(y) modifies 0'(p), that is, if 0(E) 
is approximated by a linear function in energy about E = p, then the relation 

6(E )» ^ V2 m& v(y) + A t(y)(p - E) (13.11) 

is the defining equation for v(y) and t(y). The form of Eq. (13.11) will reappear slightly rearranged in Eq. (17.10). Whether 
one heroically tackles numerically evaluating the elliptical integral functions or sheepishly uses an approximation, the 
MG approximation to the FN equation takes the form 

J « (f) = 1 <13J2 » 

One notices right away that P(p) has disappeared: that is not so bad as it would otherwise introduce yet another parameter 
(the Fermi energy p) in a coefficient P(p) that is of order unity in an equation that already faces many challenges when 
applied to experimental data. The need to retain a level of accuracy that is hidden in the uncertainty of other parameters 
themselves or experimental error in general 7 to predict field emission from metals reflects more on the compulsion of 
theorists than the needs of experimentalists. Therefore, for decades Eq. (13.12) was treated as "the" FN equation even 
though many approximations to the v(y) and t(y) functions (and a host of other factors [186]) customize the formula and 
therefore its predictions in numerous small (and occasionally large) ways. 

Still, uncertainty is no excuse to evaluate the elliptical integral functions poorly (or ignore them) when ways 
to obtain good values are available. In an earlier era, tabulating functions was the modus operandi of dealing with 
them, and so Abramowitz and Stegun [103] was generally within reach for the theorists. The functions v(y) and 
t(y ) have been tabulated [195], series expanded [196, 197], interpolated for spreadsheets [198], and put into a 
recursion relation [193]. For those of a more computational disposition, though, evaluating v(y) and t(y) numerically, 
accurately, and rapidly is possible. Therefore, use may be made of Eq. (11.26) and its companions, particularly L(p) 
and x_(p). Replace the integration variable x -»= x_(p) + L(p) sin 2 (w) in the Gamow factor integral of Eq. (11.14) 
and evaluate it at E = p. All the factors with the exception of w = x_/L may be pulled out of the integrand, leaving 
[197, 199] 


0(M) = |v 2mFL(pyR(w ) 

(13.13) 

f" 2( 

Jo y/w + sin 2 u 

(13.14) 

x (u) 1/1 \ 

L(p) 2 \^ETy2 ) 

(13.15) 


in terms of which the sought-for v(y) function becomes 

v(y) = 3(1 -ff^Riw) (13.16) 

The asymptotic limits of R(w) can easily be found. 


EXAMPLE: Find R(w) in the limit of small field. 

SOLUTION: Small field corresponds to small y and therefore to w —> 0. The integral defining R(w) becomes 
R(0) J cos 2 usinudu = J x 2 dx = ^ 

As F is coupled to Q, small field also corresponds to the same limit as Q -*■ 0, meaning the barrier appears increasingly 
triangular. For the triangular barrier, L(p) = O/F, and Eq. (13.6) is recovered. 


7 This says nothing about the level of accuracy of the FN equation itself, which is already an approximation because of how the transmission 
probability is treated (Chapter 17 and Section 18.2.4). 
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EXAMPLE: Find R(w) in the limit of large field. 

SOLUTION: Large field corresponds to y -* 1 and therefore to w -* oo. The integral defining R(w) becomes 


R(w » 1) 


i r ' 2 2 • 2 

—— / cos u sin u 

yw Jo 


16-y^it; 


(13.17) 


For large F, the barrier for E > n appears parabolic. 


In a premonition of things to come, Eq. (13.11) is not the only starting point. The strength of the field just as the top 
of the image charge barrier is dragged down to the Fermi level is given by F max = P 2 /4 Q, obtained by setting (f> = 0 in 
Eq. (11.30). In that limit, L -► 0 so that x_(n)/L(pi) = (0/2 FL(pi))- (1/2) -»■ oo. Consequently, 


„ n VmP 3 / _ 4QF \ 
S 2 tiF \ O 2 / 


(13.18) 


as could be inferred from Eq. (11.20) if one is careful about what the relationship is between the quadratic barrier and 
the image charge parameters when F approaches F max from below. 8 Eq. (13.18), however, is a high field limit, and the 
evaluation of the field emission current comes from the low field side. 

Returning to the elliptical integral functions, the ease with which v{y) may be numerically calculated using R(w) means 
the more painful analysis using elliptical integrals and tabulated values has been dodged, and so attention turns to f(y). 
It can be shown that f(y) is defined by 


where R\w) = dR/dw or 


f(y) = 


2y d \ 3(2w + l)R(w) - 4w(w + l)R'(w) 

3 dy) V{y) ~ ^ 2^1 


i r /2 

R\w) = -± J 


cos 2 u sin 2 w 
(w + sin 2 M) 3 / 2 


du 


(13.19) 


(13.20) 


EXAMPLE: Find c(l) and f(l) exactly. 

SOLUTION: Begin with v(y ) as defined by Eq. (13.16) and use w(y) as defined by Eq. (13.15). Then in the limit 
w(y -*■ 1) -> oo, Eq. (13.17) is useful, giving 


' (2w+ l) 3 / 2 


1> k \[2 _ 
* 64 w 2 ~ 


(13.21) 


The evaluation of t{y -*■ 1) is more difficult, but the same analysis giving Eq. (13.17) when applied to Eq (13.20) 
shows that R'(w) *♦ -n/32w i/2 for large w, and so when used in Eq. (13.19) 


t(y 


x(8w + 5) 
16[ic(2n; + l)] 1 / 2 


*V2_ 


(13.22) 


Such formulae, however, can still be off-putting. As a result, the "quadratic" approximations of Spindt etal. [175] 

t Wt(y)« v o -y 2 
tspind,(y) W t 0 

where v 0 and t 0 are given by v 0 = 0.95 and f 2 = 1.1 are frequently employed for their reasonable accuracy and huge 
simplicity. 


s For the image charge potential, 0O) differs from 9 qua (d) by a factor of yj 2/(1 +y ), which approaches 1 as y -> 1, a pleasing little derivation for 
those who enjoy such things. 
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And so matters sat for years until Forbes found a stunningly simple approximation of fairly good accuracy (better than 
0.33% over the whole range of y) by numerical experimentation using MAPLE. In what began as a happy accident [200] 
but which was put on a better mathematical footing [193, 201] Forbes and Deane found 

v(y)= l-y(3-ln(y)) (13.23) 

f(y)= i + ^d-inCy)) (13.24) 

for which it is easy to see v{y) varies from 1 to 0 and t{y) from 1 to 1.1111 as in Figure 13.2 (the latter behavior justifying 
the widespread approximation of t(y) by a constant, but observe that the exact value of f(l) is not 10/9 but rather n\f2/A 
from Eq. (13.22)). What is now clearly seen is that each contains a logarithmic term, the presence of which frustrates 
polynomial and other kinds of expansions near y = 0 and y= 1 (the logarithmic term was anticipated in ref. [197], but 
the form there lacked the elegant simplicity and therefore computational utility of the Forbes-Deane version). 

The form of Eq. (13.23) can be used to "correct" the Spindt approximation by demanding that v(y) be linear in y 2 with 

the coefficient ofy 2 set to unity, which gives y 0 = l/\/i = 0.606531, v 0 = 1 - (l/6e) = 0.9387 and t(y 0 ) = t 0 = 1 + (l/6e) = 

1.06131 [85, 202], What of it? Although the usage of t 0 makes sense (similar to the reasons for abandoning P(p)), use of 
the quadratic form of v(y) « v 0 - y 2 is ill-advised as in Figure 13.3, particularly when fields are high such that Eq. (13.23) 
is correct but the quadratic form is not, and this will matter when emission near the barrier maximum is considered such 



Figure 13.2 The Nordheim Junctions. A comparison of the numerically calculated v(y) and t(y) functions with the approximations of Eq. (13.23): the 
differences are generally less than 0.33%, as shown in Figure 13.3. 



Figure 13.3 Nordheim function approximation error. The error, defined as 100%(/-/,)//, where/, is the approximation to the exact value/, of the 
functions v(y) and t(y). Quad refers to v^Jy) = 0.9387 -y 2 and t qu Jy) = 0.9387 +y 2 /3. FD refers to Eqs (13.23) and (13.24). 
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that Eq. (13.18) becomes a better starting point in the thermal-field equations of Chapter 17. If one craves simplicity at 
the cost of a small bit of accuracy (and for modeling electron emission in the beam codes of Chapter 33, one does), then 
the recommended approximations are to use Eq. (13.23) and let t(y) « t 0 if one must, and therefore render J MG (F ) as [188] 




B n = -4 \[2m = 6.8309 - l - - 

^ [nmr\/evj 

_ 2B 0 Q _ 8Q / 2m \ 1 / 2 _ / 2.68754 eV \ 

V_ 3V®” 9 ^°^ ' ° ’ 


(13.25) 

(13.26) 

(13.27) 

(13.28) 


where A 0 , B 0 and v are in Table 2.3. The point of introducing A 0 and B 0 is because they are collections of constants both 
fundamental and mathematical ("universal constants" in the terminology of Forbes [198]), so that giving these constants 
special status is desirable. 

The form of Eq. (13.25) has illuminating consequences. Observe that with the Forbes-Deane approximation, the form of 
the MG Eq. (13.12) and the original Fowler-Nordheim Eq. (13.10) become hauntingly close, albeit that P{g) is neglected 
and that A 0 and B 0 are different than A and B by pulling out the work function <t> explicitly. The close family similarity 
between Eq. (13.25) and Eq. (13.10) explains (but does not justify) why the triangular barrier formula is occasionally 
(and wrongly) used for metals: the slope of the current density in so-called Fowler-Nordheim coordinates In (J fn /F 2 ) vs 
1 /F seems to be the same for both, and often it is the slope that is of interest. This is because the Schottky barrier lowering 
factor manifests itself in the new coefficient of (® 2 e 6 /4Q) v and a factor of F _v in the coefficient, both of which do not 
affect the slope (much) even though they do affect the magnitude of J PN by quite a bit, so that including it is a must. Still, 
the neglect of P(g) by setting it equal to unity is desirable to avoid the proliferation of the quantity of parameters needed 
to evaluate J PN (for now, that is: when field emission from semiconductors is considered, the behavior of P(E) will come 
back into consideration, and it will not be ignored so blithely, see Section 24.3.3.). 

Eq. (13.25) is understood to be the MG form utilizing the Forbes-Deane approximation for the evaluation of the 
Fowler-Nordheim current density. Because a subscript acronym of FDMGFN is unwieldy (although it would reflect 
genealogy) it has been contracted back to FN. When J PN is mentioned hereafter, Eq. (13.25) is what is in mind. 


13.3 Analysis of field emission data 

Priestley: I thought details mattered to you, sir. 

Lavoisier: Only if they are relevant. 

-Carl Djerassi and Roald Hoffmann 9 

Turning attention now to the ability of Eq. (13.25) to account for features evident in experimental data, consider first 
the data of Millikan andEyring [49], invoked by both Oppenheimer [109] and Fowler and Nordheim [36] to give credence 
to their equations of tunneling and emission. The data given in Table 13.1 and shown in Figure 13.4 were designed to 
show the small but nevertheless non-zero impact of temperature on field emission: it was the insensitivity of field emission 
current to temperature, recall, that led to the initial speculation that thermal emission electrons ("thermions") were a 
different breed than field emission electrons. Millikan and Eyring's work demonstrated that conduction band electrons 
were the carriers of emission current, and that conclusion led them to speculate on how the thermal and field emission 
equations could be unified, a subject returned to in Section 13.4. 

The data in Figure 13.4 shows an exponential behavior over the range of anode potentials measured, a common feature 
of field emission. Considering the 300 K line (both increasing and decreasing data) and performing a best fit of the data 
using the forms of field emission hypothesized by Oppenheimer (AF 1/4 exp (-B/F)) to that hypothesized by Fowler and 
Nordheim (AF 2 exp(-B/F)) (recall that, by Eq. (13.25), the parameterization by these forms is reasonable) it is seen the 
data are a bit too uncertain to conclusively choose between the two forms (Cautionary Moral 1). It is common to represent 


J Carl Djerassi and Roald Hoffmann, Oxygen: A Play in Two Acts. Weinheim, New York: Wiley-VCH, 2001, Scene 9, p. 89. 
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Table 13.1 Field emission data of Millikan and Eyring. The values are equivalent to 
those of Table VII in ref. [49], The data refer to current vs anode potential for two 
temperatures, 300 K and 1100 K (the third column of Table VII is not reproduced 
given its closeness to the 300 K values shown). The data are plotted in Figure 13.4. 


VJM) 

1(300 K) (pA) 

1(1100 K) (pA) 

3050 

0.12 

0.14 

3500 

0.60 

0.75 

4000 

4.50 

5.40 

4400 

21.00 

24.00 

4750 

54.00 

59.00 

5000 

100.00 

110.00 

4800 

63.00 

69.00 

4200 

12.00 

15.00 

3750 

2.10 

2.70 

3200 

0.24 

0.36 

3000 

0.09 

0.15 



I(V) data on a so-called Fowler-Nordheim plot wherein \n{I/V 2 ) vs 1 /V is considered, based on the presumed relationship 
between current I and current density J via I = AJ, where A is emission area, and likewise the relationship between anode 
potential V and field F via F = P g V, the subscript emphasizing that the P g field enhancement factor, when relating voltage 
to field, has dimensions of inverse length, as is common for presenting experimental data (e.g., Barbour etal. [182]). Such 
discussions are premature before considering what the area factor A could be (see Chapter 30), but a best-fit analysis, 
shown in Figure 13.5, shows that the hypotheses of Oppenheimer, and separately Fowler and Nordheim, are borne out: 
the agreement with each of their hypothetical equations is reasonable given the accuracy of the data. Lastly, the data is 
recast on a Fowler-Nordheim plot, or representation, in Figure 13.6. 


13.4 The Millikan-Lauritsen hypothesis 

It was all very well to say “Drink Me," but the wise little Alice was not going to do that in a hurry. “No, I'll look first," she said, “and 
see whether it's marked ‘poison ' or not"; for she had read several nice little histories about children who had got burnt, and eaten up by 
wild beasts and other unpleasant things, all because they would not remember the simple rules their friends had taught them ... and she 
had never forgotten that, if you drink much from a bottle marked 'poison, ’ it is almost certain to disagree with you... 

- Lewis Carroll 10 


10 Lewis Carroll, Alice's Adventures in Wonderland. Racine, WI: Whitman Publishing Company, 1945. 
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Figure 13.5 Millikan-Eyring data and theory ([49], Table VII). The 300 K data of Figure 13.4 compared to the predictions of Oppenheimer [109] and 
Fowler and Nordheim [36]: observe the uncertainty in the data. 



10 4 /V 

Figure 13.6 Millikan-Eyring data on a Fowler-Nordheim plot. The 300 K data of Figure 13.4 represented on a typical Fowler-Nordheim plot. No 
distinction is made between the increasing vs decreasing directions of the data, as in Table 13.1. 


The only rational way of educating is to be an example - if one can't help it, a warning example. 


- Albert Einstein 11 


Cautionary tales, in which incautious children meet grisly fates for heedlessly failing to follow sage precepts from those 
with experience greater than their own, are efficacious ways to warn of perils. Any tale centered on the origins of a clever 
hypothesis, and conclusions drawn from it, that proves to be not quite right deserves attention for the object lessons it 
provides. 

Millikan, who measured the electron charge using his iconic oil drop experiment (a feat for which he received the 
Nobel Prize in 1923; he also measured the value of Planck's constant, but that is another story), was an accomplished 
and discriminating experimentalist, whose careful research on the difference between current emitted from heating 
metals compared to that pulled from the metals by high field were important to both Oppenheimer and Fowler and 
Nordheim in the development of their theories of field emission. 

Millikan and Lauritsen attempted to distinguish between "thermionic currents" and "field currents", and their careful 
measurements showed that temperature effects on the field currents did not manifest themselves until astonishingly high 
temperatures, from which came "the conclusion that the 'field-electrons' at ordinary or low temperatures have nothing 


11 AIbert Einstein, Ideas and opinions. Based on Mein Weltbild, Essay, "Education and Educators". New York: Crown Publishers, 1954, p. 57. (A 
letter to a young girl, published in Mein Weltbild. Amsterdam: Querido Verlag, 1934). 
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to do with thermions, i.e., that they must have a static, not a dynamic, interpretation" [50], In other words, the emission 
of thermions was thought to be due to a process by which they shared in the "thermal agitation" of the atoms, in contrast 
to the notion that" ...field currents are due to conduction electrons pulled from minute peaks on the surface" [49], as 
argued in their earlier work. 12 Though the origins of the current were by different mechanisms, one associated with the 
behavior of the transmission probability (field) in contrast to the other associated with the thermal tail of the distribution 
(thermal), current was nevertheless carried by the same particle: without the conceptions introduced by Sommerfeld and 
seized upon by Fowler and Nordheim, speaking of the processes coherently was difficult. 

It had already been established that empirical data for the total current from thermal emission from an area A at a 
temperature T produced a linear relation when In (I/V 2 ) was plotted versus l/T, or 



(13.29) 


with A and B related to the slope and intercept of the aforementioned linear relation. Similarly, when Millikan and 
Lauritsen examined In (I/V 2 ) in their field emission studies, they found that it, too, satisfied a linear relation with 
1/y, or 



(13.30) 


The astonishing similarity between these two equations, in which the T of the former has the same role as the F of the 
later, 13 led Millikan and Lauritsen to hypothesize that the Richardson-Laue-Dushman (RLD) equation was the low field 
limit of a "general" formula given by 



(13.31) 


(their Eq. (5)) from which they concluded [50] ".. .the application of an external field is equivalent to increasing the temper¬ 
ature of the electrons within the metal" (their emphasis). Seems reasonable. 

However, as Hamlet's mother was rightly admonished, 14 "seems" and "is" are different: appearances can suggest rela¬ 
tions which do not survive in the light of a coherent theory. As attractive and perhaps intuitive as Eq. (13.31) seems 
to be, it is errant, as are the conclusions based on it. The arguments of Fowler and Nordheim shortly afterwards (their 
1928 abstract of ref. [36] starts Chapter 13), and the energy distribution measurements of Gadzuk and Plummmer [96] 
several decades later would drive home how different the processes were; even later, a general thermal field equation, 
developed in Chapter 17, would show the actual equation even looks different, in that J{F, T ) is the sum of a thermal 
part and a field part, not one expression with differing limits. Herein lies several cautionary tales: that similarity is not 
equivalence, that equations with fitting parameters can model data just fine and still motivate misguided conclusions, 
that an equation which fits the data is not the same as a theoretically justified equation which fits the data (although 
a theoretical equation that doesn't fit the data is worse). But it is also true that Hamlet's hesitation itself is cautionary in 
that understanding need not be required for action and that delay can be fatal to ambition. 

The behavior of the transmission probability D(E) and the supply function /(E) give an explanation as to why 
Eq. (13.31) cannot be the general equation of thermal and field emission. In thermal emission, D(E) is a step function 
which acts to mask all but the thermal tail of /(E) so that the product of the two (the current density integrand) is 
peaked about the barrier maximum p + ®; in field emission, D(E) exponentially increases but slower than the thermal 
tail decreases, so that the maximum of the current density integrand is peaked about the Fermi level p (Figure 8 of ref. 
[110]; Figure 3 of ref. [155]) and shown by measurements [96]. Field effects appear in thermal emission via the Schottky 
barrier lowering factor. Thermal effects do not appear in field emission because the dominant part of the current density 
integrand is where the supply function /(E) in Eq. (11.5) behaves as /(E ~ p) = ( m/nh 2 )(p - E) until conditions are such 
that the states in the thermal tail (E > p) of the supply function begin to contribute appreciably. Such a way of looking 
at emission came after Millikan's hunch, though, so he is entitled to some leeway by posterity. 

The cautionary tale about empirical formulae, however, stays. 


12 The passage goes on to say " ...the fatigue effects of both current treatment and heat treatment being due to the rounding off of these peaks 
by positive ion bombardment or by temperature..." [49]. 

13 This assertion presumes field F and anode potential V are linearly related, as well as I and J: that assumption will come under great scrutiny 
in Chapter 30. 

14 "Seems, madam? Nay, it is, I know not 'seems.'", ref. [37]: Hamlet I.ii.77, p. 936. 
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Fowler-Dubridge equation 


The precise hypothesis which succeeds so well in correlating the observed effect near the threshold is that the photoelectric sensitivity or 
number of electrons emitted per quantum of light absorbed is to a first approximation proportional to the number of electrons per unit 
volume of the metal whose kinetic energy normal to the surface augmented by hv is sufficient to overcome the potential step at the surface. 


- Ralph H. Fowler [203] 


The Fowler-Dubridge (FD) 1 equation predicts quantum efficiency generally for metals due to optical and UV light 
incident on the surface. Typical operational parameters are T = 300 K, F= 50 eV/pm, and 0 = 4.5 eV for metals 
(work function) or /= 1.6 eV for cesiated semiconductors, as characteristic of cesium antimonide or multialkali 
antimonide surfaces (electron affinity). A standard account is given by DuBridge [204] and Bechtel [205]; the latter 
gives a generalization to multiphoton effects. 

14.1 Approximations 

Heisenberg: It starts with Einstein. 

Bohr: It starts with Einstein ...the universe only exists as a series of approximations. Only within the limits determined by our relation¬ 
ship with it. Only through the understanding lodged inside the human head. 


- Michael Frayn 2 


Beginning with Einstein's assertion that a single electron absorbed the whole of the incident photon's energy 
(Section 3.3), Fowler anticipated that the consequences could be accounted for not by altering f(E x ), but rather by 
modifing D(E X ). Appending the entire photon energy hco to E x may not seem proper: after absorbing a photon, the 
electron should be heading off in any direction, only some of which are towards the surface, but Fowler pointed out 
that appending the photon energy to the transmission probability term via D(E X ) -» D(E X + hco ) worked surprisingly 
well (moving beyond that approximation is considered in Section 31.4). Therefore, in Eq. (11.3) make the following 
substitutions 


D fd (E) « ©[£ + hco - {p + </)] 
f FD (E) = ln{ 1 + exp \fiOi - E)\} 


(14.1) 


(14.2) 


where the x subscript on E x is hidden. That the supply function/(E) is not being approximated by any of its limits (as done 
for both thermal and field emission previously) might cause apprehension, as it should: the integration is going to have 
contributions from the Maxwell-Boltzmann tail characteristic of thermal emission, but also from the linear behavior in 
E of field emission, and both can contribute depending on the size of E + hco compared to p + ®. 

The ratio of the number of electrons emitted A Q/q with the number of photons absorbed AE/hcv defines the quantum 
efficiency QE, or 



(14.3) 


where AQ is the total emitted charge and A E is the total absorbed energy. Assume next that the duration over which 
the metal surface is illuminated is long compared to time scales associated with photon absorption or electron transport 
within the metal to the surface. Consequently, electrons are assumed to be emitted as soon as the surface is illuminated, 
and cease to be emitted as soon as the illumination is extinguished. Further, assume that the light intensity is not so 

1 Observe that in the present section the subscript FD refers to Fowler-Dubridge, not Fermi-Dirac. 

2 Michael Frayn, Copenhagen. New York: Anchor Books, 2000, Act 2, p. 73. 
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Figure 14.1 Representation of a photocathode in an rf injector. Image courtesy of C. Hernandez-Garcia, Thomas Jefferson National Accelerator Facility 
(JLAB). Reproduced from C. Hernandez-Garcia, P.G. O'Shea, M.L. Stutzman, Physics Today 61, 44 (2008) with the permission of the American Institute 
of Physics. 


severe that the metal appreciably heats up (as would happen with powerful lasers characteristic of photo-injectors as in 
Figure 14.1), and therefore no temperature-dependent effects occur (e.g., thermal emission from a heated metal surface). 
If At is the duration of the illumination and J x is the illumination intensity, then AQ/AE at (J(hoj)Af)/(I x At) = J(hco)/I x . 
In fact, if the electron emission tracks the time dependence of the illumination, then 



(14.4) 


A number of factors affect Eq. (14.4). A factor of (1 —R(w)) describes the fraction of photons absorbed, where R(co) is 
the reflectivity of the surface. Not all accounts of quantum efficiency include this factor, with some treatments defining 
it as the ratio of the emitted current density to absorbed (not incident) light intensity, so cave, lector . 3 Next, not all of 
the electrons that are excited make their way to the surface unhindered: they can scatter with other electrons and/or 
vibrational modes of atoms (aka phonons) or are lost by other means such as recombination so that only a fraction F x 
survive. At the surface, the remaining electrons encounter a surface barrier, for which only a faction P(hco) of the electrons 
with an energy above the barrier experience transmission into vacuum, and so 



(14.5) 


describes the current density. If QE is defined as the ratio of the number of electrons out ( nr 2 JAt/q) to the number of 
incident photons (nr 2 I x At / ha>) for a given area nr 2 and pulse length At, then 


QE S = [l - «»)] WM 


(14.6) 


EXAMPLE: Expressing J in units of A/cm 2 , intensity I x in units of W/cm 2 and using wavelength X in nm, find the 
constant of proportionality relating QE to J/ XI A . 

SOLUTION: from QE = (ftw/q)(J/I x ) and co = 2ncjX, it is seen that 



Until one knows how to account for the impact of scattering processes, F x is mysterious (at least until Eq. (31.30)). 
Therefore, the factor of most concern at present is Pitico), and how it accounts for the number of electrons above the 


3 Latin for "beware, reader." More or less. 
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barrier compared to all the electrons incident on the barrier. In the language of transmission and supply, this is familiar, 
and is 

_ fo WmE)dE _ f 0 D(E) In (1 + e^-V) dE 
a /„” f(E)dE f 0 °° In (1 + ) dE 

but because tunneling is not appreciable, D(E) acts as a step function, and so 

_,,,ln(l + e^-V) dE 

P(hcd) -> ^ * )) , --- (14.8) 

f 0 °°\n(l+eP^)dE 

Let z = Pin - E) and introduce the Fowler functions U(x) defined by [204] 




such that 1/(0) = f (2)/2 is already familiar from Eq. (A2.6), to find 


EXAMPLE: Find U(x) + U(-x). 



SOLUTION: Separating off the negative region by using 1/(0), it follows that the remaining integral ii 

n Eq. (14.9) 

can be combined (where x -x a 

is the integration term in the integral contained in U(-x)) to give 



kw+w-at>= f ln (rr^)* 

(14.11) 



(14.12) 


When the argument of U is negative, then e? < l over the entire integration region in Eq. (14.9), and so the logarithm 
may be Taylor expanded in a procedure familiar from Section Al.2.2. It follows that 

w-*) = | l 14 ' 13 > 

U(X)= J + 7 J~ U( ~ X) <14 ' 14) 

where x is treated in both as a positive quantity. The last form of Eq. (14.13) can be treated as a polylogarithm or Jon- 
quiere's function Li 2 (e~ x ). These equations are equivalent to the findings of DuBridge (see Eqs (2) and (3) of ref. [204]). 
If numerical integration methods are not available, a reasonable approximation is [206] 


1 gX(l _ fa**) x < o 

2 2 ( 14 - 15 > 

_ + 2_ _^ (1 _*-) x^O 

with b = 1 - (jc 2 / 12) = 0.17753 and a = (1 - b - ln(2 ))/b = 0.72843. The performance of the approximation is shown in 
Figure 14.2 for negative argument (the positive argument being a trivial addition). For photoemission, wavelengths such 
that hco> cp are chosen to get appreciable emission, therefore near room temperature conditions, where p « 38.682 eV _I , 
the arguments of Z7(x) in Eq. (14.10) are large, and so 


Pitim )« 


3 (ha - cp) 2 + ( jck B T ) 2 
3p 2 + ( jrk B T ) 2 


(14.16) 


or more succinctly, P(hco) <x (hoo - <p) 2 if the temperature dependence is taken as small. 
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on Figure 1 of DuBridge and 
Eq. (14.16), but normalized so 
6 = 4.3 eV. 


Table 14.1 Photoemission data. Data 
numerically extracted from DuBridge and 
Roehr [34], Figure 1. 


X (nm) 

V'a (au) 

238 

65.3 

240 

58.8 

248 

36.7 

254 

24.9 

265 

14.0 

270 

11.0 

276 

5.01 

280 

3.09 

289 

0.341 
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14.2 Analysis of photoemission data 

...the variation of the photocurrent with frequency and temperature predicted by Fowler's theory are due directly to the characteristics 
of the Fermi distribution of velocities among the electrons, and to nothing else. The success of the theory thus constitutes one of the most 
direct and most convincing tests yet obtained of the applicability of the Fermi-Dirac statistics to electrons in metals. 

- Lee A. DuBridge [204] 

When compared to experimental data, the agreement afforded by Eq. (14.16) is striking. We use the data behind 
Figure 1 of DuBridge and Roehr [34] showing the ratio of the electron current to the incident intensity as a function of 
wavelength for room temperature conditions from molybdenum, reproduced here in Figure 14.3 and in Table 14.1. 
Figure 14.3 is not the quantum efficiency because the wavelength has not been inserted (as in the example after 
Eq. (14.6)), that is, the y axis is J q /I x , but QE oc J q /(kl x ). Several curves based on Eq. (14.16) with different values of 
Schottky-reduced work function $ = ® - y/4QF are overlaid on top of the data, showing that the work function can be 
approximated from this data even though the wavelength dependence of the reflectivity is ignored. 



CHAPTER 15 


Baroody equation 


The calculations are almost entirely classical and are intended to give insight into the mechanism of emission and to emphasize properly 
the influence of electron scattering and absorption ...Although the development gives some insight into a variety of secondary emission 
phenomena, it is most successful in showing how the Sommerfeld model can clarify the role of work function, and account in a qualitative 
way for the velocity distribution of the secondaries. 

- Eugene M. Baroody [35] 

Baroody's equation gives a rough account of the approximate yield (number of secondary electrons emitted versus 
the number of incident primaries) of emitted electrons after a metal or other material, such as the diamond thin film 
shown in Figure 15.1, is struck by high-energy primary electrons. Typical operational parameters are primary electron 
energies E 0 of hundreds of electronvolts to tens of kiloelectronvolts. Many metals have maximum secondary yields of 1-2 
(e.g„ Pt has a yield of 1.8, semiconductors like Si or Sb have yields of 1.1-13, salts like NaCl have yields of 6.8, oxides 
like MgO have yields of 24; some good photoemitters like Cs 3 Sb also have high yields, and pure diamond has a very high 
yield of over 100 [207]). Good accounts of secondary emission are given by van der Ziel [152], Lye and Dekker [208], 
Dekker [141], and Hacehnberg and Brauer [171], although the present account is after ref. [209]. A history of the early 
theory efforts is provided by McKay [170]. 


15.1 Approximations 


...it is Zeus himself, the Olympian, who gives people good fortune, /to each single man, to the good and the bad, just as he wishes; /and 
since he must have given you yours, you must even endure it. 

- Homer 1 


Whereas thermal and field emission were surface effects (the barrier to emission being the largest determinant to the 
current density), photoemission 2 and secondary emission are concerned with a three-step process beginning with the 
fortunes of the excited electrons. For secondaries it unfolds as first, the penetration of highly energetic primaries that 
burrow their way deeply into bulk, leaving secondary electrons in their wake as a consequence, second, the propagation of 
the secondary electrons that do not suffer recombination or scattering loss events back to the surface, and third, the ability 
of a secondary electron to surmount the surface barrier and be emitted. Such a description is imprecise for secondary 
emission because three kinds of emitted electrons are a consequence of the fortunes they endure after a material is struck 
with a high-energy primary beam: 

1. back-scattered primaries: primary electrons that have suffered a collision and changed their direction to the surface, 
to be emitted with approximately half their incident energy 

2. intermediate energy primaries: either primaries that have given up much of their initial energy yet still have a 
respectable amount left, or secondaries that have acquired energy from a collision with a primary 

3. low-energy secondaries: electrons, many in number, and generally with a low energy (< 20 eV) that have been 
excited by the primary electrons. 

The first group causes a high-energy peak at about half the incident beam energy. The second is poorly represented 
in the emitted population. It is the last group, sometimes called the true secondaries, that are generated by primaries 
whose large initial energy is bled off through repeated collisions with the bulk atoms: these secondaries diffuse through 
the bulk material until they either suffer recombination or get to the surface where, if their energy is favorable for it 


1 The Odyssey of Homer (trans. Richmond Lattimore). New York: Harper & Row, 1967, Book VI, lines 188-190. 
2 The three-step model of photoemission of Spicer follows an analogous development in Section 31.3. 
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Figure 15.1 Diamond secondary emitter. A thin diamond film developed for a diamond amplifier 
based on secondary emission. The white arrow points to a diamond film tens of microns thick where 
the high-energy primary electrons create many secondaries. The yellow square below the film is less 
than 1 inch across. The orange disk to the left is a US penny. Image courtesy of J.E. Butler, B.B. Pate, 
and J. Yater (NRL); see also Figure 4 of ref. [210]. 


(larger than the work function), they are emitted. Such "true" secondaries - the good ones for electron sources that have 
endured much before their wanderings have brought them to the surface - are the ones under discussion. 

Naming the secondary emission equation after Baroody gives short shrift to the contributions of van der Ziel, Lye 
and Dekker, Jonker, Young [211], and others. Moreover, Baroody's equation will not even be the last word. How¬ 
ever, Baroody was the first observe that if 8 m is the maximum secondary emission yield of a material being struck 
by primary electrons of energy E m (with E and 8 being the primary energy and the resultant yield), then the yield 
ratio 8(E)/8{E m ) as a function of E/E m tends to follow a universal curve for both metals (Figure 15.2) and insulators 
(Figure 15.3). 

The relationship between photoemission yield and Eq. (11.1) is easier to visualize when it is "one photon to one 
electron," but in secondary emission, high-energy primary electrons generate many secondaries through collisions with 
atoms, after which the escape of the secondaries has much in common with the photoemission processes. Assume that 
each secondary is created with more or less the same energy A E, corresponding to the amount of energy needed to 
liberate a core electron from an atom. Primary electrons, with their considerable energy, plow straight through the mate¬ 
rial, shedding secondaries as they go. If the energy lost between the positions x and x + 8x is 8E, and if 8E/8x ->■ dE/dx, 
then the number of emitted secondaries whose origins are at x is dS = BdE/dx? Next, contra Tolkien (Section 7.3) some 
who wander do get lost, through recombination or a reduction of their energy to below escape conditions. Their loss 
will depend on how many are present N(x) and the fraction of them that get lost per unit length y, or dN/dx = —yN. 



E/E m 


Figure 15.2 Baroody secondary emission data for various metals compared to Bq. (15.8) for various values of n, for which the values 1.2 <n< 1.3 work 
best. Data digitally extracted from Figure 1 of Baroody [35]. 


3 The usage of B follows convention: it should not be mistaken for the the 


B used in Chapter 13 when discussing field emission. 
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Figure 15.3 Lye-Dekker secondary emission data for the insulator MgO compared to Eq. (15.8) for various values of n, for which the values 
1.35 <n< 1.4 work best. Data digitally extracted from Figure 5 of Lye andDekker [208], 


Solving, N(x) = N 0 exp(-yx). But N and BdE/dx refer to the same secondaries, and so, summing over all generated sec¬ 
ondaries and accounting for their losses incurred in racing to the surface. 


5(E) = -B 


cm• 


(15.1) 


where R(E) is the range of the primary electrons of beam energy E, that is, how far the primaries penetrate into the bulk 
material before stopping. There are reasons (to be treated at greater length in the Bethe model of Section 32.1.1) behind 
the intuition that slower electrons will shed their energy faster than higher energy ones, and so to surmise that dE/dx 
decreases with increasing E as 4 


dE _ A_ 

dx * 1 


(15.2) 


The case n = 2 is Whiddington's law, from which Baroody deduced the universal relation between 5/5 m and E/E m . 
A general power law relation better accounts for the penetration depth studies of Young [208, 211] in addition to being 
of greater flexibility in matching Figures 15.2 and 15.3. Recovering Baroody's deduction without specifying n (yet) is 
possible. 

The form of Eq. (15.2) makes trivial the finding of the range R, where the primary electron finally sputters to a stop 
in the bulk material, or 

i/v- ,i5 ' 3) 


where E 0 now represents the initial energy (i.e., the energy of the primaries in the incident beam of electrons) so that it 
is visually distinct from E(x); it and the range R(E 0 ) are then given by 


E(x) = (E" - nAx) 1/n (15.4) 

R(E 0 )=^ (15.5) 

such that E[,R(i? 0 )] = 0, which seems circular, but is not. Whiddington's law recommends n = 2 so that the range is parabolic 
in energy, but measurements by Young for aluminum [211], reproduced in Figure 15.4, show that n= 1.35 is a better 
approximation. Eq. (15.4) makes possible finding the solution to 5(E 0 ) in Eq. (15.1): inserting the former into the latter 


4 The power (n - 1) is used here; Lye and Dekker [208] use n, so cave, lector iterum. 
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Figure 15.4 Primary electron range vs primary beam energy for aluminum, based on Figure 5 of ref. [211]. The lines were all calculated according to R n 
[pm] = 0.271(E o /4.098 keV)". Young's data digitally extracted (black dots). The best match occurs for n = 1.35 (red open diamonds on solid red line). 

and introducing the dimensionless term X = yR(E 0 ) gives 


SW ‘ B (n)L , 1%./.'""W 

(15.6) 

G n (X) — e~ x / exp ( Xs n ) ds 

Jo 

(15.7) 

where, in going from the first integral representation to the second, the substitution t = s n — 1 is made. Baroody's analysis 
made use of the notation F (Lye and Dekker used G); also Baroody sets up the integral such that the upper bound is a 
variable so as to make use of tabulated functions (a desirable arrangement in his day). Here, having a fixed upper bound 
of 1 allows for a bit more transparency in the argument to be made. Clearly (either immediately or after some reflection), 
X depends on the primary energy E 0 , and as 8 -*■ 8 m as E 0 -> E m , then so too X -* X m . Introduce the dimensionless parameter 
p = E 0 /E m and observe X = X m p n by its definition, so that 


S _ G n (X m p n ) 

8 m P G n (X m ) 

(15.8) 

The case of n = 2 gives Baroody's result, and so Eq. (15.8) is referred to here rather generously as Baroody's equation 5 
although Baroody presented it differently: his Eq. (24) of ref. [35]) appears as 


s 1 /-0.9241P 

f = 1.848e- (0 - 9241 ^ / e* 2 dx 

°m L =2 Jo 

(15.9) 

which a change of integration variables shows is equivalent to Eq. (15.8) for X m (n = 2) = (0.9241) 2 . But to the point: 8/8 m 
depends only on the variable p (and the dimensionless constant X m which, recall, will depend on the value of n chosen). 
That is, there is a universal relation that the yield (in relation to its maximum value) satisfies, and although Baroody 
came to his conclusion using n = 2, Figures 15.2 and 15.3 show that other values of n perform better. 

EXAMPLE: Find the limiting behavior of G n (X) as X -* 0 and X -*■ oo. 

SOLUTION: For small argument, use the expansion e* « 1 +x + {x 2 /2) in the integrand of Eq. (15.6) to obtain 

G n (X « 1)» 1 + Xn + (An ' >2 

” n+1 («+l)(2«+l) 

(15.10) 


5 The present treatment has rr 


n common with Lye and Dekker [208]: their Eq. (11) is equivalent to the present Eq. (15.8 
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For large argument, use the expansion s" = 1 - n{\ - s) in the argument of the exponential in the integrand of 
Eq. (15.6) to obtain 

g,(A » 1) « 1 ~ e ” (15.11) 

The limiting forms are compared to a numerical evaluation in Figure 15.5. 


EXAMPLE: Find an approximation to 8(E 0 ) and determine its accuracy. 

SOLUTION: Using Eq. (15.11) to approximate G n (A) along with the relations p = E 0 /E„ 

, and A = A m p n in Eq. (15.6) 

results in 

(15.12) 

5 m P \ 1 - e~^ n ) 

The approximation is compared to a numerical evaluation in Figure 15.6. 




X 

Figure 15.5 The function G n (A). Comparison of G n (A) to its asymptotic limits of Eqs (15.10) and (15.11) for the case n = 1.35. Although Eq. (15.11) was 
derived under the assumption of X » 1, it is seen to perform reasonably well over the entire range of A. 



p = E 0 / E m 

Figure 15.6 S/5 m vs E 0 /E m . Comparison of the exact (Eq. (15.6)) to the approximate (Eq. (15.12)) forms of S/S m for n = 1.35, for which A m = 1.7944. 
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15.2 Analysis of secondary emission data 

The mob rushes in where individuals fear to tread. 

- Burrhus Frederic Skinner 6 

Much depends on the parameter A in Eq. (15.4) for E(x), and the parameter B in Eq. (15.6), without which the analysis 
of data is hamstrung. Approximations will be considered in Section 32.1.1, so for now Baroody's equation will be used 
to explain the genesis of the n curves in Figures 15.2 and 15.3, and show how the value of n is determined from the 
dimensionless form of <5/<5,„ given in Eq. (15.8), compared to the approximate form of Eq. (15.11). The maximum is 
characterized by d p (8/8 m ) = 0 at p = 1, and so differentiation of Eq. (15.8) and the evaluation of the result at p = 1 gives 
a function H n (X) of the form (apart from multiplicative factors that can be ignored as ratios are taken) 

H„(X) = f (1 + nX(f- l))exp(2(s "-\))ds (15.13) 

Jo 

such that fT„(2,„) = 0. The zeros are indicated in Figure 15.7. As n decreases, the zero-crossing moves to higher vales of X. 
Analytically, H 1 (2) = e~ l , so that X m (n = 1) = oo. As for the other zeros, 1 /X m {n) « 0.952 n - 0.730, as shown in Figure 15.8. 

Although the zeros of H n (X) can be graphically determined from Figure 15.7, a numerical method known as bisection 
works better in practice for finding roots. In bisection, a maximum and minimum value of X is chosen such that H n (X mjn ) > 
0 (X mjn = 0 works well) and H n (X max ) < 0 (a large value like (X max = 10 works well). Then the logic unfolds according to 

Require: Initially set X min = 0. 

Require: Initially set X max =10. 
for j = 1 to IV do 

Generate X h = (X min + X max )/2 
If H„(X h ) > 0 then X max -» X h 
If H n (X h ) < 0 then X min -+ X h 
end for 

X m = ( X min + X max )/2. 

The number of times N that the calculation of X h and its replacement of X max or X min is performed is governed by the 
desired level of accuracy: if on the first iteration X max - X min = 1 (for example), then after N iterations, X max - X min = 1 /2 N . 
Thus, every 10 iterations improves the accuracy (constrains the region in which X m is found) by 3 orders of magnitude 
because 2 10 = 1024 and X m lies at their midpoint by construction. An example code implementing the bisection method 



Figure 15.7 H n (X) function. Behavior of H n (X), defined in Eq. (15.13), for various n such that = A m ) = 0. 


5 Burrhus Frederic Skinner, Walden Two (Hackett Classics). Indianapolis: Hackett Publishing Company, Inc., 2005, p. 37. 
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Figure 15.8 Zeros of the function H _(A), designated 2 m (w). To a good approximation, 1 /2 m (n) is linear in n, the best fit line in red, with the equation for 


k m (ri) -> y(n) likewise in red, although the fit wrongly suggests that 2 m (l) is finite. 

as applied to the determination of the zeros of H n (X) is given in Section A3.7.1. N can be kept small (N = 16) by making 
the final step use linear interpolation between H l = H n (X max ) and H 0 = H n (X min ) to obtain X m . 

15.3 Subsequent approximations 

It is observed that in spite of the great difference in yields the data follow rather closely a single reduced yield curve, which implies that, 
as with metals, the individual crystal characteristics modify the yield curves principally in the role of scale factors and do not influence 
markedly their functional form. 


- Robert G. Lye and A. J. Dekker [208] 


Transmission electron measurements of primary electrons in the energy range 0.3 keV <E 0 < 7 keV through aluminum 
oxide films led Young [211 ] to conclude that n = 1.3 5 is a better value to match the data, in contrast to the Whiddington 
law estimate of « = 2. In fact, n = 1.35 was found to apply for primaries up to 7 MeV. Lye and Dekker proposed the 
"power law" formula for the range seen in Eq. (15.2), but as they note. Young's experiments cast some doubt on that as 
well: the fraction of energy lost by the primaries behaves like 



(15.14) 


where the subscript eff is for "effective." This is a rather different result: in contrast to the power law relation which 
posits that the range of all primary electrons is the same. Young argued that the data revealed an almost linear decrease 
in the number of electrons with depth. Usage of Eq. (15.14) in Eq. (15.1) makes for an easy integration: recalling that 
X = kR{E 0 ), then 



(15.15) 


resulting in, in the terminology of Lye and Dekker (Eq. (17) of ref. [208]), a "universal reduced yield equation" given by 



(15.16) 


where, as before, p = E 0 /E m . Such an equation is hauntingly familiar from Eq. (15.12), but slightly different in that 
the factor of n in the exponential terms is absent. That is not much of a difference (the A factors could be rescaled, 
for instance), but Young's formulation becomes convenient when dE/dx is evaluated using the Bethe model of 
Section 32.1.1, and there it shall wait. 
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Child-Langmuir law 


Creon: You knew the order not to do this thing? 

Antigone I knew, of course I knew. The word was plain. 

Creon: And still you dared to overstep these laws? 

Antigone For me it was not Zeus who made that order. /Nor did that Justice who lives with the gods below/mark out such laws to 
hold among mankind. /Nor did I think your orders were so strong/that you, a mortal man, could over-run the gods' unwritten and 
unfailing laws. 

- Sophocles 1 


Antigone, the daughter of King Oedipus and his wife/mother Queen Jocasta of Thebes, defied the alleged tyrant 
Creon, Antigone's uncle, by giving her brother Polyneices a proper burial, after Creon ordered Polyneices' body to be 
left on the battlefield for the carrion-eaters after Polyneices' attempt to take Thebes by force. For that defiance, Antigone 
was sentenced by Creon to death. The great eponymous play about her fate, written by Sophocles around 441 BCE, 
explored the differences between the laws of the state and moral (or even natural) laws. Given Antigone's family tree, 
her positioning herself as a defender of the divine moral order may seem tenuous, but the larger philosophical point 
implicit in her argument, that there is a difference between man's law (injunctions for his benefit) and laws of nature 
(descriptions of how things are), parallels similar contemplations on what a "law" in science is. 

More than 2000 years after Antigone, the experiments of Langmuir on the Edison effect (aka thermal emission) [132] 
showed that the amount of current I drawn between a planar diode is related to the voltage V on the anode and the 
anode-cathode (AK) separation D by I oc p 3 / 2 /D 2 (Child having found the same relation for positively charged ions). 
Langmuir himself considered how the relation was modified for current flow between coaxial cylinders [146] and 
concentric spheres [147]. Interest extended to triodes, tetrodes and pentodes, given that space charge controlled their 
performance, and then on to klystrons and traveling wave tubes (TWTs). The relation kept adapting to deal with each 
new complication to the physics, such as temporal variation of the electric field, initial velocity of the electrons from 
the cathode, a statistical distribution of initial velocities, relativistic effects, and so on. The problems are incredibly rich, 
and the physics deceptively subtle, but the caveats and special conditions even for the simple diode problem led van der 
Ziel (p. 155 of ref. [152]) to advise about Child's so-called law that "This expression is usually not very accurate..." (he 
gave a more accurate version, as will we). Such a history of a law has more of Creon in it than Zeus, but referring to the 
limits placed on electron emission by cathode geometry and anode potential as the "Child-Langmuir limiting relation for 
particular circumstances" lacks both panache and gravitas compared to referring to it as a "law". 

But even that flippant quip is a bit off the mark. Observe that the other canonical equations are referred to as equations, 
but the Child-Langmuir relation is often referred to as "Child's law". Why the difference? Simply, because a "law" is 
an empirical relation that works even when there is not an underlying theory to explain it (a usage of "law" different 
than the lay understanding of it). Aye, there's the rub. In elegant phrasing, Thomas Kuhn, 2 in an almost Jeffersonian 
sentiment, argued 

... empirical laws... can be confronted directly with observation or experiment... they fill an apparent gap, supplying information that 
was previously lacking. As science develops, they may be refined, but the original versions remain approximations to their successors, and 
their force is therefore either obvious or readily recaptured. Laws, in short, to the extent that they are purely empirical, enter science as 
net additions to knowledge and are never thereafter entirely displaced. 

1 Sophocles, Antigone, in The Complete Greek Tragedies: Sophocles I (trans. E. Wyckoff). Chicago: University of Chicago Press, 1954, lines 447-455. 
2 Thomas S. Kuhn, The Essential Tension: Selected Studies in Scientific Tradition and Change. Chicago: University of Chicago Press, 1977, p. 19. Compare 
Jefferson's quote in Section II. 
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Read closely, such a description applies rather well the evolution of Child's law over decades, and points to its conse¬ 
quential importance. Still, although the theoretical foundation has much improved, the nickname has not been displaced 
either. 


16.1 Constant density approximation 


Die Tragdie besteht darin, dafi sich der Baum nicht biegt, sondem bricht. 


- Ludwig Wittgenstein 3 


The first space-charge behavior to be considered, that associated with charged particles in fixed locations, represents 
a solvable instance of Poisson's equation. It has, however, more to do with band bending in semiconductors than it does 
the space-charge limited flow to be encountered in Section 16.2. Understanding it is necessary to the explanation of 
why band-to-band tunneling, a kind of "breakage" in holding off current flow, occurs in Section 24.4 (see in particular 
Figure 24.13), where the bending becomes extreme enough to allow tunneling. 

For continuity, though, the treatment shall be conducted in the language of AIC gaps and capacitors. The consideration 
of two planar conductors, the anode being at V a and the cathode being at zero potential energy, a distance D apart with 
the intervening gap filled by a constant charge density p 0 is a model of space charge in semiconductors caused by doping 
(impurity atoms that give up or take up charge, being "donors" and "acceptors," respectively). It is also a good simple 
model to introduce methods and notation. The potential energy V{x) between the parallel plates is found by solving 
Poisson's equation, which, in one dimension, is 

-f-V(x) = -^-p(x) (16.1) 

ax e 0 


Note a few things about this form compared to usual accounts in electrostatics [99, 100]: first, because V(x) is a potential 
energy and p 0 is a number density, a factor of q 2 on the right-hand side is explicitly present; second, because electrons are 
the particles under consideration, the anode potential energy is -V a . Therefore, although Eq. (16.1) is equivalent to more 
engineering-friendly accounts, there are differences to heed. Had not Benjamin Franklin, as clever as he was, randomly 
established the convention for electricity such that "negative" charge became associated with the electron [212], there 
would be a friendlier symmetry between the equations for potential energy and potential. 

A constant density p{x) = p 0 makes the first integration of Eq. (16.1) trivial: with the identification Fix) = -dV(x')/dx, 
then 

Fix) -F c = —x = UjcQPoX (16.2) 


where —F c is the field at the surface of the cathode (insofar as possible, terms are taken as positive and their signs explicitly 
evident), and the second form makes use of Eq. (2.8). The second integration is trivial, and it follows that for the anode 
potential energy at V a and for an AK gap D, then, if the cathode is said to be grounded (held at V(0) = 0), a second 
integration yields 

^ = F c + 8nQp 0 D (16.3) 

In the absence of charge in the AK gap (p 0 = 0) then the usual parallel plate capacitor equation F c = VJD results, but 
other circumstances are of greater consequence. The field at the surface of the cathode can be made to vanish if p 0 D = o 0 is 
such that VjD = 1 6xQo 0 = o 0 /e 0 \ such an equation is well-known in electrostatics, 4 as a one-dimensional sheet of charge 
gives rise to a field on either side of the sheet that does not taper off with (i.e., is independent of) distance. Consequently, 
a stack of sheets gives rise to the same field as if all the charge density were in a single sheet. Thinking of the sheets as 
having a thickness dx such that dF = 1 6nQp 0 dx regains Eq. (16.2). 

The relation of depletion layers to the doping of semiconductors will figure prominently in Section 24.4 and be related 
to Eq. (16.3). An alternate designation is the "space-charge region" where band bending occurs as near the surface 
of a semiconductor, or near semiconductor/metal interfaces [213], For now, a simple treatment replaces the vacuum 


3 "You get tragedy where the tree, instead of bending, breaks," Ludwig Wittgenstein, Culture and Value (trans. Peter Winch). Chicago: University 
of Chicago Press, 1984, p. 1. 

4 See, for example. Section 3.6 (Surface Charges and Double Layers) of ref. [99]. 
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between the parallel plates in the capacitor model with an insulator (diamond or silicon dioxide Si0 2 , for example) with 
a dielectric constant K s , which is accounted for by replacing e 0 -» K s e 0 in Poisson's equation. The dielectric properties of 
the insulator decrease the internal field compared to what would exist for a vacuum. In the absence of charges in the 
gap, the (constant) field is F = V a /(K S D) for a dielectric-filled capacitor. 

If the dielectric material contains charged impurities introduced by doping, then p 0 will be related to the doping density 
N d . Of particular importance is the length over which the field is completely shielded out, known as the depletion width 
or w. Shielding of the field is taken to mean F c -*■ 0 in Eq. (16.3) or, equivalently, F(w) = 0 in Eq. (16.2). Finding the 
depletion width w is straightforward. Because N d is constant 5 with typical values of 10 15 -10 19 atoms/cm 3 , a second 
integration of Eq. (16.2) with F( 0) = 0 as a boundary condition gives 


V a 


q 2 N d a 

2 K s e 0 W 


= 8 tcQN'i'y 


(16.4) 


Observe the units: V a has units of eV, Q of eV-nm, and w of nm. Therefore, because N d has units of #/nm 3 , both sides of 
the equation are in units of eV. 


EXAMPLE: A diamond thin film is exposed to an external field of F vac = 10 eV/pm (corresponding to an electric 
field of 10 MV/m). Assuming a doping density of 10 15 boron atoms per cm 3 and that the donors are singly charged, 
find the depletion width w. Why is the dielectric constant of diamond (K s = 5.7) not needed in this calculation? By 
how much do the bands bend? 

SOLUTION: The field just inside the diamond is F(0) = F vac /K s , so that, using Eq. (16.2) with F(w) = 0, it is found 
that 


F vac _ 4 

K s ~ K s e 0 W 


1 6lrQPo ]t 


(16.5) 


or w = F uac /{\6xQp 0 ) = 552.64 nm (f/ri), where/is the field in eV/pm and n is the doping density in 10 14 atoms/cm 3 . 
With / =10 and n = 10, then w = 0.52264 pm. The independence of K s is a consequence of specifying the vacuum 
field rather than the internal field at the surface boundary. Then, from Eq. (16.4), V a = 0.4848 eV. 


16.2 Constant current approximation 


Plus Vobstacle est puissant, plus■ 


Lest van der Ziel's limited praise of Eq. (16.18) seem confusing, return to the odd bit about excess current being 
returned to the cathode when F m changes from accelerating electrons away from the cathode to pushing them back: that 
is, a potential barrier is created that must be surmounted for current to cross the gap, and its neglect undercuts the ability 
of J CL to predict the current. The origin and location of the barrier may be found via the constant current approximation, 
but will take some development. 

If the current density J 0 « q 2 p(x)v(x) is taken as constant, 7 * * that implies p(x) = J 0 /q 2 v(x), and so the number density 
varies with x. From conservation of energy (l/2)mv(x) 2 = V m - V(x), and so Eq. (16.1) becomes 




(16.6) 


5 N d is constant as far as present needs are concerned here. In fact, N d (T) is a temperature dependent quantity and the donors ionize only when 
conditions are favorable for doing so. 

6 "The more powerful the obstacle, the more glory is received", L'etourdi, ou les Contre-Temps (The Blunderer), EBook #6563, Project Gutenberg, 
http://www.gutenberg.org, 2004, v.xi. 

7 Care is required here: J is taken as a charge current density with units of A/cm 2 , yet p is being treated as a number density with units of #/cm 3 , 

in sympathy with usual treatments of doping density which do likewise, rather than a charge density with units of q/cm 3 , as was the case in 

Eq. (3.5). To respect that, an additional q must be appended to the coefficient. 
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where V m is a reference potential, either the surface of the cathode for now, or where the potential energy is at a maximum 
for later, so that letting U(x) = V m - Vix) is convenient. Multiplying both sides by F = dU/dx results in 



d(F 2 ) = l^Qj + f2™dU = 2aU~ l/2 dU 
q V U 

(16.7) 


8tt QJ. / - 

a = - -\]2m 

4 

(16.8) 

Each side is n 

iow easily integrated, giving 



F 2 - F 2 = 4 ayju 

(16.9) 


where F(0) = F m is the field 
that F = dU/dx, 


the cathode surface (or potential energy maximum), where 17(0) = 0. Then, remembering 


f Ua dU 

Jo (4aU 1 / 2 +Fl,) 1 / 2 


(16.10) 


for which the right-hand side is straightforward but the left-hand side is not. On the left, there is 
to U 1/2 = (F^/4a)u and the integral is 

J V« +1 3 


change of variables 
(16.11) 


After rearranging a bit, recalling U(JD) = -V a if V m = 0 where D is the AIC gap separation and V a refers to the anode, then 


haV^ 2 -FlJj^j4aV x J z + F 2 = 6a 2 D - F * (16.12) 

In a manipulation more tedius than tantalizing, that equation can be recast as 

F l, ~ F mj^ + ( 4V Z /2 ~ 9aD2 ) = 0 (16.13) 


The limiting cases of this equation are instructive. First, observe that a is proportional to J 0 (Eq. (16.8)), so that what is 
said about one applies to the other. The first limit of interest, then, corresponds to no current flowing across the AK gap, 
or a -*■ 0, for which Eq. (16.13) is trivial (F m D — V a = 0) and reproduces the usual relation between anode potential, gap 
distance, and field: the potential energy decreases linearly across the gap. If the current density is just very small rather 
than zero, then to leading order in a, 




(16.14) 


which indicates that as more current is injected into the gap (i.e„ as J 0 becomes larger), the field at the surface F m 
decreases, and the potential energy acquires a curved characteristic. As ever more current is injected, at some point the 
surface field itself will vanish and the potential energy across the gap will have a parabolic shape such that the maximum 
of the curve is at the surface of the cathode and the surface field F m = 0. Under that special condition, J 0 is called J CL , and 
from Eq. (16.13) it corresponds to a = 4V a 3/2 /9D 2 , or 





(16.15) 

(16.16) 


In fact, if dimensionless terms / and j are introduced such that F m =fV a /D and; = J 0 /J C7 , then Eq. (16.13) takes on a 
distinctly more appealing form of [214] 

/ 2 (l-/)= ifj(l-T) (16.17) 


such that j = 0 corresponds to / = 1 and j = \ to / = 0. Observe that the limiting cases f 2 1 and j 2 -> 0 give rise to 
Eq. (16.14). 

The representation of Eq. (16.15) for J CL is not in a form commonly encountered. To connect with its common rep¬ 
resentation, let V a -*■ q<p a , where cp a is the anode potential in volts, and use the definition of Q in Eq. (2.8) to obtain the 
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conventional and venerated form of Child's law representing the maximum current that 
before a non-zero fraction of emitted electrons are returned to the cathode, or 


(^)‘V 


where the coefficient of <p 3 / 2 is often referred to as the perveance and designated K. 


be pulled from the cathode 


(16.18) 


EXAMPLE: What is the perveance K for D = 1 cm? 

SOLUTION: Inserting the values of e 0 = 8.8542 x 1(T 12 C/V-m and, for the electron, q = 1.6022 x 1(T 19 C and 
m = 9.1094 x 10 -31 kg in MKSA units, one finds 


an enjoyable calculation for those who are inordinately fond of exponential notation. One may alternately start 
from J CL = K'vl /2 using Eq. (16.15). As Q = 3.6 eV-nm and me 2 = 5.11 X 10 6 eV then K'D 2 = 0.01457 q/eV 3 ' 2 is. 
Observing that (1 q)/(l fs) = 160.22 pA gives K'D 2 = 2.334 pA/eV 3/2 . Using K = K'q 3/2 with q = eV/V returns the 
previous result. 


Let the position of the barrier maximum be at x m in front of the cathode surface and at a potential V m higher than that 
of the cathode (held at V = 0), as in Figure 16.1. The yellow shaded region would give rise to Eq. (16.13) if x m were at 
the surface of the cathode and V m were at the cathode potential. Thus, one expects that the modifications to Eq. (16.13) 
of V fl -*■ V a + V m and D -*■ D — x m correct the equation, or 


q (V a + VJ 3 ' 2 
l&xQyflm ( D ~ x m) 2 


(16.19) 


But determining the values of V m and x m is not straightforward: as methods for their determination are found elsewhere 
for thermionic emitters [148, 152], an abbreviated account suffices. If the cathode emits a current J 0 such that the emitted 
electrons encounter a potential hump that only a fraction r of them surmount, then rJ 0 is the current that reaches the 
anode, and (1 - r)J 0 is reflected back to the cathode. If the AK gap is divided into two regions. Region 1 being before x m 
and Region 2 being after it, then p 2 (x) = rJ 0 /v 2 (x), but in Region 1 the reflected electrons also contribute to the density. 



the left, anode (blue) to the right, 
rto the unshaded region (to the left 
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and so summing the there and back components is needed, or pfx) = rJ 0 /v ] (x) + (1 - r)J 0 /v l {x) or (2 - r)J 0 /v 1 (x). This 
entails the solution of two equations. 


U X (X) = —— ~T~T 

dx 2 £g «i(4 

*a0 i .-£e=g. 

dx 2 2 Eg v 2 (x) 


(16.20) 

(16.21) 


subject to the boundary conditions that 0) = 0, U l (x m ) = U 2 (x m ) = V m , and U 2 (D) = —V a , from which x m and V m can 
be determined. Now, the limiting current density is rJ 0 = J C i{y a + - x m ), where J 0 is the current density the cath¬ 

ode generates under particular conditions (like temperature for thermionic cathodes) regardless of how much current is 
returned, and van der Ziel's comment is now properly understood. Moreover, current density has been treated as con¬ 
stant, but oscillations and complex behavior attend particular conditions such that interesting physics continues to bedevil 
"space-charge limited" flow [215-218], 

A reasonable question would be, why run cathodes so hard that a potential energy maximum forms outside the 
cathode surface? If temperature is required for thermal emission, this means more power is expended heating the 
cathode than needed, and for many systems (particularly space-based ones) power is at a premium. The short answer 
is because cathode surfaces are typically non-uniform, to have a positive F m at the cathode pulls electrons away from 
the surface as soon as they are liberated, and so whatever non-uniformity of emission exists at the cathode surface is 
replicated in the beam produced. Often, this is undesirable (Section 33.4). The creation of a virtual cathode into which 
electrons are injected and distributed thereby has considerable advantage in creating a smooth, uniformly emitting beam 
[215, 219, 220], An additional advantage is that flicker noise is reduced, which is a problem when space-charge 
smoothing cannot be employed [221], The transition from temperature-limited emission to space-charge limited 
emission reappears in the introduction of the Miram curve (Section 29.1). 


16.3 Transit time approximation 


A time of love, a time of hate/A time of peace... 

And a time for every purpose under heaven. 

- Pete Seeger 8 


Focusing on current density and space charge obscures the importance of another parameter. Because the charge in the 
gap determines the space-charge reduction in the field at the surface of the cathode, that other parameter must be time or, 
more specifically, how long the electrons take to get from cathode to anode, otherwise known as the transit time. Charge 
per unit area a creates a field F from a sheet of charge (an isolated sheet gives F = cr/2e 0 : recall the discussion following 
Eq. (16.4)). For constant current density, a = J 0 r, where r is the transit time. Seen another way from Eq. (16.3), the 
space-charge field goes as q 2 p 0 D, but from p g = J 0 /q{v) and (v)r = D, the views are equivalent. 

The transit time model originates from a model of vacuum capacitance [222], and can be extended to treat relativistic 
and quantum mechanical conditions [149] (but not here). It has been used to infer scaling laws for cylindrical and 
spherical diode geometries [223], It is useful when (as for photoemission) the pulse duration of the beam bunch can be 
shorter than the transit time across the gap [224], as then not only can there be more current than allowed by J CL , but 
an oscillatory behavior in the THz frequency can be seen via simulation for a train of electron bunches [225], But it is 
the pedagogical utility of the transit time model as a means to understand space-charge behavior that counts now. 

Under equilibrium conditions for constant current density, the transit times of all electrons across the gap are the same 
in the planar diode if the initial velocities are negligible. Are they? Recall that the mean velocity of the emitted electrons 
for thermal emission is approximately 


(v x m « 


/ 0 °° (hk x /m)e~ eE * dk x 
fo°° e-^dk x 


(16.22) 


where E x = (hk x ) 2 /2m. The approximation changes a bit if the emission mechanism is tunneling: for field emission, as 
the electron emerges from the tunneling barrier, its kinetic energy is comparable to the Fermi energy g. Compare such 


8 Pete Seeger, lyrics to "Turn, Turn Turn", 1954. 
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energies to those a short distance away from the cathode surface, where the emitted electron has an energy comparable 
to the potential drop near the extraction grid (held at kilovolts for gridded dispenser cathodes) or gate (held at hundreds 
of volts for Spindt-type field emitters), values which tend to dominate the few electronvolts or less of the emitted electron 
kinetic energy. Conditions for photoemission and secondary emission, in which the cathode surface is subject to fields of 
several megavolts per meter (corresponding to F m 10 eV/pm) are comparable. In all these cases, the initial kinetic energy 
(and therefore velocity) only mildly affects the transit time. A simple account can therefore treat the initial velocity as 
negligible. 

If the transit time is known. Child's law can be almost trivially inferred from Eq. (16.3) (although this will be put on 
better footing subsequently) 



(16.23) 


with J g -*■ J a when F m -* 0, but a difficulty with that is that the definition of "transit time" is not unique in that it can 
be approximated different ways: it can refer to how long one electron by itself takes to cross the gap (the "ballistic" or 
space-charge-free transit time), the actual transit time (which begs the question of what J CL is), or a transit time associated 
with emission occurring all at once so that an infinitesimally thin sheet crosses the gap (the "pulse" transit time), as may 
occur for rapid photoemission pulses. For all that, the transit time does not vary as strongly with the anode potential 
as does the current density, so that a reasonable estimate of t, if one can be had, is good enough to anticipate when 
space-charge effects begin to matter. Stacked next to the exact approach of Eq. (16.17), the virtue of "good enough" is 
a rather anemic endorsement, but that is not the point: the transit time concept becomes useful when visualizing space 
charge and discussing how matters are different for multidimensional surfaces, even if the approximation is inexact. 

The ballistic or space-charge-free transit time t 0 is obtained using simple kinematical equations, and therefore easiest. 
From x(f) = x 0 + v 0 t + (a/2)t 2 with v 0 being the (neglected) initial velocity and ma = F 0 = V a /D from Newton's equation, 
then x(t) = D gives 



(16.24) 


Such a transit time is a measure of how long a particle takes to fall down a potential energy hill, as on the left side of 
Figure 16.2. 

The second transit time t is for when space charge is present, as on the right side of Figure 16.2, where the fall of the 
particle is impeded by charge in the gap. It is defined by 



(16.25) 


where x + (E) is the location where the electron of energy E emerges through or over the barrier characterized by the 
image charge potential, but generally x + «: D and so it is negligible. The sheet charge density varies with position, so let 
<r(x) be defined by 



(16.26) 



Ballistic 


Space charge 


Figure 16.2 Ballistic transit time (left) corresponds to how long a particle takes to fall down a potential energy hill when the field is linear, ii 
(right) when charge is present and the descent is thereby slowed. 
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The potential energy V(x) can be calculated from Poisson's equation subject to the conditions y(0) = 0, V(D) = V a , and 
d x V{0) = F, where F is the field at the surface, and so 



(16.27) 


(16.28) 


where N r is defined by this equation, but which is seen to be N z = f® o{x)dx/{Do{D)). Compare this to a different way of 
solving Poisson's equation, relying on the relatively common trick of multiplying both sides by 2 F = 2dV/dx to get 



(16.29) 


which, using the velocity v(x) = \2{V(x) + E 0 )/m\ 1 / 2 where mv 2 /2 = E 0 is the initial kinetic energy (KE), allows for a 
simple integration of both sides and gives (after a wee effort) 



(16.30) 


where F without an argument is F a F(0) or the field at the surface. In an exact approach, a second integration is required 
to obtain a self-consistent relation (see, for example refs [214, 226]), but the transit time approach proceeds differently. 
Assume the initial kinetic energy E 0 = 0, so that 



(16.31) 


Then, from a(JD) = {e 0 /q 2 ) ((V a /D) - F ) plus the right-hand side of Eq. (16.25) it follows 



(16.32) 


Thus r decreases as F increases. From both Eqs (16.25) and (16.28) 

v. q 

— = F= F+ —N t Jt 
D 0 e 0 T 


(16.33) 


As N t Jt is the average charge per unit area within the gap, the form of Eq. (16.33) is as expected, and it is the defining 
equation relating transit time to current density. Observe that the derivation depends on the one-dimensional relation 
between a and F: for two- and three-dimensional charge distributions field varies with distance, and the definition of r 
in such cases is still open. 

Space-charge limited current arises when F -*■ 0, for which J J CL' and the value of N r is affected by the choice of t. 
For z —<■ r 0 , as in Eq. (16.24), then N z = 9/8, a finding first reported in ref. [222], In contrast, if Eq. (16.32) is used for t, 
then N r = 3/4 when F -* 0. Showing that these limits are so is edifying. Using t 0 underestimates space-charge effects, but 
the F = 0 limit of Eq. (16.32) overestimates them. The usage of both in Eq. (16.33) easily identifies when space-charge 
effects are significant: field emission experimental data, for example, lie between the two choices [209], but showing 
that now is a bit premature until the geometry models are introduced. 

The two transit times considered so far, namely, the ballistic transit time r a of Eq. (16.24) and the averaged charge 
density transit time r(F) of Eq. (16.32), presume that the current density is unchanging. When it is not, a third transit 
time appears, and its consideration is instructive. Imagine what happens if charge is emitted in an instant, say by a 
short-duration laser pulse causing photoemission. The resultant sheet will take time to cross the AK gap, and the time it 
takes represents the third transit time. The density can be modeled as p(x) = (a/D)S(x - x(tj), where x(t) is the location of 
the charged sheet and a is the charge per unit area of the sheet. An important limiting case is when q 2 <y/e 0 of the sheet 
is enough to eliminate the field on the cathode: when that happens, let a -* <r 0 and q 2 a 0 = JSt = e 0 F 0 , where 8t <c t is the 
very short duration of the laser pulse injecting the charge. Excess emitted charge density is pushed back to the cathode. 
The fields on either side of the sheet, shown in Figure 16.3, are related by [209, 227]) 


V a = F + (D -x) + F_x 


(16.34) 
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/ 


1 te!g> / 


Figure 16.3 When a sheet of charge is injected into the AK gap, the fields F ± on each side of the charge differ by an amount proportional to the area 
charge density a of the sheet. The position of the sheet is x(f). 


where F_ is the field between the cathode surface and the sheet, and F + between the sheet and anode: the two generally 
are not equal, but can be related further by applying Gauss's law to the sheet (aka the Guassian pillbox argument [99]). 
For an arbitrary sheet charge density a that argument results in 

-o = F + -F_ (16.35) 


The signs are because the fields due to the sheet point in opposite directions and so on the - side the force and the normal 
are antiparallel. Solving Eqs (16.34) and (16.35) simultaneously gives 


F + (x{ 0) = F 0 + tlx{f) (l + (16.36) 

F_(x(t)) = F g - q ^(l- -► F^- (16.37) 


where the arrows correspond to a -*■ a g . A tricky point is that the force under which the sheet moves (i.e., under which 
x evolves) is the average of F + and F_ (not their sum, as perhaps thought: in the limit of vanishing a the fields on each 
side are equal to F and the field on the sheet is likewise F), and so 


d 2 - F F _ 

ii-x(f) = + —x(t) 

dt 2 2m mD 


(16.38) 


Just as equations of the form d 2 x = -co 2 x bring to mind harmonic oscillators and sines and cosines, so too do equations 
of the form dfx = m 2 x bring to mind hyperbolic sines and cosines. One therefore anticipates solutions of the form 
x(t) = A cosh(af) + B sinh(flf) + C. When x(0) = 0, then C = -A. Because the initial velocity is assumed to be zero, B = 0. 
And lastly, so that Eq. (16.38) holds, A = D/2 and a 2 = F 0 /mD = \/2/v The general solution is therefore given by 


x (0 = | (cosh (\/2^ J - 1^ 
The transit time t is then found by x(t) = D, or 

t = —cosh _1 (3) = 1.2465 t 0 

V2 


(16.39) 


(16.40) 


That is a fairly astonishing result: the difference in transit times between an electron crossing the AK gap on its own (r 0 ) 
versus traveling with so many friends that an infinitesimally thin sheet density reduces the field at the cathode to zero 
is a difference of about 25%. The uncertainty in about every other aspect of electron emission, from approximations to 
the Gamow factor and uncertainty about the electron trajectories, to work function and crystal face variation, and then 
on to statistical variation in field enhancement factors, makes the overall uncertainty in predicting the current density 
of physical electron sources such that 25% seems a bargain. It is therefore not too far from reasonable to take the transit 
time as something simple like t 0 and devote one's effort to better approximating current density. 
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A General thermal-field-photoemission equation 


In novafert animus mutatas dicere formas 
corpora; di, coeptis (nam vos mutastis et illas) 
adspirate meis... 


- Ovid 1 


In Ovid's Metamorphosis, or the Book of Transformations, the Roman poet fashioned from the mythologies of earlier 
times and writers something at once familiar and yet new. In the same way, the emission equations can be refash¬ 
ioned in a manner that - in the appropriate limits - transforms into the canonical equations of thermal (Eq. (12.3)), 
field (Eq. (13.25)), and photoemission (Eq. (14.16)). A single equation that becomes the various emission equations 
in the relevant limits is made possible by a reconsideration of the transmission probability D(E) and how it behaves in 
conjunction with the supply function /(£), from which the boundary between the thermal and field regimes naturally 
emerges. 

Is such a single equation really useful? After all, the fields on a dispenser cathode are three or four orders of magnitude 
lower than for field emission cathodes, and thermal or field emission is undesirable for a laser-switched photocathode's 
operation, so are these equations really seen operating at the same time? Consider two instances. 

Measurements of a low work function Ba-dispenser cathode (a thermionic emitter) with different metal and 
metal-oxide top layers were made by Geittner et al. [228] and are reproduced in Figure 17.1, in which emission from a 
tungsten (W-I) cathode is compared to that from a cathode containing scandium (Sc/Re-I). The current as a function 
of anode potential shows a change of behavior in keeping with a switch from a thermal characteristic (ln(7) <x y/~F) to 
a field emission characteristic (ln(Z) <x 1/F). Even better, and shown to guide the eye, the dashed (thermal) and solid 
(field) curves are based on the Richardson-Laue-Dushman (RLD) and Fowler-Nordheim (FN) equations, respectively, 
using the following methods, required because the measurements are in terms of an anode potential ( U) rather than 
a macroscopic field (F 0 = U/D). Experimentally, field enhancement factors relate F 0 to U, and are often called beta 
(P) factors (Section 30.1). The conventional reference to field enhancement factors by p is another cruel joke of 
nomenclature by employing an already overworked symbol. Even worse, experimental beta factors (which convert 
potential differences to fields) have units of length whereas theoretical beta factors (which compare microscopic to 
macroscopic fields) are dimensionless. For the moment, let the beta factors be of the experimental kind, and subscripted 
by whether they are to be used in the thermal (p r ) or field emission (p f ) equation. 2 The thermal (dashed) lines of 
Figure 17.1 are therefore constructed based on Eq. (13.25) to be 



(17.1) 


where y = sj4Qp r /(k B T), as intuited from Eq. (12.3). For the W line, U 0 = 3 kV and for the Sc/Re line, U g = 1.5 kV, with 
y = 4 kV -1 / 2 for both. The choice of y entails that p r U a = 12.4 eV/pm for W and 17.4 eV/pm for Sc/Re if the temperature 
is 300 K. The field (solid) lines are constructed using 



(17.2) 


where B = (4/3 hpf)\j2m ® 3 , as intuited from Eq. (13.25) for current density. 3 For the W line, U 0 = 5 kV, B = 65.2 kV, 
and 2 - v = 1.227 (® = 4.5 eV and p f = 1 nm” 1 ), and for the Sc/Re line, U 0 = 3.5 kV, B = 33.8 kV, and 2 - v = 0.6615 

1 "My intention is to tell of bodies changed/To different forms; the gods, who made the changes, /Will help me - or so I hope..." Ovid, Metamor¬ 
phoses (trans. R. Humphries). Bloomington: Indiana University Press, 1955, p. 3. 

2 Later, the distinction between experimental and theoretical beta factors will be denoted by referring to the former with a subscript, as in fi . 
3 This approach presumes that the emission area is constant, which is not true; the impact of emission area is considered in Section 30.3. 
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equation; dashed lines correspond to a Richardson-like equation. Experimental data digitally extracted from Figure 2 of ref. [228]. 



Field [MV/m] 


Figure 17.2 Dark current measured from the CsK 2 Sb photocathode in the Los Alamos high-brightness accelerator FEL (HIBAF). The solid i 
on a conventional FN-fit. Based on Figure 5 of ref. [230]; numerical data courtesy of P.G. O'Shea (UMD). 




(0 = 1.5 eV and p f = 0.37 nmr 1 ). Clearly, both thermal and field emission current components contribute to the total 
current from a low work function dispenser cathode if the anode potential is sufficiently high. 

Measurements of a photoemission cathode were undertaken in 1991 on the Los Alamos High Brightness Free Elec¬ 
tron Laser (HIBAF), which employed a photoinjector to replace the thermionic cathode. The switch from a thermal to 
a photoemitter was motivated by the desire to achieve a lower emittance and brighter beam (Chapter 33), and thereby 
enable the production of micropulses containing 10 nC of charge per bunch. Dark current (current that existed when 
the laser was off) was found, and argued to arise from field emission from the CsK 2 Sb photocathode [29, 229]. A com¬ 
parison of that data, shown in Figure 17.2 and compared to a microscopic model of field emission based on the point 
charge model of Section 30.3 applied to that photocathode material, shows that hypothesis to be warranted [230], A 
slightly faster approach is to show that the data lies along a Fowler-Nordheim fit, as the solid line on Figure 17.2 
shows, the parameters of which are reasonable for the work function, the presumed size of the protrusions leading 
to the field emission current, and the fact that a fine polishing of the photocathode surface eliminated the field emission 
current. 

So indeed, field emission, photoemission, and thermal emission effects can appear simultaneously. Having one 
equation that simultaneously treats them is practical beyond its economy. 
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17.1 Experimental thermal-field energy distributions 

When we mean to build, 

We first survey the plot, then draw the model... 

- William Shakespeare 4 

The consideration of field emission from hot metal bodies is a good place to explore the transition from cold to hot 
emission. Refractory metals like tungsten and molybdenum can take punishing temperatures that melt other metals, 
and so making field emitters out of them is prudent: a heat treatment is often required to prepare them for high field 
operation, during which contaminants like carbon and absorbed gasses are removed before measurements are made 
[175]. These metals therefore are useful to examine the energy spread of the emitted electrons as the anode potential 
is progressively raised for a range of temperature conditions. The detailed analysis of experimental data performed by 
Gadzuk and Plummer [96] on the total energy distribution curves for a tungsten emitter held at 1570 K and subjected 
to progressively higher anode potentials is considered here, although ref. [154] gives a thorough account of Schottky 
emitters (field emission from hot emitters) and ref. [160] examines the energy distributions of carbon nanotube emitters. 
In the transition from thermal to field emission, the migration of the hump in the energy distribution peak moves from 
near the top of the emission barrier p + <fi for thermal emission down to the Fermi level p for field emission in a manner 
suggested by Figure 2 of Dolan and Dyke [192] and in Section 13.4. In particular, the work of Gadzuk and Plummer is 
synopsized in Figures 17.3 and 17.4. 

Relating those findings to the treatment of current density here, however, is hampered by the distinction between 
total and normal energy. The x axes of Figures 17.3 and 17.4 are of total energy E, and therefore differ from the normal 
energy E x into the barrier (called W by Gadzuk and Plummer, following the treatment of Young [231], but updated for 
modern sensibilities by Forbes [232]) that forms the basis of the emission equations approaches developed herein as in 
the supply function of Eq. (11.4) and the Gamow factor 0(E X ) in the transmission probability D(E X ) in the integrand of Eq. 
(11.3). E and E x are related by E = E x + E L , where E x = h 2 k 2 x /2m and E x = h 2 (k 2 + kf)/2m. In terms of the total energy 
E, the current density can be written as 

J(F,T)= [ j'(E)dE (17.3) 

Jo 

Thus, /(B) is what is plotted in Figures 17.3 and 17.4. The total energy distributions bear a resemblance to the normal 
energy distributions that lay behind Eq. 11.3. Such a viewpoint is useful for understanding effects related to band 
structure [96, 165], so understanding the distinction between total energy versus normal energy formulations is 
important. 

The total energy distribution is best approached from the viewpoint afforded by the precursor to Eq. (11.1), rewritten 
as (including electron spin) 

J(F,T)= (^ / ( v) D(E x )f PD (E)dk (17.4) 

where E = E x + E ± = h 2 k 2 /2m. Instead of using k x and k L , consider instead the magnitude k and the polar angles, so that 

J(F,T)= (^fi J k 2 dk {^) f Fo(E )D t(E) (17.5) 

r /2 

D t (E) = / D(Ecos 2 0) cos 9 sin 9d9 (17.6) 

where integration over the azimuthal angle dtp has been performed, the "total energy" transmission probability D t (E) 
has been introduced, and the cos 9 from k x = k cos 9 is moved to the D t integral. Observe that in contrast to the normal 
energy distribution, where the Fermi-Dirac distribution was modified to obtain the supply function (i.e., f FD (E) -* f(E x )), 
in the total energy distribution the modifications flow to the transmission probability (i.e., D(E X ) -*■ D t (E)). Making the 
substitution s = E cos 2 0, it follows ^ 

D t (E) = ~~ f D(s)ds (17.7) 

An Jo 


4 Ref. [37]: The Second Part of King Henry the Fourth, I.iii.41-42, p. 713. 
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:o field emission (peak near 

E - n a 0 eV) for a tungsten field emitter held at a steady temperature of 1570 K. Gadzuk and Plummer offered a = 8 eV, <6 = 4.8 eV as the values 
giving the best fit to their theoretical analysis. Plot numbers refer to the anode potential (in volts). Based on Figure 2a of ref. [96]. 



- 2-101234 
E-E f [eV] 

Figure 17.4 Same as Figure 17.3 but normalized and not on a log scale. 


From Eq. (17.4) the integrand/(E) behind Figures 17.3 and 17.4 is 




mq \ ED t (E) 


(17.8) 


Just as the supply function/(E) could be analytically found, so too can D t (E), depending on the form D(E X ) takes, and its 
derivation shall be undertaken next. 


17.2 Theoretical thermal-field energy distributions 

Again, to the intermediate some extremes show a certain likeness, as that of rashness to courage and that of prodigality to liberality; but 
the extremes show the greatest unlikeness to each other; now contraries are defined as the things that are furthest from each other, so that 
things that are further apart are more contrary. 

- Aristotle 5 


5 R.P. Hardie and R.K. Gaye, Aristotle Nicomachean Ethics Book II, Great Books of the Western World, ed. Robert Maynard Hutchins, Vol. 9, Aristotle 
n. Chicago: Encyclopaedia Britannica, 1952, p. 354 (Book n, Chapter 8). 
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A description of the so-called "thermal-field" conditions and their evolution was theoretically challenging (e.g.. Cutler 
and Nagy [233]). Gadzuk and Plummer based their analysis, as did Murphy and Good [110] before them, on the use of 
the Kemble form of D(E X ) given by [112] 


where 9{E X ) is the Gamow factor (Section 11.2). Near the Fermi level E x m p, 9 is large, and the Fowler-Nordheim 
equation follows. Conversely, near E x » p + (p, 9 is small, and D(E X ) is comparable to unity so that the Richardson equation 
follows. The Kemble approximation is useful in that an approximation to it based on its extremes yields a convenient 
representation of its middle, the extremes being the form of the transmission probability D(E) in the thermal and field 
limits. Note the guarded phrase "convenient representation": the approximation that shall be used here to capture the 
total energy distribution behavior of the previous section will not in fact be good, but it will suggest a path to get to an 
approximation that is. The first step to doing so, familiar from the derivation of the Fowler-Nordhiem equation, is to 
consider a linear form of the Gamow factor expressed in an unusual way by 


6(E) » 9(E m ) + 0'(E m )(E - E m ) = P P (E C - E) 


(17.10) 


where p p and E c are defined by this equation: as per best practice for numerical approximation, the Taylor expansion 
point E m is where the integrand is maximum. Two new terms p F and E c are introduced. E c functions as the point where 
D(E C ) = 1/2 in the Kemble approximation if 9(E) were linear in energy, 6 and p F acts as an energy slope factor, much 
like p T = l/k B T : to emphasize the distinction, it is given an F subscript. Thus, p P acts as the energy slope factor for 
the transmission probability D(E) much like p T acts as an energy slope factor for the supply function f(E) accounting 
for temperature effects. When Eq. (17.10) is used for the Gamow factor 9(E) in the Kemble approximation, it shall be 
referred to as the linearized Kemble approximation. A complication that will eventually command attention is that the 
integrand maximum changes location along the E axis as conditions change, therefore these "constants" are not constant, 
but depend on field and temperature, and as a result shift with conditions. 

For field-like conditions, the factors f) p and E c are as follows. The behavior of the Gamow factor 9(E) for the triangular 
barrier in Eq. (13.1) is modified by the image charge and leads to Eq. (13.11) but suggestively rewritten as 

+ (17 ' n) 

Therefore, it is easy to identify (using the definitions of B and C from Eqs (13.6) and (13.7)) that the linearized Gamow 
factor 9(E) for E k p 


p P *t(y) 
2 v(v) 


(17.12) 

(17.13) 


It is worth emphasizing that E c is not the same as the location of the maximum of the current density integrand, but off-set. 
For thermal-like conditions, the factors P F and E c change, but so as not to confuse the narrative, that discussion will wait 
until Section 17.3. 

The important point is that as E approaches and then passes from below the barrier (E < p + <p) to above it (E > p + <p), 
9(E) likewise appears to be driven to zero, but how it behaves for E above the barrier maximum has not yet been 
adequately dealt with. However, by demanding that 9(E) and its first derivative be continuous across E = p+cp, the 
linear form p P (E c - E) results, and as E increases, it becomes more negative. As a result, the linearized Kemble form of 
D(E) -> 1 as it should when the energy significantly exceeds the barrier height. Such a treatment may seem unsettling, 
but it is not without grounds: although the Gamow factor vanishes, the Kemble form is an approximation to the solutions 
to Schrodinger's equation that do behave in a manner mimicked by the linearized Kemble form. To see why requires 
the numerical solutions of Section 18.1, but until such time, Eq. (17.11) using Eqs (17.12) and (17.13) even for energies 
above the barrier maximum will have to be the comic premise, 7 although they change in the transition region and especially 


6 ...which it is not, and more on that below. 

7 In theater and humor, the comic premise is a paradoxical or unrealistic setting that must be accepted for the drama or joke to unfold, for 
example in the gag "A priest, a rabbi, and a duck walk into a bar, and the bartender says 'What is this, some kind of joke?'" requires suspension 
of disbelief that characters not normally together frequent establishments they do not normally patronize (even with a duck). 
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in the thermal regime. Usage of Eq. (17.11) in the linearized Kemble approximation allows D t {E) to be found, perhaps 
with effort, to be 


D t (E) = - 


When P F E C : 


2p P E \ l+e-Mc J 
> 1, the denominator of the fraction within the logarithm is approximately unity, and so 


f(E) = 

A trained eye will see Figure 17.3 emerge, but tc 


/ mg \ ln(l+e-^- g )) 

\ 2p p jr 2 h 3 ) ePAE-v) + i 

assist the imagination, examples help. 


(17.15) 


EXAMPLE: Find j'(E) in units of eV-fs-nm-q for contemporary tungsten-like parameters of p = 11 eV and <t> = 
4.5 eV, and for a fixed temperature of T = 1570 K with E = Xfi. Assume p F and E c values for field emission conditions 
and use the Forbes-Deane approximation for v{y), but for simplicity let t(y) = t(y 0 ) = 1.06131. Consider only the cases 
F = 1.0 eV/nm, 3.5 eV/nm, and 4.5 eV/nm. 

SOLUTION: Evaluation of Eq. (17.12) reveals = 32.068/FandE c = 11 + 2.8267v(y) withy = 0.26666 y/p. Thus, 
for 1 eV/nm, 

/(xp) = 0.0456In (l + (? 243(X ~ L24) ) / (l + e 81 - 3 * r " 1) ) 

for 3.5 eV/nm, 

= 0.155ln (1 + e 71 ' 5 ^ 118 )) / (l + e sl ^) 

and for 4.5 eV/nm, 

) = 0.1981n(l +e 561(x - 116) ) /(l -He 81 - 3 ^) 


A plot of the three equations in the above example (plus one more for good measure) gives a feel of how the total 
thermal-field energy distribution evolves as a function of field in Figure 17.5. Note in particular that when F = 3.07 
eV/nm then p F « p T . 

The difference between the total energy distribution and the normal energy distribution is then manifest in the dif¬ 
ferences in the integrands before the final integration (over E in the total energy distribution and over E x in the normal 
energy distribution). The integrands are 


fnorJE x ) = ( 


_qm_\ 
2n 2 ti i ) 


In (1 + 
p T ( 1 + 


(17.16) 



Figure 17.5 Evolution of the total energy distribution j' m (P) for various fields using contemporary tungsten-like parameters /t = 11 eV and 4> = 4.5 eV at 
a temperature of T = 1570 K in the linearized Kemble approximation. Note the similarity to Figure 17.3. The line labeled "3.07 eV/nm" corresponds to 
fir = h- 
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Figure 17.6 The energy distribution integrands for the normal (Eq. (17.16)) and total (Eq. (17.17)) distributions for the values of Gadzuk and Plummer 
of n = 8 eV, 4> = 4.8 eV, and room temperature (300 K) conditions. 



(17.17) 


where the first equation is for the normal energy distribution and the second is for the total energy distribution: observe 
the subtle symmetries between the two. A comparison of their behavior as a function of their argument is shown in 
Figure 17.6 for room temperature conditions: as the temperature drops, the total distribution becomes higher and sharper 
near E « f compared to the normal distribution (which peaks at a lower energy), giving a small advantage to the more 
gentle normal distribution for numerical evaluations. Having shown what we have set out to, we can return to consid¬ 
erations of the normal energy distribution approach that forms the basis of the earlier sections. 

Understanding or visualizing the migratory behavior of the normal distribution integrand D(E x )f(E x ) is necessary before 
formulating a general thermal-field equation. An algorithm to do so is developed in Section A3.8. Although the temper¬ 
ature is that used by Gadzuk and Plumer, the analysis will relax back to the preferred copper-like values of f = 7 eV and 
® = 4.5 eV. A waterfall plot for 6 eV < E < 10 eV for values of F equispaced between 2.4 eV/nm and 3.6 eV/nm in steps 
of 0.2 eV/nm is created by the algorithm and shown in Figure 17.7. The transition region between thermal and field 
emission is evidently represented by the broad line near F = 3 eV/nm with a width almost spanning the energy range 
F < E < f + cj). 

With the Section A3.8 algorithm and a free afternoon, different circumstances can be investigated. At room tempera¬ 
ture, the transition region drops to a much lower field and the curves sharpen, a characteristic of "cold" field emission. 
As the temperature increases, the transition region rises to higher values of F and the distribution becomes less abrupt, 
developing softer declines away from the peak (visually, the yellow-green bands narrow). But it is the middle condition 



Figure 17.7 The migration of the normal energy distribution of thermal emission to field 


emission for parameters mimicking Figure 17.3: a metal-like emitter with p = 7 eV and <I> = 4.5 
eV is held at 1570 K. A line for each D(E x )f(E x ) is shown for 6 eV < B x < 10 eV with values of F 
equispaced between 2.4 eV/nm and 3.6 eV/nm in steps of 0.2 eV/nm, with the lower field values 
closer and the higher values to the rear. It is seen the behavior of Figure 17.4 is captured. 
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6 8 10 


Energy [eV] 

Figure 17.8 Same as Figure 17.7 but densely spaced and viewed top-down. The color calibration bar to the right is as used for the previous plot. 
Migration from thermal to field conditions is demarcated by the bar of red near Fa 3 eV/nm: the peak initially centered near E a /j + ip (right-hand 
side for F a 2.4 eV/nm) broadens and then coalesces again near E a n (left-hand side for F a 3.6 eV) as the field increases. 


when p p « p T that corresponds to the red bar in the energy range p < E < p + 4> (where p = 7 eV and 4> « 2.4 eV) and 
Fa 3 eV/nm in Figure 17.8. 

The transition region is nicely parameterized by comparing the beta factors governing how the integrand in D(E x )f(E x ) 
falls off: the energy factor P T captures the fall-off of f(E x ), while P F captures the fall off (as E x decreases) of D(E X ). They 
can be compared. Consider the ratio 


(17.18) 


Taking p F as from Eq. (17.12) (i.e., n a TiF/2k B T\/2mm(y)), and letting t(y) a t(y 0 ) = 1.06131, then n a 0.320412F if F is 
measured in eV/nm. In that case, the red bar in Figure 17.8 denoting the transition between thermal and field emission 
conditions, seen to occur near F = 3 eV/nm, is characterized by the condition n a 1. That is significant: when the energy 
slope factors are comparable, the transition from one regime to the other is initiated. 

The Pp up to now is as given by Eq. (17.12), corresponding to a Taylor expansion of 0(E X ) to first order about E x = p, 
that is, the linearized 0(E X ) is the tangent line at E x = \i. For field emission, this is close to where the maximum of j' norm (E x ) 
occurs. However, as n -*■ 1, the maximum occurs between p and p + 4>, and failure to account for that reduces the accuracy 
of the approximation to J(F, T). When emission becomes thermal-like, p F should be found for 0(E X ) linearized near the 
barrier maximum \i + 4>- Near there the image charge barrier V(x) is approximately parabolic, and 8(E) can be evaluated 
in a procedure similar to that leading to Eq. (11.20). For a potential of the form V q (x) = V 0 - a(x - x 0 ) 2 such that x 0 is the 
maximum of the image charge potential in Eq. (11.25) and a is such that d 2 V q (x 0 ) = d x V imaae (x 0 ), and so (recalling that 
4> = O- i/4 QF) 

0 qu JE) = + 4> — E) (17.19) 

from which the p F and E c factors are easily seen to be 

< 17 - 20) 

E c an+4> (17.21) 


for thermal emission conditions. This has consequences for the values n and s (and even u) take on in the general emission 
equation. Representative values will be considered in Section 17.3 below. 

Before taking leave of the energy slope factors, return to the passing observation above that the image charge barrier 
looks parabolic at high field: it is more than that. When the energy E is near the barrier maximum n + 4> for the image 
charge potential, the barrier apex resembles the quadratic barrier generally, regardless of field (a point made in Section 
13.2 when the functions v(y) and t(y) were introduced, and which anticipated the present treatment). But this implies 
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that p p for the image charge barrier and the quadratic barrier should be related in the limit that E x -»■ p + cp. This can be 
shown analytically: in fact, the ratio Eqs (17.12) and (17.20) is 


Pp(image) _ n /QF\ 1/4 
p P (quad ) 2f(y) VO 2 / 


(17.22) 


In the limit that E 
results in 


p + (p, then y in t(y) approaches unity, or 



a/4QF^ ®. 
1.1107 


Substituting in the limiting field F = 0 2 /4Q 
(17.23) 


which is known to be true as the limiting case [196] (and even well approximated by the Forbes-Deane formula of 
Eq. (13.24), which gives 10/9 = 1.1111), as the adventurous in spirit can show using Eq. (13.19). It can also be shown 
numerically, as in Section 17.4 below. 


17.3 The N(n,s,u) function 


There are three species of government: republican, monarchical, and despotic.. .we must now inquire into those laws which directly 
conform to this nature, and consequently are the fundamental institutions. 

- Montesquieu 8 

That the thermal, field, and photoemission canonical equations of Chapter 11 were all obtainable from a master 
equation suggests that the three kinds of emission represent three domains of an underlying function which also merit 
inquiry. The master equation is 

q r°° hk y 

J(F,T,eo)= — — ~D(E y + EJf(E r ))dk r (17.24) 

2n Jo m 

= A-/ D ( E X + EJf(E x )dE x (17.25) 

2 nh Jo 

where the domains are specified by the forms that D{E X ) and f(E x ) take in either high or low T and F limits, and whether 
or not there was a photon term present (E a = 0 when there is not). Using the Kemble form of the transmission probability 
D(E X ) (Eq. (17.9)) and the supply function f(E x ) (Eq. (11.5)), Eq. (17.24) can be more compactly written as [85, 202] 


J(F, T, oo) = A rjd T 2 N P F (E 0 - p), p P E 0 ] 


(17.26) 


where E g is the generalization of E c in Eq. (17.13) to accommodate photons, more about which in Section 17.3.2. With 
the introduction of the important quantities 


n = p T /p F (17.27) 

s = p P (E 0 -p ) (17.28) 

u = p p E 0 (17.29) 

where E 0 = E 0 (E m ) is understood to be evaluated at the integrand maximum E m , then the N function N{n, s, u) is an integral 
of the form 

/.eo 1 + e z 

where the integration variable z arises from p p (E - p), the denominator in the integrand arises from the transmission 
probability, and the numerator arises from the supply function. The three domains are specified by whether s is positive 
or negative and, if positive, whether n is greater than or less than 1. 


8 Charles de Secondat, Baron de Montesquieu, The Spirit of Laws (trans. Thomas Nugent, rev. J. V. Prichard), Kindle eBook, 2011, Book II, Section 
1. Montesquieu went on to advocate three branches of republican government (executive, legislative, and judicial) in Book XI.6. Many things 
of profound interest come in threes. 
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There have been heroic efforts to carefully approximate similar integrals, for example by Guth and Mullin [190] (who 
also study the Fowler-DuBridge function of Eq. (14.9)). Any time spent crafting analytical approximations to Eq. (17.30) 
reveals it to be rather ghastly: its integration range is such that the usual recourse to Taylor expansions of ln(l +x) and 
1/(1 + x) (Section Al.2.1) abuts the divergent series, causing the crimes against intuition treated in Section 6.3: the case 
n = 1 is frustrating as it is there that the divergent series reveal their "devilish" nature, but that is the very point of 
greatest interest in investigating the transition, and coincidentally the region that brought ruin to the Millikan-Lauritsen 
hypothesis of Eq. (13.31). One must proceed cautiously. The first step is to appreciate the variation of parameters involved 
as described by Eqs (17.12) and (17.20). 

Each of the parameters n, s, and u depend on p p , and as there are different limiting cases of p p , it is evident that some¬ 
thing unusual is going on between those limiting cases. Consider the parameter n from Eq. (17.18) again as it approaches 
unity starting from a high-field, low-temperature condition compared to starting from a low-field, high-temperature 
condition: the ratio of the field-like n (Eq. (17.12)) to the thermal-like n (Eq. (17.20)) is 


tz(fLeld) = fVjjj 
K(thermal) 4 t(y) 

As with Eq. (17.23), as y -► 1, the two become the same, 9 as evident in the pinching of the contour lines in Figure 17.9. 
That a difference exists between the n limits is a wrinkle that will need revisiting below, but for the development of 
a general thermal-field equation, what it means here is that before approximations can be invoked, an understanding 
of the values n, s, and u may take on is needed. The limiting cases are field and thermal emission conditions, and so 
representative conditions are taken for each: 

• For field conditions: If the forms of fi p (Eq. (17.12)), E 0 (Eq. (17.13)), and B and C are from Eqs (13.7) and (13.6)), 
respectively, then 


Ck B Tt(y ) 2\J 2m<&k B Tt(y) 


B . . Ay/lm<& , , 

s =p v(y)= ^n^ v(y) 

u = |c(y) + ^pt(y) 


(17.33) 

(17.34) 


where as usual y = \/4QF/O in the Nordheim functions v(y) and t(y). 



Field [eV/nm] 

Figure 17.9 Boundaries defined by n = 1 using field-like (F-like; Eq. (17.12); bluish region) and thermal-like (T-like; Eq. (17.20); reddish region) 
energy slope factors. The intermediate regime (trans; white region between two n = 1 lines) corresponds to where n = 1. The two black dots correspond 
to typical operating temperatures for thermal (top left: 1273 K, 1 eV//<m) and field (middle right: 300 K, 5.1734 eV/nm) conditions treated in the two 
examples below. 


3 To prove this requires Eq. (13.22), as Eq. (13.24) is only approximately correct at y = 1. 
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• For thermal conditions: From the forms of p F (Eq. (17.20)), E 0 (Eq. (17.21)), and the terms B and C, then 

n = h ( 4f 3 ^ 1/4 
n ~ 2nk B T \ Qm 2 ) 

s = P F (® - y/4QF) = P F <P 

u = p P (p + <p) 


(17.35) 

(17.36) 

(17.37) 


Under these approximations, the typical values are found in the subsequent examples using the following limiting case 
formulae. 


EXAMPLE: Field-like conditions: 

Consider copper-like parameters (g = 7 eV, ® = 4.5 eV) at room temperature (T = 300 K) such that y =y 0 = e~ l/2 . 
This chooses a value of field ( F = <t> 2 /4 eQ = 5.1734 eV/nm) typical of field emission yet simplifies v(y) and t(y) when 
using Eqs (13.23) and (13.24). 

SOLUTION: For the chosen value of F, C = 4.2014 eV J , B = 65.207 eV/nm, t(y) = 1 + (l/6e) = 1.0613, v(y)= 1 - 
(7/6e) = 0.57081 and therefore p F = Ct(y ) = 4.4590 eV -1 . For room temperature, p T = 38.682 eV -1 . Substituting, it 
is found n = 8.6749, s = 7.1946, ns = 62.413, and u = 38.408. 


EXAMPLE: Thermal-like conditions: 

Consider parameters typical of a barium dispenser cathode (g = 1 eV, ® = 2.0 eV, T = 1273 K) for a large field (by 
thermionic cathode standards) of F = 1 eV/pm, corresponding to an electric field of 1 MV/m. 

SOLUTION: For the given temperature and field, p T = 9.1159 eV~\ p F = 2217.0 eV' 1 , cp = ® = 1-9621 eV, 

and therefore n = 3.7388 x lO" 3 , s = 4349.8, ns = p T cP = 17.886, and u = 19869. 


What is seen from these examples is that thermal conditions are characterized by very small n and field conditions 
by large n, but also that s is large and ns very large for field conditions, whereas ns is large and s very large for thermal 
conditions. These observations will decide which terms to keep in an expansion of the integrand terms in N(n, s, u) when 
discussing thermal and field emission. 

What about photoemission? In the treatment leading to the one-dimensional Fowler-DuBridge theory of Chapter 14, 
the approach was to augment the energy E x in the transmission probability by E w = hw, or D(E X ) -*■ D(E X + E m ). Careful 
attention to how s is defined reveals the presence of photons causes fi -*■ \i + E m , or s(photo) -*■ P F (E g — n — E m )); coupled 
with E 0 < cp and E m > <p, photoemission entails that s is negative. Therefore, consider the positive and negative regimes of 
s separately. 


17.3.1 The s > 0 regime 

The scientific approach to the examination of phenomena is a defense against the pure emotion of fear. Keep tight hold and continue while 
there's time. 

- Tom Stoppard 10 

To tame difficult integrals, it is often useful to separate them into manageable parts by either subtracting out a dominant 
(and analytic) contribution or partitioning the integration region into sections where Taylor expansions can and do lead 
to convergent series. Both techniques are useful here [202]. Below, this will amount to 

N = N x +N 2 + N 3 +N 4 (17.38) 


10 Tom Stoppard, Rosencrantz & Guildenstern are Dead, New York: Grove Press, 1967, Act 1, p. 17. 
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where the Nj shall be defined below, and which rely on tricks such as ln(l + e*) = x + ln(l + e~ x ). Given how s and ns 
behave, matters shall be set so that N x and N 2 dominate when field emission is ascendant, and JV 3 and JV 4 when thermal 
emission is ascendant, therefore refer to the integrals that will arise as either "field-dominated" or "thermal-dominated". 
The field-dominated regions matter most when n > 1 and are characterized by z > s. They are 

NAn,s,u) = n f (17.39) 

Js eZ + 1 
r u In [1 + e *>1 

NJn,s,u) = n / —-- -dz (17.40) 

Js e z +1 


Similarly, the thermal-dominated integrals matter most when n < 1 and ai 

In [ 1 + e n( - z ~ s '>\ 

NAn,s,u) = n / —-- 

3 7-cc * +1 

f u In [l + 

N 4 in,s,u) = nJ o l gZ+1 


re characterized by z < s. They are 
-dz 

dz 


(17.41) 

(17.42) 


The choice of the integrand and the integration region is to a definite purpose each time. In the case of N lr the integral is 
analytic. In the case of the others, use can be made of familiar series expansions (Section Al.2.1) of the terms behaving 
as ln(l + x) or 1/(1 + x), where x stands in for an exponential term (e.g., & and e ±n<z ~ s) ) (an analogous approach formed 
the basis of Guth and Mullin's treatment [234]). The integration regions then allow for a term-by-term integration of the 
series expansions over regions where the series are convergent. 

An important approximation concerns u: when u » 1 then the upper limit of the integral can be extended to oo, 
thereby reducing the number of terms that survive in the series expansions that arise and making for much easier 
formulae. Practically, it means that N{n, s, u) » N(n, s, oo) = N(n, s) and likewise with the N jr and that approximation shall 
be assumed henceforth. 

Two integrals enable finding N^s), and they are 


! JtT = ln (! + e_S ) < 17 - 43 ) 

Js e + 1 

/ = u x ln ( 1 + e ~ s ) ( 1 7 - 44 ) 

where U(s) is the Fowler-DuBridge function from Eq. (14.9). To show it, make the substitution z = — ln(x) in the first 
integral, and use integration by parts in the second. It is then found that 


JVjfK.S) 


n 2 U(-s) 


(17.45) 


a compact result, but given that the other three integrals will be given in terms of a series representation, it is more useful 
to express N 1 as a series itself. The next example concerns finding that series representation. 


EXAMPLE: Find the series expansion of N^n^). 

SOLUTION: From Eq. (A1.5) followed by an integration of each tern 

l in the series. 


N l (n,s) = n 2 [ dxT(-iy +] 

y* 

(17.46) 

J~ °° i=i 

j 


;=1 J 


(17.47) 

Observe that N x {n, 0) = n 2 f(2)/2. The result is reminiscent of the polylogarithm function discussed ii 

i conjunction 

withEq. (14.13). 
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The other Nj entail double summations, one for each expansion using Eqs (A1.3) and (A1.5) for 1/(1 + x) and ln( l + x), 
respectively, after which the integration over the exponentials may be performed using 

£ e- ax dx=^e~ as (17.48) 

In an exercise requiring attention to bookkeeping, a straightforward integration and simplification for each of the Nj 
result in 


N 2 (n,s) 

N 3 (n,s) 

N 4 (n,s) 


”11 


(-1 f+ie- 1 * 
j(jn + k) 

(-1 ) k+ ie-> m 


sU w«+*) 

w yy { ~ 1)k+i (e-Xt-e- 
H H Kin - kr 


(17.49) 

(17.50) 

(17.51) 


where the limits of the k summation for N 2 differ from the summations for the other Nj\ this is a consequence of having 
to use 1 /(e? + 1) = e~ z /(\ + e~ z ) when z > 0 in N 2 and 1V 4 , but not iV 3 . It is more useful to separate out the k = 0 part of 
the summation so that when one combines N l through N 4 it is found that 

N(n,s) = y -—^— (n 2 e~ is + e~ ]m ) 

+ 2 ” 2 11 ^ (17 - 52) 


Before continuing, notice a feature of importance: the denominator ( n 2 j 2 - k 2 ) vanishes whenever n = k/j, which appears 
to reveal a singular (infinite) term in the summation. Note, though, that at the same value of n, then by L'Hopital's rule 
governing the ratio of two differentiable functions that vanish at the same point, when n j/k then 
e -ks _ e -jm 5(2 + s(k +jn ) _s__s^_ 

n 2 } 2 - k 2 2{k+jn) 2k 2 { ’ ' 

There are three important points to be made: 

1. In the quest for a computationally useful approximation to N(n,s), the appearance of infinities for particular values of 
n signals that they are an artifact of the approximation, not of N(n, s ) itself. 

2. In the thermal regime ns is large but s is larger so that e~ s «: e~ ns , whereas in the field regime, s is large but ns is larger 
so that e~ s » e~ ns . Thus, away from n « 1 either the thermal term dominates or the field does, but one of them is 
negligible as n moves away from unity. 

3. N(n,s) is well-behaved as n -*■ 0 and n -> oo, as it must be, with the leading order surviving terms giving rise to the 
RLD and FN equations. 

The last two points figure strongly in the treatment of an approximation to the function E(x) in Eq. (17.65). 

Eq. (17.52) contains a great many terms, but they are of two types: N x and N 2 involve e~ h whereas N 3 and N 4 involve 
e -i n \ Remember now that s and ns are both large, but for field conditions ns is very large and for thermal conditions s is 
very large. More importantly, only the k = 1 term need be kept in the field-dominant series, and only the j = 1 term in 
the thermal-dominant series. Knowing that, and keeping in mind that / and k are dummy labels, so that when they are 
swapped, a - sign is introduced because of the denominator, it is found that 


H (; 


HS2 1 


pt (j/n) 2 ~ 1 


(17.54) 


In fact, this can be made even more compact: if the function E(x) is introduced (where x is used to indicate a general 
argument, n having been reserved for P T /P p in particular) such that 


2(x)=l+2y 


(~iy +1 

(/ A ) 2 -1 


(17.55) 
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then the elegant result 


N(n, s) = e s n 2 T. 


+ e " s £(«) 


(17.56) 


is obtained, with the first term being the /zeW-dominated term (which will lead to J p ), and the second the 
thermal -dominated term (which will lead to J T ). Recall from Eq. (17.26) that current density is the product of N 
with A^T 2 . Therefore, the "field" and "thermal" J's are defined by 


j f ~ Arj.d^bPt) 2n2 ^\n) e S 
j t = ^ Sifl (^/? r )- 2 E(«)e-' K 


(17.57) 

(17.58) 


The symmetry between the two forms is far better appreciated, however, by putting in the definition of n and rewriting: 

J F = JWMr)“ 2 S (^) exp[-m - m (17.59) 

J P = A^OcbPt)- 2 2 exp [~p T (E 0 - g)] (17.60) 


Now the pleasing symmetry between T and F that Millikan intuited must be there is readily apparent. The compact form 
of the general thermal-field equation for current density, or J gtf (F, T), is given by 


J gtf (F,T) = 


{ 


n~ 2 J p + J T 
J F + n 2 J T 


(«< 1) 
(«> 1) 


(17.61) 


with the helpful mnemonic that n 2 attaches itself to whichever J is smaller in such a way as to increase its contribution. 
Importantly, in the large n limit (T -* 0), J p -*■ J FN , and in the small n limit (F -> 0), J T -*■ J RJD . Eq. (17.61) in its entirety is 
the means by which thermal terms are introduced into field emission, and field into thermal emission, situations which 
matter when n « 1. 

A certain aesthetic attractiveness attends Eqs (17.59) and (17.60) until they are put into practice. The problem is 
because E(w) and E(1 /n) are calculated from an infinite series given by Eq. (17.55), there are values of n for which the 
denominator can vanish, whereas as seen before in the full definition of N{n, s) there are no such singularities. This is an 
aforesaid artifact of the approximation. The solution, therefore, is to find a low-order polynomial approximating E(x » 1), 
as for commonly encountered parameters, either e~ s or e~ ns quickly overpowers Z( 1 / n) or S(«), respectively. The first term 
in E(x) is singular atx = 1, the very region where all four Nj are retained. No "low-order polynomial" approximation exists 
in the vicinity of that singularity. For that case, the summations are regrouped until the problemmatic x = 1 singular part 
is isolated, and that part is separated from the polynomial part. Using the relation 


1 yx 2k 

(j/x) 2 -1 h J 2k 

and the definition of the Riemann f(j) function series of Eq. (A1.8) it is found that 
£(x) = 1 + |(1 - 2 1 - 2 0a2j)x 2 ' 


(17.62) 


(17.63) 


which hardly seems an improvement as f(2j) -*■ 1 as j —> oo: the replacement of a series with a singularity in it with a 
divergent series atx = 1 (bringing to mind Section 6.3) is not obviously wise: as Virgil said, aegrescit medendo 11 . The reason 


u Latin for "the remedy is worse than the disease", the interpretation given to "grown worse with healing" (trans. James Rhoades), The Poems 
of Virgil, Great Books of the Western World, ed. Robert Maynard Hutchins, Vol. 13, Virgil. Chicago: Encyclopaedia Britannica, 1952, p. 355, Aeneid, 
Book XU, line 48. The Romans of Virgil's time distrusted their physicians. 
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to do so is because the offending part may be isolated by 


E(x) = 1 + 2 X * 2j + 2 X ft 1 — 2l ~ 2i ) CGy) - 1} ^ 

= + 2 £ {(: 1 - 2 1 -«) «2» - 1} x* (17.64) 

» + (f(2) " V* 2 + " 8 >* 4 (17.65) 

where now the singular term is isolated and the other terms are small by comparison when x < 1 (or even x a little larger 
than 1, as will be needed). Thus, at n = 1, the singularity of E(«) cancels that of £(] /«) via 

lim{E(«) + 2:(i)} = (4) + 2f(2) - 8 = -0.9220 (17.66) 

It is, however, the behavior of Eq. (17.56) as n -*■ 1 that matters. In the series summation definition of E(») in Eq. (17.55), 
there are two kinds of terms: those that are problematic as n -* 1 (the first term of the summation) and those that are 
not (the rest of the summation), and so for n « 1, 

s(w» 1)= 1 + T^ + 2 ijrzr (17 - 67) 


where in the second summation, n is set to 1 as doing so does not encounter a zero in the denominator. The equation 
continues to hold for n-* 1 jn. The summation term is easily evaluated 


2 y (~iy +I _ y f (~iy +1 (-iy +1 ) 

u i 2 - 1 \ i +1 j - 1 J 


y (-iy +2 . 1 

h j 2 


(17.68) 


and so with the approximation e m « e *(!-(«- l)s). 


_ s (n+ 1 )(n 2 + 1) + s(3n 2 + 1) 
^ 2{n + 1) 


(s+ l)e~ s = lV(l,s) 


(17.69) 


The behavior of N(n, s) for n « 1 is the most important for J GTP , but how the other limits of n -* 0 and n -* oo fare, as 
in the discussion following Eq. (17.52), is important to reassess. How the behavior of J GTF is obtained in the n< cl and 
n » 1 limit given that Z(x) from Eq. (17.64) is a converging series only for x < 1 is the point: J GTF asks for arguments in 
E(x) for x > 1 (i.e., if n < 1 then l/n > 1 and same with n > 1 entailing i/n < 1). Any untoward behavior as a function 
of n that occurs is a consequence of the approximation to £(x). The resolution is understood by being specific about how 
n approaches the limits of 0 and oo when using Eq. (17.65): 

• In the thermal regime, n-* 0 behavior is obtained by driving F -> 0. The polynomial part of the E(1 /ri) approximation 
then diverges as 1/F 4 but it is attached to an exponential part that declines as exp {-a/F p ) where p is on the order of 
unity and a sweeps up all other factors. 

• In the field regime, n -*■ oo behavior is obtained by driving T -*■ 0. The polynomial part of the E(«) approximation then 
diverges as 1 /T 4 but it is attached to an exponential part that declines as exp(-a/T) where a sweeps up all other factors. 

Consequently, the nature of the polynomial part of the approximation to E(x) to become large (as x 4 ) when its range is 
extended to x > 1 is overshadowed by the exponential terms to which it is attached, and so the numerical values given 
by the E(x) approximation continue to work well into the traditional regimes of the canonical equations. The canonical 
equations are thereby recovered with acceptable numerical accuracy. This is additionally demonstrated in Figure 17.10, 
where the field part is always less than the thermal part in the thermal regime, whereas the thermal part is always 
less than the field part in the field regime, by several orders of magnitude. The quantities are evaluated in terms of an 
argument x such that x = F/ 0.05 0 corresponds to n -* 0 and x = T/50 —> 0 corresponds to n -* oo, using parameters 

typical of dispenser cathodes (T = 1200 K and ® = 2.0 eV) or field emitters (F = 8 eV/nm and <J> = 4.5 eV). To reiterate, 
the thermal and field parts of N(n, s) do not become comparable until within the n = 1 regime. 
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Figure 17.10 Behavior of the field-dominated and thermal-dominated parts of N(n,s) (first and second terms of Bq. (17.56), respectively) as a function 
of T = 5Ox K (field) and F = 0.05x eV/nm (thermal), where n(F, T) and s(f) are given by Eqs (17.32) and (17.33) in the field regime, and Eqs (17.35) 
and (17.36) in the thermal regime. 

Speaking of n and s as parameters themselves is, however, misguided. Although p p has been treated as though it can only 
take on one of two forms (from 9(E « p) for field emission leading to Eq. (17.12)), or 6(E « p + <p) for thermal emission 
leading to Eq. (17.20)), in reality, p p , as well as n and s, change with F and T in smooth and continuous ways. N(n,s ) is 
good to approximate J GTP and understand its behavior, but more computationally friendly forms await in Section 17.5, 
and a proper numerical evaluation in Section 17.6. 

17.3.2 The s < 0 regime 

Whatever may he the plays on words and the acrobatics of logic, to understand is, above all, to unify. 


- Albert Camus 12 


Now comes taming the difficult integrals associated with N(n, -s). Proceed in a manner much like that of Section 17.3.1 
for +s. The notation bears emphasis because s is treated as a positive quantity, the situations when a negative argument 
arises are explicitly indicated by —s so that the meanings of the integration limits will always be transparent. As before, 
the integral of N(n, —s) shall be partitioned into a sum of other integrals such that convergent expansions of ln(l + e” (z+s) ) 
(related below to summations over;) and 1/(1 + e z ) (related below to summations over k) are obtained in Eq. (17.30) 
for s —f —s, except that now the limit of u -* oo is assumed from the start. Although the notation shall be similar albeit 
with the introduction of a prime where N'(n,s) = N(n, —s), the terms Nj will not share the same integration region as N, 
for index /. Expansions and careful regrouping of terms, the painstaking details of which are left as an exercise, give the 
following 



(17.70) 



(17.71) 


Make note of whether the summations begin at 1 or 0, reflecting whether the exponential term is smaller or larger than 1. 
Continuing, 



(17.72) 


= n(U(s) ~ 17(0) — sln(2)) 



(17.73) 


2 Albert Camus (trans. J. O'Brien), "An Absurd Reasoning," The Myth of Sisyphus and Other Essays. New York: Vintage Books, 1991, p. 13. 
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and lastly, 



(17.74) 


= n(Z7(0) + sln(2)) 



(17.75) 


As before, let both e s and e m be taken as small, so that the leading order terms need be retained. Regrouping, 



(17.76) 


which should look unsettlingly familiar: the terms involving ~L(n ±l ) are, in fact, N(n,s), and so (using the leading order 
expansion for U(s) » 217(0) - e~ s + (s 2 / 2) in Eqs (14.9) and (14.14)) it is found [202] (where N'(n,s) -» N(n, -s)) that 


N(n, s ) + N(n, -s) = ± n 2 s 2 + f (2 )(n 2 + 1) 


(17.77) 


which is an equation with a certain aesthetic allure. In the case where N(n,s) is small, as occurs when the thermal-field 
terms are small, then the photoemission term N(n, -s ) is, to leading order, the Fowler-DuBridge relation of Eq. (14.14) 
under the conditions that n is very small but ns is large. Because photo excited electrons pass over the barrier p + 4> 
having been excited from near the Fermi level p, and because the energy E x is augmented by the photon energy fico 
in D(E X + E a> ), then (recalling the convention about —s) it follows that —s = p F (p + tp - (p + hco)) or s = p F (hco - <p) for 
photoemission, and so ns = P T {hoo - tp). In both cases of the next example, as with thermionic emission, ns is large, but s 
is very large. Therefore, to leading order, the Fowler-DuBridge result is recovered. 


EXAMPLE: Numerically evaluate n, s, and ns for parameters characteristic of photoemission. Use copper-like values 
(p = 7 eV, O - 4.5 eV) at room temperature (T = 300 K) for laboratory fields (2 kV/cm -*■ 0.0002 eV/nm) and for 
fields comparable to an rf photo injector (25 MV/m -* 0.025 eV/nm). Assume X = 266 nm (hco = 4.661 eV). 


SOLUTION: 


For laboratory conditions, 0 = O - y r 4QF = 4.483 eV. FromEq. (17.20), p F = 7413 eV" 1 . P T = l/k B T = 38.68 eV' 1 . 
Therefore, n = 0.005218, $ = 1320, and ns = 6.886. 

For rf photo injector conditions, cp = 4.31 eV, p F = 198.3 eV -1 , p T is unchanged, n = 0.1951, s = 69.56 eV -1 , and 
ns= 13.57. 


17.4 Brute force evaluation 

W-e-e-elll, a satirical piece in The Times is one thing, hut bricks and baseball bats really get right to the point... 

- Woody Allen 13 

The behavior of p F when it is evaluated near the barrier maximum compared to how it behaves near the Ferm level 
(the difference between Eqs (17.12) and (17.20)) must now be dealt with explicitly. This is necessary because numerically 
evaluating the integral behind the current density J(F, T) requires knowing E m , that is, knowing where the integrand is at 
a maximum and evaluating the energy slope factor p F there. Finding the location of the maximum E m and therefore P F (.E m ) 
is analytically difficult in the transition region, and so the behavior will be investigated using the bricks and baseball bats 
approach of numerical evaluation. Subtlety can wait. 

The numerical evaluation of the Gamow factor 0{E X ) is the first step. Recall that the canonical equations assumed 
6{E X ) was linear in energy, but strictly, it is not. Even the triangular barrier Gamow factor of Eq. (13.5) shows a more 
complicated non-linear behavior, and that trend persists in the variation of p F between the image charge potential and 
the quadratic barrier when E approaches the barrier maximum. Still, the variation of 0(E X ) is smooth, and that suggests a 
polynomial is a good approximation if one can be found. 


13 "Manhattan',' Pour Films of Woody Allen. New York: Random House, 1982, p. 203. 


www.ebook3000.com 






190 Chapter 17 


The numerical evaluation of 9{E X ) will make use of a generalization of 9{p) (Eq. (13.13)) and R{w) (Eq. (13.14)) for 
arbitrary energy E instead of just the case E = p, given that the integrand of R{w) has desirable features that make 
numerical integration techniques in Section A1.2 profitable (the slope of the integrand at the boundaries vanishes so 
that Eq. (A1.13) renders the trapezoidal series approximation better than expected). Therefore, 

xCB)= _2Q_ 

* FL{E X ) + [4QF + (FL(E X )) 2 ] 1 / 2 

UE X ) = i[(/r + ® - E x ) 2 - 4 QF] l/2 

A numerical integration is economized by the introduction of z = (p + ® - E x )/0 and the usual y = y/4QF/<t>, in terms of 
which 


(17.78) 

(17.79) 

(17.80) 


x_(E x ) = 



L(F X ) = 


(17.81) 

(17.82) 


Consider three cases for which R{w(E x )) is obtained via numerical integration to show the behavior of 9(E X ) evaluated 
using the numerical algorithm of Section A3.9: they are shown below for copper-like parameters at various fields of 1 
eV/nm (Figure 17.11), 4 eV/nm (Figure 17.12), and 8 eV/nm (Figure 17.13). 

Four features are evident: first, 9(E X ) is mostly linear; second, the linear approximation becomes better for high field, but 
is increasingly distressed for lower fields characteristic of thermal-field emission; third, the slope of 9{E X ) at p and p + <p 
is well-matched by the Fowler-Nordheim (Eq. (17.12)) and quadratic (Eq. (17.20)) p p equations, respectively; and lastly 
the mild convexity of 9(E X ) indicates that there is a single E x for which n = 1 or where the integrand is at a maximum, 
in contrast to the behavior indicated in Figure 17.9, in other words finding E m is a moving target. 

A purely numerical treatment for finding E m would be faced with many evaluations of 9(E X ), each requiring an inte¬ 
gration. For that matter, finding J(F, T ) would likewise be faced with two integrations in the calculation of J(F, T): one 
for the evaluation of 9{E X ) and one for the integration over E x . To transition from current density J to total current 1 (as 
in Section 30.2) requires yet another integration over surface. As is generally true, numerical methods are very good 
for the evaluation of one-dimensional integrals, but become increasingly less accurate as more dimensions are added. In 
fact, multidimensional integrals are better handled using Monte Carlo integration methods (Section Al.2.5) when the 
dimensionality d becomes too large. Although "brute" is a synonym of forceful, it need not be one of stupid. The smooth 



Figure 17.11 0(E X ) evaluated numerically from Eq. (17.78) for copper-like parameters and a low field of 1 eV/nm. The blue line is the slope to 0(E X ) 
evaluated at E x = n (i.e., the Fowler-Nordheim approximation). The red line is the slope evaluated at E = n + 0 (he., the quadratic barrier 
approximation). 
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Figure 17.12 Same as Figure 17.11 but for an intermediate field of 4 eV/nm. 



E [eV] 


Figure 17.13 Same as Figure 17.11 but for a high field of 8 eV/nm. Observe that, as predicted by Eq. (17.22), /? F (/0 is closer to p r (ii + 4>). 


behavior of 0(E X ) and its known behavior at the boundaries of the transition region suggests a very good way to approxi¬ 
mate J(F, T) in the transition region via a numerical integration by replacing 6(E X ) with a polynomial expansion, and so a 
little cleverness can significantly ease the computational burden, if a reliable method to obtain the polynomial expansion 
can be found. It can. 

Recall from Section A1.2 that the trapezoidal integral approximation becomes quite accurate when the slopes of the 
integrand at the limits are included, leading to Eq. (A1.17). The reason? Inclusion of the slopes is tantamount to finding a 
cubic polynomial, which has higher order accuracy for smooth functions. Therefore, knowing 6{E X ) and 0'(E X ) = dd/dE x = 
- P F (E X ) at both E x = n and E x = p + </>, for a total of four conditions, allows for the specification of a cubic polynomial 
for 0(E X ), and given its reasonable behavior such an approximation is very good. A cubic in E x , however, lacks elegance. 
Instead, introduce the dimensionless p = (E x - p)/<j) as the values p = 0,1 for the limits E x = p and E x = p + (/>, respectively. 
In terms of p, the cubic polynomial is 

0(E x (p)) = £ C/ (17.83) 

i =o 

The boundary conditions can be put into a compact matrix notation enabling a rapid inversion and solution. The 
4x4 matrix multiplying the coefficients vector C corresponds to the values that p takes on with the (ij)th compo¬ 
nent corresponding to p> for i = 1 and i = 3 (the 9 rows), and to jpi~ l for i = 2 and i = 4 (the 6' rows). Observing that 
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<p(d/dE x ) = d p , the matrix equation is 

1 0 0 o' 

0 10 0 
1111 
0 12 3 , 

where on the right-hand side the third entry is 0 because 9(n + (}>) = 0 at the barrier maximum. Inversion of the matrix 
equation gives the C jr regrouping gives 

0(v + p4>) =(1-P) 2 (2p+I)0(p) 

-Ml-pm- - VPM + </>)} (17.85) 

Notice the general structure behind the polynomial: factors of p(\ -p) prevent those terms from appearing in 9{E X ) at 
the boundaries so that at p = 0,1, only the correct boundary value survives. Such factors routinely appear in Lagrangian 
interpolation coefficients [103] for a similar reason. Lastly, observe that in keeping with the comic premise of the lin¬ 
earized 9 factor and the associated linearized Kemble approximation (see discussion prior to Eq. (17.14)), the cubic 
form of Eq. (17.85) becomes negative for p > 1 as in Figure 17.14, which allows the Kemble form of the transmission 
probability to gracefully approach unity above the barrier maximum such that 9(E X ) and 9'(E X ) are continuous across 
E x = fi + 4>. 

Use of the cubic approximation to 9(E) given by Eq. (17.85) results in the sinuous current integrand shown in 
Figure 17.15: clearly, j(E x ) is more sharply peaked than suggested by the linear 9(E) used in Figure 17.9 due to the higher 


|Q)' 


9(n) 

Q 


-WM 

c 2 


0 

l c b 


-4>P p (m + <W 



E x [eV] 


Figure 17.14 A comparison of the exact (numerically evaluated) 0(B) compared to the polynomial approximation of Eq. (17.85) for the fields of F = 8 
eV/nm (red) and F = 4 eV/nm (blue). The approximate 9 has been extended into the negative regime, whereas the numerical 9 there would become 
imaginary. Dashed lines indicate the locations of the barrier maximum for each field for copper-like conditions (/i = 7 eV, 4> = 4.5 eV). 



Figure 17.15 The integrand j(E x ) for the temperatures 500 K (left/blue), 1000 K (middle/green) and 2000 K (right/red). The 32 equispaced lines are for 
increasing field in steps of 0.05 eV/nm (500 K), 0.1 eV/nm (1000 K), and 0.2 eV/nm (i.e., F ; . =jAB). In all cases, 6 eV < E x < 11.5 eV, it = 7 eV, and 
4> = 4.5 eV. 
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Figure 17.16 The location E m of the maximum of j(E) for the temperatures considered in Figure 17.15. 




Figure 17.17 The values of n(F, T) = P T /P p (E m ) along the lines shown in Figure 17.16. 


order terms that cause faster fall-off on each side of peak located at E x = E m . The locations of E m as a function of field 
are shown in Figure 17.16. But the cubic form of 6(E) does more: it allows for the calculation of Pp(E m ) = -0'(E m ) along 
the lines of Figure 17.16, and so the value of n(F, T ) may be evaluated along the spine that snakes from right to left as 
the field increases. The results of doing this are shown in Figure 17.17 and are perhaps surprising: in the transition region, 
the value of P P (E m ) is defined by the relation n= 1 to a good approximation! And with that observation, a very convenient 
analytical model becomes possible. 


17.5 A computationally kind model 

Kindness, n. A brief preface to ten volumes of exaction. 

- Ambrose Bierce 14 

Just as the linear form of 9(E X ) made analytic forms of J(F, T) possible using the Kemble form of the transmission 
probability D(E X ), the cubic 9(E X ) revealed a structure of n(F,T) in Eq. (17.17) that can be exploited. Such an ability 
to avoid evaluation by numerical integration altogether makes the formulation useful for when general thermal-field 


14 Ambrose Bierce, The Devil's Dictionary. New York: Dover Publications, 1993, p. 72. 
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evaluations are required, for example in computing the current from a hot needle-like structure (e.g., Zro/W Schottky 
emitters [154]) subject to high fields, as thermal emission dominates along the sides, field emission dominates at the 
apex, and in between the thermal and field emissions are comparable. Similar complications arise for field-enhanced 
photoemission [235], 


17.5.1 Intermediate n = 1 regime 


.. .make me happy in your unity. 


- William Shakespeare 15 


As suggested by Figures 17.11 thru 17.13, when parameters are such that n is either well below or well above 1, then 
n and s are obtainable from Eqs (17.35) plus (17.36) (thermal) or Eqs (17.32) plus (17.33) (field), respectively, and total 
current density J GTF from Eq. (17.26) (with p F E 0 -* oo), which is a rather gentle means of obtaining the thermal or field 
current. But when n = 1, that requires work before J CTF can use Eq. (17.69) because although n has become simple, s 
and ns (which become equal) have acquired complexity as a result of the dependence of s = p F (E m )(E 0 (E m ) - p) on E 0 . 
Knowledge of how E m changes in the transition regime is required, a point the conclusions of Section 17.4 sought to 
make clear. The first step is to bound the problem. 

Consider, therefore, the difference in behavior of n{F, T) exhibited by Eqs (17.32) and (17.35): the former behaves as 
n ex F/Tt(y ) and the latter as not F 3 / 4 / T. That is, each exhibits a different power that F is raised to, which is the cause of the 
convergence of the T-like and F-like lines in Figure 17.9. If v is that power, then it may be found by v = Fd P )n{n). For the 
thermal regime, v = 3/4, but for the field regime, evaluation of the power is more involved: approximating f(y) « 1.0613, 
sets v to 1, but the weak field dependence of t(y) changes that slightly. Using Eq. (13.24) gives 


(17.86) 


The variation of y between 0 (for F = 0) and 1 (for F = 4Q/4> 2 ), results in a variation of v between 1 and 19/20 = 0.95, 
with the latter value being more common when the fields are high. For copper-like parameters and a field of 8 eV/nm, 
v = 0.95427. 

As seen from Figure 17.17, the intersection of the power law with the line n = 1 is different for each regime. In the 
thermal regime, ln(«) as a function of ln(F) intersects the n = 1 boundary when T max = 1 / k B p F (p + <p) (from Eq. (17.20)), 
whereas in the field regime, it intersects at T min = l/k F p F (p) (from Eq. (17.12)). Explicitly, 


= HF ( V2 ' 
k R \fmS> \ 4f( » 




(*) 


(17.87) 

(17.88) 


Both T min and T max are proportional to the product of hF/k n \Jm<X> with a factor containing y, and that by virtue of the 
field dependence of y = \/4QF/<S>, the power v is off-set from unity. 


EXAMPLE: Find T max and T min for Cu parameters and F = 4 eV/nm. 

SOLUTION: The quantity hc/\/mc 2 <t> = 0.13013 nm. For the chosen field, y = 0.53333 and so f(y) = 1.0515. There¬ 
fore 

T mm = 2031K 
T max = 2633 K 


15 Ref. [37]: The Tragedy of King Richard the Third, n.1.31. 
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Figure 17.18 A graphical representation (based on Figures A3.1 and A3.2) of the process used to find a solution to Eq. (17.90) using iteration of 
x = attfis/x) with a = 0.53389 and b = 0.26666 for a temperature of T = 1000 K. Open circles represent f(Xj_ j) and black circles represent x ; .. a scales 
linearly with T, and so to leading order, F max will as well. 

An equivalent calculation can be used to find the maximum and minimum fields for a given temperature, although 
the more complicated relationship of t(y) on F makes their determination more involved. For a given temperature T, and 
using Eq. (17.20), the thermal regime extends for all fields up to 



(17.89) 


Conversely, the field regime is for all fields greater than 


= 2W -) yi^ 

h 


(17.90) 



where the max and min subscripts refer to the fields at the upper and lower n = 1 regime, and where y max = -^4QF max /®. 



Although Eq. (17.89) can be evaluated directly, Eq. (17.90) must be solved by other means if the weak dependence of 

t(y) on field is respected. The means may include the following: 

• The method of bisection, as in Section A3.7.1 is a rugged and straightforward approach: extreme values of F max and 
F min are chosen and the midpoint F mid is used in calculating F - 2 t(y{F))sJ 2 m<S>/p T h. If the answer is greater than 0, F min 
is set to the value of F mid , and if not, then F max is. With each step, the uncertainty in the location of the correct value 
of F is reduced by a factor of 2. Though rugged, the method is not fast by comparison to iteration. 

• The method of iteration, as in Section A3.7.2, is a more elegant method for solutions to equations of the form 
x=f(x) as long as f(x) < 1. Eq. (17.90) is such an equation. Let x j+1 =/(*,) given an initial guess x lr then after several 
iterations a reasonable approximation to x M results. With x -*■ F and x^ —■ F max , the derivative of the right-hand 
side of Eq. (17.90) is rather small: if it were zero, then iteration would give the correct answer in one step. 
But small is good: as seen in Figure 17.18, even for a poorly chosen initial guess of F = 10 eV/nm, a very small 
number of iterations determine F max as a consequence of the weak variation of the right-hand side of Eq. (17.90) 
with F max . 

17.5.2 Evaluation of the general thermal-field current density 

Some say the world will end in fire, 

Some say in ice. 

From what I've tasted of desire 
I hold with those who favor fire. 

But if it had to perish twice, 

I think I know enough of hate 
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To say that for destruction ice 
Is also great 
And would suffice. 

- Robert Frost 16 

The domination of current by thermal or cold (aka field) emission is related to the position of E m . Although E m is easily 
approximated by p in the field regime or p + <p in the thermal regime, in the intermediate (n = 1 regime), it changes 
subject to the constraint 

P P (E m )\ n=1 =P T (17.91) 

The behavior of p p (E) is obtained from Eq. (17.85) and d E = cp~ l d p to give 

Mj* + v$) = i - p) + a -p)(i - 3 P )fi p <ji) 

<p 


+ pOp-2)Mii + 4>) 


(17.92) 


an equation which is quadratic in p. Therefore, Eq. (17.91) in Eq. (17.92) defines E m , from which s\ n=l = p T (E 0 (E m ) - p) 
may be calculated from 17 

0(EJ 0(EJ 

B ° =Em+ Mt) =Em+ ^r (17 - 93) 

The use of s so calculated in N(l,s) leads to the current density via 

T)\ n=l =A RJD T 2 N(l,s) 

« A WD T 2 (s(E m ) + ljc-W (17.94) 

Each term in Eq. (17.94) takes some unpacking, and so consider an example worked out in detail to appreciate what is 
involved. 


EXAMPLE: Find J GTP for p = 7 eV, ® = 4.5 eV at a field of F = (F mm + F min )/2 and a temperature of T = 1000 K. 
SOLUTION: Proceed in steps. 

1. From Eq. (17.89) the value of F min is 1.100 eV/nm. 

2. The first three iterations of Eq. (17.90) using an initial value of 4 eV/nm give 4.000,1.970, and 1.930, after which 
the last significant digit does not change in the third decimal, and so F max = 1.930 eV/nm. Therefore, F = 1.515 
eV/nm. 

3. <f> = ® - \/4 QF = 3.023 eV. 

4. p T = 1 /k B T= 11.60 eV 1 . 

5. Using Eq. (17.78), 9(p) = 36.58. 

6. Using Eq. (17.20), 0 F (p + 4>) = 9.129 eV" 1 . 

7. Using Eq. (17.12), p F (p) = 14.71 eV" 1 . 

8. Eq. (17.92) is 11.60 = 72.61p(l - p) + 9.129p(3p - 2) - 14.71(1 - p)(3p - 1), which has the roots p = 0.6034 and 
p = -4.689, for which the positive root is the correct one. 

9. From p, E m = p + pcj) = 8.823 eV. 

10. From Eq. (17.93) and P P = p T , E 0 = 9.900 eV. 

11. From Eq. (17.26), s(E m ) = 33.66. 

12. From Eq. (17.94), J GTP (F, T) = 1.006 X 10“ 5 A/cm 2 . 


How does J gtp (F, T) as calculated using the computationally kind approach compare to its competitors? It is expected 
that J GTP should exceed J PN because p p in the transition region is smaller than near p, and should exceed J RLD because 


16 Robert Frost, Fire and Ice, 1920, https://www.poetryfoundation.org. 

17 This form is possible because even though s is defined by s(E m ) = P P (E m )(E 0 (E m ) - p), the n = 1 regime allows p P to be replaced by p T , which 
is easier to evaluate. When 1, then the usage of P T is not correct. 
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of the influence of tunneling electrons. Calculation confirms this. Using the parameters of the previous example and 
Eq. (13.25), then J FN /J GTP = 0.00081674. Using Eq. (12.3)), then J RLD /J GTF = 0.0069563. The thermal-field, or n = 1 
regime, exhibits behavior not accounted for by either the canonical thermal or field emission equations, and therefore 
the conditions which signify its presence, detailed in Eqs (17.87), (17.88), (17.89), and (17.90), require attention for 
general thermal-field emission conditions. 

Lastly, there is the question of accuracy: use of N(l,s) in the thermal-field, or intermediate n= 1, region is only 
approximately correct by retaining just the leading order part of a factor (1 + exp (p F (E 0 - E))/( 1 + exp(0(E))) that plays 
the role of f(x) in Section Al.2.4. Because this function is peaked itself at E = E m (and equal to unity at E m ), N(\,s) is an 
overestimate of the actual integrand involving 0(E) by a factor on the order of 2, as indicated by comparing the integrands 
in Figure 17.19. On a log plot, such differences are easily overlooked. They are not altogether insignificant, though: 
factors of 2 are preferable to orders of magnitude differences that characterize the usage of the FN and RLD equations in 
regimes where they are not warranted, as in Figures 17.20 and 17.21. 
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Figure 17.19 Comparison of the integrands j(E ) using 0(E) (exact) and P F (E 0 - E) (linear). The red line is the integrand of N(l,s). Conditions are such 
that n = 1, and F is equidistant from its maximum and minimum values (Eqs (17.90) and (17.89)) for T = 1000 K and copper-like parameters. 




log 10 (F [eV/nm]) 


Figure 17.20 General thermal-field emission from a metal ( F = 7 eV, ® = 4.5 eV) held at a temperature of 1000 K. The thermal region is labeled n < 1, 
the field region is labeled n > 1, and the intermediate region is labeled n = 1. The gray lines at the boundaries mark the fields specified by Eqs (17.89) 
and (17.90). RLD refers to Eq. (12.3) and FN to Eq. (13.25). 
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Figure 17.21 General thermal-field emission from a metal (u = 7 eV, 4> = 4.5 eV) held at a field of 2 eV/nm. The thermal region is labeled n < 1, the 
field region is labeled n > 1, and the intermediate region is labeled n = 1. The gray lines at the boundaries mark the temperatures specified by 
Eqs (17.87) and (17.88). RLD refers to Eq. (12.3) andFNtoEq. (13.25). 



17.6 General thermal-field emission code 

... Wisdom comes alone through suffering. 

- Aeschylus 18 

...a person that started in to carry a cat home by the tail was gitting knowledge that was always going to be useful to him, and warn't 
ever going to grow dim or doubtful. 

- Mark Twain 19 

Doing is learning. The final and arduous task in the thermal-field emission theory treatment is to describe the means 
for its numerical implementation using a common programming language, as the tasks to be performed for even one pair 
of field and temperature values is already more involved than for either the thermal or field canonical equations. The 
code developed in the present section is complete (it will run as written). The algorithm is presented in Fortran using 
F95 conventions as Fortran is the quintessential scientific programming language: even the name connotes turning 
scientific equations into code (FORmula TRANslation) [236, 237], and it is particularly adept at handling vectors and 
matrices (although that capability is not fully exploited in the program). The reformulation into other languages should 
be straightforward. 

The code is broken into blocks (subroutines and formula) for easier implementation. For simplicity, no optimizations 
are made in the form of vectorizations or the replacement of formulae used repeatedly by their numerical equivalent. 
The components are in the forms of modules, subroutines, and function calls, as follows: 

1. Modules 

° Fundamentals: Fundamental constants used in main program and subroutines, such as n,m,k B ,h,a,c,£ 0 , and Q 
(called Qo). 20 Two additional terms are given: Arid is encountered in Eq. (12.3); Ampcm2 converts A/cm 2 from 
nm-eV-fs-q units to MKSA (Table 2.1). 

° MaterialEnvironment: Parameters used to control the output sent to written data files: they can be altered after 
the first run by editing the name list file Jf rin. nml. Default parameters are used in the output below. 

2. Subroutines 

° NameDropping: Reading in of the name list file Jf rin. nml: if the file does not exist, a sample one is created and 
populated with the default values. 

3. Function calls 

° RJfno: Fowler-Nordheim equation as given by Eq. (13.25). 

° RJrldo: Richardson-Laue-Dushman equation as given by Eq. (12.3). 


18 Aeschylus, Agamemnon (trans. R. Lattimore), The Complete Greek Tragedies:Aeschylus I. Chicago: University of Chicago Press, 1953, lines 77-78. 
19 Mark Twain, Tom Saywer Abroad, A Public Domain Book, Kindle eBook, 2013, Chapter 10. 

20 Values derived from NIST: http://physics.nist.gov/constants. 
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° RJgtf: General thermal-field emission equation based on Eq. (17.56). 

° RNns: The function N(n,s) based on Eq. (17.65). 

° ThetaE: Evaluation of 0(E) based on Eq. (17.85). 

° BetaE: Evaluation of P F (E) based on Eq. (17.92). 

4. Output 

° JTemp. txt contains four columns for the vectors T, J GTP , J RID and J FN for a fixed field. Because J m is independent 
of temperature, its values are constant 

° Jf ield. txt contains four columns for the vectors F, J GTP , J RLD and J FN for a fixed temperature, where J RID varies 
because of the field dependence of <j) = ® QF. 

17.6.1 Simulation code 


module Fundamentals 

!Source: http://physics.nist.gov/constants 
! (converted from MKSA) 

'Units: 

! electron volts, nanometers, 

! femtoseconds, charge g = 1 

! terms with different units are combinations of Units 
! eg: [rmo] = [eV]/c~2, [c] = nm/fs 


real, parameter :: pi = 3.14159265359, & 

& rmo = 5.685629853, rkb = 1.0/11604.5192, & 

& rhbar = 0.658211899, & 

& alpha = 7.29735257E-3, c = 299.7924580, & 

& epso = 1.0/(4*pi*alpha*rhbar*c), & 

& Qo = alpha*rhbar*c/4, Ampcm2 = 1.602176565E10, & 
& Arid = (rmo*rkb**2)/(2*rhbar*(pi*rhbar)**2) 


end module Fundamentals 


module MaterialEnvironment 


real :: Chem,Phi,Fmax,Fmin,Field,Tmax,Tmin,Temp 

integer :: Nx 

character*l :: tb = char(9) 

data Chem, Phi, Nx / 7.0, 4.5, 41 / 

data Fmax, Fmin, Field, Tmax, Tmin, Temp & 

& / 8.0, 1.0, 2.0, 1700.0, 300.0, 1000.0 / 


end module MaterialEnvironment 


PROGRAM GenThermField 


Calculation of Generalized Thermal 
Field Current Density 
Units: Field = eV/nm; Temp = Kelvin; 
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J = Amp/cm*2 

When not specified, units are eV-nm-fs-q 


use MaterialEnvironment !parameters used in subroutines 
real :: Tmp, Fid 

!....Read in needed parameters 
call NameDropping 


Create temperature variation output 
Let ln(F(j)) be evenly spaced 


write(*,*) ' Creating^file^Jfield.txt:_field^variation' 

write(*,*) '_Temp^taken^toJoe^T„=„' ,Temp, '^Kelvin' 
open(unit=ll,file='Jfield.txt') 

write(11,*) 'F„[eV/nm]',tb,'JGTF^ [A/cm* 2]',tb, & 

& 'JRLD',tb,'JFN' 

do i = 1, Nx 

Fid = Fmin*(Fmax/Fmin)**(float (i-1)/ (Nx-1)) 
write (11,*) Fid,tb,RJgtf(Phi,Chem,Fid,Temp), tb, & 

& RJrldo(Phi,Fid,Temp),tb,RJfno(Phi, Fid) 

end do 

close(unit=ll) 


Create field variation output 
.Let T(j) be evenly spaced 


write(*,*) 'Creating^file^Jtemp.txt:„thermal„variation' 

write (* , *) ' _Field„taken„to„be„F„=„' , Field, ' „eV/nm' 

open(unit=12,file='JTemp.txt') 
write(12,*) 'T„[K] ',tb,'JGTF„ [A/cm * 2]',tb,& 

& ' JRLD',tb,'JFN' 

do i = 1, Nx 

Tmp = Tmin + (Tmax-Tmin)*float(i-1)/(Nx-1) 

write(12,*) Tmp,tb,RJgtf(Phi,Chem,Field,Tmp), tb, & 

& RJrldo(Phi,Field,Tmp),tb,RJfno(Phi,Field) 

end do 

close(unit=12) 


END PROGRAM GenThermField 


subroutine NameDropping 

'READING/CREATING OF DATA INPUT FILE 


use MaterialEnvironment 


implicit none 

integer :: istat, idebug 

namelist/JftParam/ Chem, Phi, Fmax, Fmin, Field, & 
& Tmax, Tmin, Temp, Nx 

writ e(*,*) 'PROGRAM^GenFNRLD' 
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write(*,*) '_<Jfrin.nml>Jaeing„opened...' 

open(unit=ll,file='Jfrin.nml',status='old',iostat=istat) 

close(unit=ll) 

if(istat == 0) then JA namelist file exists; read it 
open(unit=ll,file='Jfrin.nml') 
read(11,JftParam) 

close(unit=ll) 

if(idebug>0) write(*,JftParam) 
else lA namelist file does not exist; create it 
write(*,*) '^input^file^Jfrin.NMLjiot^found.' 
write(*,*) '_Creating„input„data_file^now...' 


open(unit=ll,file='Jfrin.nml') 

write (11, *) ' Symbol,_,„Def inition_„ w „_ w _ ww __Unit' 

write (11, *) ' Chem_^_„Chemical^Potential„_^ [eV] ' 

write (11, *) ' Phi___„Work^Function___„_ [eV] ' 

write (11,*) ' Fmax„__„Max„f ield„f or„Jf ield_ [eV/nm] ' 

write (11,*) ' Fmin_„_J4in„f ield„f or„Jf ield_ [eV/nm] ' 

write (11,*) ' Field__„Field„f or^f or„Jtemp__ [eV/nm] ' 

write (11, *) ' Tmax__„J4ax„temp_f or_Jtemp_„ [Kelvin] ' 

write (11, *) ' Tmin___Min„temp_f or_Jtemp__ [Kelvin] ' 

write (11,*) ' Tmin___Temp_for_for_Jf ield_ [Kelvin] ' 


write (11, *) ' Nx___„JSTumber„of jplot^points' 

write(11,JftParam) 

close(unit=ll) 
stop 
end if 


END subroutine NameDropping 
READING/CREATING OF DATA INPUT 




function RJfno(Phi,Field) 


Internal Units: eV-nm-fs-g 
Implicit use is made of the Forbes-Deane 
Approximation to v(y) 

The Nordheim paraemeter t(y) is approximated by tyo 
'Output Units: Amps / cm~2 


use Fundamentals 

implicit none 

real, parameter :: Ao = 1.0/(16*rhbar*pi**2), & 

& Bo = 4*sqrt(2*rmo)/(3*rhbar), & 

& tyo = 1.0613132 

real :: Phi, Field, RJfno, nu 

nu = 2*Bo*Qo/(3*sqrt(Phi)) 

RJfno = (Ao/(Phi*tyo**2))*(((Phi**2)*exp (6.0)/& 

& (4*Qo))**nu)*(Field**(2.0-nu))* & 

& exp(-Bo*(Phi**l.5)/Field)*Ampcm2 
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function RJrldo(Phi,Field,Temp) 


Internal Units: eV-nm-fs-q 
Output Units: Amps / cm'2 


use Fundamentals 

implicit none 

real :: Phi, phix. Field, Temp, RJrldo 

phix = Phi - sqrt(4*Qo*Field) 

RJrldo = Arid*(Temp**2)*exp(-phix/(rkb*Temp))*Ampcm2 

return 




function RJgtf(Phi,Chem,Fid,Tmp) 
!Internal Units: eV-nm-fs-q 
!Output Units: Amps / cm'2 


use Fundamentals 

implicit none 

real :: y, phix, vy, ty, Qmu, Bfmu, Bfmphi, Tmin, & 

& Tmax, Tmp, Em, s, rn, RJgtf, Phi, Chem, & 

& Fid, BetaT, & 

& RNns, Ap, Bp, Cp, p, ThetaE, BetaE 
! Create terms needed for the evaluations 
betaT = 1.0/(rkb*Tmp) 
y = sqrt(4*Qo*Fld)/Phi 
phix = Phi - sqrt(4*Qo*Fld) 
vy = 1.0 - (3.0 - alog(y))*(y**2)/3 
ty = 1.0 + (1.0 - alog(y))*(y**2)/9 
Qmu = 4*sqrt(2*rmo*Phi**3)*vy/(3*rhbar*Fld) 

Bfmu = 2*sqrt(2*rmo*Phi)*ty/( rhbar*Fld) 

Bfmphi = pi*sqrt(y*rmo*Phi) /( rhbar*Fld) 

Tmin = (rhbar*Fld/(4*ty*rkb))*sqrt(2.0/(rmo*Phi)) 
Tmax = (rhbar*Fld/(pi*rkb)) *sqrt(1.0/(y*rmo*Phi)) 

! Three regimes: 

if(Tmp.le.Tmin) then ! Field Emission Regime 

s = Qmu 
rn = betaT/Bfmu 

else if(Tmp.gt.Tmax) then ! Thermal Emission Regime 

s = Bfmphi*Phix 
rn = betaT/BFmphi 

else 


1.0 


!Intermediate Regime 
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Ap = 3*(phix*(Bfmu + Bfmphi) - 2*Qmu) 

Bp = 2*(3*Qmu - phix*(2*Bfmu + Bfmphi)) 

Cp = phix*(Bfmu - BetaT) 
p = (-Bp-sqrt(Bp**2 - 4*Ap*Cp))/(2*Ap) 

Em = Chem + p*phix 
s = betaT*(Em - Chem + & 

& (ThetaE(Phi,Chem,Fid,Em)/BetaE(Phi,Chem,Fid,Em))) 

end if 

RJgtf = (Arid/(rkb*betaT)**2)*RNns(rn,s)*Ampcm2 

return 


end 


function RNns(n,s) 

! (7*pi"4 - 720)/720 = -0.10593434 

! ( pi" 2 - 12)/6 = -0.35506593 


implicit none 

real :: n, s, RNns, sn, sd, x, y, z, sng 
x = n**2 
y = 1.0/x 

sn = -x*(0.10593434*x + 0.35506593) 
sd = -y*(0.10593434*y + 0.35506593) 
z = (n-1.0)*s 
if(abs(z).gt.0.001) then 

sng = (n**2 + 1.0)*((n**2)*exp (-s) - & 

& exp(-n*s))/(n**2 - 1.0) 

sng = (0.5*(n**2 + 1.0)*exp(-s)/(n+1.0))* & 
& ((1.0-n)*(s**2) + 2*(1.0+n) + 2*s) 

end if 

RNns = sng + sn*exp(-n*s) + x*sd*exp(-s) 

return 




function ThetaE(Phi,Chem,Field,E) 
! Cubic in p = (E - mu)/phix 


use Fundamentals 

implicit none 

real :: Qmu, Bfmu, Bfmphi, p, ThetaE, Phi, Chem, & 
& Field, E, vy, ty, y, phix 

y = sqrt(4*Qo*Field)/Phi 
phix = Phi - sqrt(4*Qo*Field) 
vy = 1.0 - (3.0 - alog(y))*(y**2)/3 
ty = 1.0 + (1.0 - alog(y))*(y**2)/9 
Qmu = 4*sqrt(2*rmo*Phi**3)*vy/(3*rhbar*Field) 
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Bfmu = 2*sqrt(2*rmo*Phi)*ty/( rhbar*Field) 
Bfmphi = pi*sqrt(y*rmo*Phi) /( rhbar*Field) 
p = (E - Chem)/phix 

ThetaE = Qmu*(2*p+1.0)*(1.0 - p)**2 - & 

& phix*Bfmu*p*(1.0 - p)**2 + & 

& phix*Bfmphi*(1.0 - p)*p**2 

return 


end 


function BetaE(Phi,Chem,Field,E) 

! Quadratic in p = (E - mu)/phix 


use Fundamentals 

implicit none 

real :: Qmu, Bfmu, Bfmphi, p, BetaE, Phi, Chem, & 
& Field, E, vy, ty, y, phix 

y = sqrt(4*Qo*Field)/Phi 
phix = Phi - sqrt(4*Qo*Field) 
vy = 1.0 - (3.0 - alog(y))*(y**2)/3 
ty = 1.0 + (1.0 - alog(y))*(y**2)/9 
Qmu = 4*sqrt(2*rmo*Phi**3)*vy/(3*rhbar*Field) 
Bfmu = 2*sqrt(2*rmo*Phi)*ty/( rhbar*Field) 
Bfmphi = pi*sqrt(y*rmo*Phi) /( rhbar*Field) 
p = (E - Chem)/phix 

BetaE = (6*Qmu/phix)*p*(1.0 - p) - & 

& Bfmu*(1.0 - p)*(3*p - 1.0) + & 

& Bfmphi*p*(3*p - 2.0) 

return 




17.6.2 Input/Output 

Input file Jfrin.nml (containing default values): 
Symbol Definition Unit 


Chem 

Chemical Potential 

[eV] 

Phi 

Work Function 


[eV] 

Fmax 

Max field 

for 

Jfield 

[eV/nm] 

Fmin 

Min field 

for 

Jfield 

[eV/nm] 

Field 

Field for 

for 

Jtemp 

[eV/nm] 

Tmax 

Max temp 

for 

Jtemp 

[Kelvin] 

Tmin 

Min temp 

for 

Jtemp 

[Kelvin] 

Tmin 

Temp for 

for 

Jfield 

[Kelvin] 

Nx 

Number of 

plot points 



& JFTPARAM 
CHEM = 7.00000, 

PHI = 4.50000, 
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FMAX = 
FMIN = 
FIELD = 
TMAX = 
TMIN = 
TEMP = 


8 . 00000 , 
1 . 00000 , 

2 . 00000 , 
1700.00, 
300.000, 
1000.00, 


NX = 41, 

/ 

Output file JTemp. txt in tab-delimited 
T [K] JGTF [A/cm"2] JRLD 


300.000 
335.000 
370.000 
405.000 
440.000 
475.000 
510.000 
545.000 
580.000 
615.000 
650.000 
685.000 
720.000 
755.000 
790.000 
825.000 
860.000 
895.000 
930.000 
965.000 
1000.00 
1035.00 
1070.00 
1105.00 
1140.00 
1175.00 
1210.00 
1245.00 
1280.00 
1315.00 
1350.00 
1385.00 
1420.00 
1455.00 
1490.00 
1525.00 
1560.00 
1595.00 
1630.00 
1665.00 
1700.00 


4.792653E-04 
4.970490E-04 
5.179189E-04 
5.423434E-04 
5.709133E-04 
6.043782E-04 
6.436993E-04 
6.901248E-04 
7.452986E-04 
8.114256E-04 
8.915255E-04 
9.898394E-04 
1.112504E-03 
1.268712E-03 
1.472803E-03 
1.748101E-03 
2.134147E-03 
2.700365E-03 
3.571666E-03 
4.975671E-03 
7.327671E-03 
1.137798E-02 
1.294884E-02 
1.615842E-02 
2.172754E-02 
3.093601E-02 
4.600899E-02 
7.071256E-02 
0.111365 
0.178509 
0.289663 
0.473769 
0.778335 
1.28079 
2.10631 
3.45551 
5.64702 
7.51378 
9.70862 
12.6294 
16.5061 


format: 

JFN 

0.000000 
0.000000 
1.088887E-31 
2.599633E-28 
1.825727E-25 
4.938201E-23 
6.255702E-21 
4.292822E-19 
1.782111E-17 
4.874463E-16 
9.393571E-15 
1.345306E-13 
1.494791E-12 
1.334567E-11 
9.854653E-11 
6.164562E-10 
3.333098E-09 
1.584359E-08 
6.717056E-08 
2.571505E-07 
8.984302E-07 
2.891130E-06 
8.637821E-06 
2.412849E-05 
6.340260E-05 
1.575728E-04 
3.721621E-04 
8.388857E-04 
1.811501E-03 
3.760179E-03 
7.525453E-03 
1.456118E-02 
2.730675E-02 
4.974182E-02 
8.819163E-02 
0.152470 
0.257467 
0.425305 
0.688221 
1.09234 
1.70257 


3.925210E-04 
3.925210E-04 
3.925210E-04 
3.925210E-04 
3.925210E-04 
3.925210E-04 
3.925210E-04 
3.925210E-04 
3.925210E-04 
3.925210E-04 
3.925210E-04 
3.925210E-04 
3.925210E-04 
3.925210E-04 
3.925210E-04 
3.925210E-04 
3.925210E-04 
3.925210E-04 
3.925210E-04 
3.925210E-04 
3.925210E-04 
3.925210E-04 
3.925210E-04 
3.925210E-04 
3.925210E-04 
3.925210E-04 
3.925210E-04 
3.925210E-04 
3.925210E-04 
3.925210E-04 
3.925210E-04 
3.925210E-04 
3.925210E-04 
3.925210E-04 
3.925210E-04 
3.925210E-04 
3.925210E-04 
3.925210E-04 
3.925210E-04 
3.925210E-04 
3.925210E-04 
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Output file Jf if 
F [eV/nm] 
1.00000 
1.05336 
1.10957 
1.16878 
1.23114 
1.29684 
1.36604 
1.43893 
1.51572 
1.59660 
1.68179 
1.77154 
1.86607 
1.96564 
2.07053 
2.18102 
2.29740 
2.41999 
2.54912 
2.68514 
2.82843 
2.97935 
3.13834 
3.30580 
3.48220 
3.66802 
3.86375 
4.06992 
4.28709 
4.51586 
4.75683 
5.01066 
5.27803 
5.55967 
5.85634 
6.16884 
6.49802 
6.84476 
7.21000 
7.59474 
8.00000 


d. txt in tab-delimi 

JGTF [A/cm' 2] 

3.610247E-08 
8.879720E-08 

2.2 82641E-07 
3.464692E-07 
5.675248E-07 
1.009689E-06 
1.965366E-06 
4.22 02 86E-06 
1.009293E-05 
2.718130E-05 
8.351229E-05 
2.9733 05E-04 
1.250405E-03 
5.079909E-03 
1.58 0514E-02 
5.320935E-02 
0.185017 
0.643032 
2.18956 
7.22165 

22.9292 
69.8631 
204.007 
570.791 
1530.86 
3939.07 
9735.08 
23138.0 

52960.2 
116904 . 
249222 . 
513861. 

1.026172E+06 
1.987526E+06 
3.738639E+06 
6.838934E+06 
1.218124E+07 
2.115210E+07 
3.584957E+07 
5.937064E+07 
9.617879E+07 


format: 

JRLD JFN 
2.808417E-09 
4.052475E-09 
5.904351E-09 
8.688134E-09 
1.291519E-08 
1.940028E-08 
2.945578E-08 
4.521744E-08 
7.02 0131E-08 
1.102586E-07 
1.752450E-07 
2.81954 0E-07 
4.59354 9E-07 
7.580564E-07 
1.267604E-06 
2.148536E-06 
3.692663E-06 
6.437693E-06 
1.13 8879E-05 
2.045251E-05 
3.73004 IE-05 
6.911160E-05 
1.301498E-04 
2.492153E-04 
4.854400E-04 
9.623223E-04 
1.942366E-03 
3.993702E-03 
8.368768E-03 
1.788174E-02 
3.897999E-02 
8.673330E-02 
0.197095 
0.457671 
1.08660 
2.63918 
6.56176 
16.7105 
43.6160 
116.755 
320.750 


1.161073E-18 
3.366324E-17 
8.256117E-16 
1.727458E-14 
3.108433E-13 
4.847257E-12 
6.598127E-11 
7.894216E-10 
8.355946E-09 
7.873680E-08 
6.643776E-07 
5.048154E-06 
3.472470E-05 
2.1733 07E-04 
1.243542E-03 
6.534721E-03 
3.167354E-02 
0.142182 
0.593417 
2.31123 
8.42985 
28.8888 
93.3138 
284.951 
824.970 
2270.50 
5955.74 
14925.8 

35820.3 

82502.4 
182747. 
390070. 
803816. 

1.602017E+06 
3.093205E+06 
5.795330E+06 
1.055209E+07 
1.869902E+07 
3.229345E+07 
5.44243 8E+07 
8.961740E+07 


17.6.3 Remarks 

The output data files JTemp. txt and Jf ield. txt have been graphically presented in Figures 17.21 and 17.20, respec¬ 
tively. In the later, the dovetailing of the general thermal-field form with the canonical FN and RLD equations is good, 
apart from the mild bumps in the curve of J GTP (F, T) near the left boundary of the n = 1 region, discussed in Figure 17.19. 
Although greater accuracy can be obtained by using the polynomial 9(E) of Eq. (17.85) or, better, the exact form in 
Eq. (17.78), in a numerical integration of j(E x ) exact approaches begin to be attractive. 



PART III 

Exact tunneling and transmission 
evaluation 


“Do you mean that you think you can find out the answer to it?" said the March Hare. 

“Exactly so," said Alice. 

“Then you should say what you mean," the March Hare went on. 

“I do," Alice hastily replied; “at least - at least I mean what I say - that's the same thing, you know ." 

“Not the same thing a hit!" said the Hatter. 

“You might just as well say that T see what I eat ' is the same thing as T eat what 1 see 7 " 

- Lewis Carroll 1 

Exact solutions to models are indispensable for identifying when approximations go awry, and are "exactly so" in two 
ways. The first type are exact (not approximate) solutions to simple models, of which the rectangular and triangular 
barriers are iconic. The second type are numerical methods for exact (accurate) solutions of more general barriers. Both 
types illuminate how to modify the Kemble approximation and the underlying Gamow factor and identify where they 
are appropriate - or when they are not. 

The treatment of the exact methods will first analyze the simplest of the barriers, namely, the rectangular and triangular 
barriers, and so introduce numerical methods for dealing with them. Those methods shall then be generalized to develop 
a transfer matrix approach (TMA) first using plane waves and then using the more accurate Airy function method. Lastly, 
because the TMA methods can analyze resonances, the treatment will close with a consideration of several configurations 
where tunneling, resonance, and emission appear alongside one another. 


1 Lewis Carroll, Alice's Adventures in Wonderland. Radne, WI: Whitman Publishing Company, 1945, Chapter VII: A Mad Tea-Party. 
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Simple barriers 


Cora: . ..But the decision I've come to is I'd rather make the gravest of mistakes than surrender my own judgment. 

- Last of the Mohicans 1 

The development of the general thermal-field-photoemission equation (Eq. (17.26) using Eq. (17.30)) implicitly used 
a feature of the Kemble approximation of Eq. (17.9) that when 0(E) = 0, then D(E) =1/2. Even from elementary consid¬ 
erations, this is not quite acceptable: recall the step function potential barrier of Section 9.2.2, where D(E(k)), as given in 
Eq. (9.69), vanishes at E = V 0 . The step function barrier is the limiting cases of a rectangular barrier (L -*■ oo) or a trian¬ 
gular barrier (F -» 0) for the Gamow factors of Section 11.2, and for these cases D(E = V g ) does not in general equal 1 /2. 
Nevertheless, Eq. (11.1) for J(F, I) was remarked to have had a kind of truthiness in its ability to anticipate the behavior of 
the energy distributions of thermal-field emission: the Gamow factor (Section 11.2) provided a means of estimating the 
tunneling current, the Kemble approximation of Eq. (17.9) enabled treating the integrand near the barrier maximum, the 
polynomial representation of the Gamow factor 0(E) for E > p + $ that was extended into the region E x > n + cp enabled 
the development of the general thermal-field current density J GTF in terms of N(n,s) of Eq. (17.56), and enabled the 
recovery of the canonical Fowler-Nordheim (Eq. (13.25)) and Richardson (Eq. (12.3)) equations, as well as the Fowler 
Dubridge (Eq. (14.3)) equation. These canonical equations were shown to have good correspondence with experimental 
data. Lastly, there is the weighty authority of those whose names became synonymous with the canonical equations, 
and whose pronouncements command a certain deference. 

All that may be true, but it does not settle whether the Kemble form is reasonable for general barriers, or whether the 
Gamow factor is reasonable in general. Moreover, the justification for extending 9(E) into situations where E> p + cp 
because that prescription seems to work is rather weak praise, particularly when 9(E > V 0 ) should vanish. Blow the 
extension of the wave function formalism of Section 9.2.2 justifies the canonical equations beyond the step potential 
models of Section 9.2.3.1 is intriguing, although the Bohm approach of Section 11.2 lends a certain plausibility. Lastly, 
deference to the authority of past masters is often - but not always - valid. 

A better approach is not to surrender one's judgment and undertake assessing accuracy using an exact methodology 
to identify when the Kemble form is good, what approximation to 9(E X ) is good, and how D(E X ) really behaves near 
the barrier maximum. To emphasize the point: exact methods can treat barriers that induce resonant behavior, but 
the methodologies covered till now cannot. The first step to understanding what changes is a reconsideration of the 
simple barriers, not from the perspective of the Gamow factor, but from the perspective of solving Schrodinger's equation 
properly. 

18.1 Rectangular barrier 

Thus there exists an essential truth that must he disengaged from the outward appearance of the objects to he represented. This is the only 
truth that matters. 

- Henri Matisse 2 

Consider the simple rectangular barrier, a staple of all quantum mechanics textbooks. Although straightforward, it 
portends a great deal that appears with other and more complex potentials, in addition to being exquisitely suited to 


Michael Mann and Christopher Crowe, screenplay of Last of the Mohicans, based on the novel by James Fenimore Cooper, 1992. 

2 Henri Matisse, "Exactitude is Not Truth", Matisse on Art, ed. Jack D. Flam. Berkeley: University of California Press, 1995, Essay 40, p. 179. The 
title of the essay refers to a quote by Eugene Delacroix (Journal de Eugene Delacroix, entry for January 25, 1857, p. 236: "La froide exactitude 
n'est pas l'art...', translated as "Cold exactitude (precision) is not art..." In another of life's little ironies, Matisse did not preserve the quote 


Introduction to the Physics of Electron Emission, First Edition. Kevin L. Jensen. 
© 2018 John Wiley & Sons Ltd. Published 2018 by John Wiley & Sons Ltd. 
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demonstrate the methodology that will lead to the transfer matrix approach of Chapter 19. Rather than cleaving to the 
traditional narrative, consider it instead in a way that sheds light on both the Kemble approximation and the Gamow 
factor. 

A rectangular barrier is a modification to the step barrier by the addition of a second step, therefore methods used in 
the analysis of the step potential of Sections 9.2.2 and 9.2.3 can be generalized. Now, however, instead of treating two 
regions, as in Figure 9.7, there are three, as in Figure 11.1, as considered when treating the Gamow factor in Section 
11.2: each side of the barrier (regions 1 and 3) and the barrier itself (region 2). Using the convention that hk n , t n , and r n 
are the momentum, transmission, and reflection coefficients, respectively, in region n, and that the boundary between 
regions n and n+ 1 occurs at x n , then for a barrier the single matrix equation of Eq. (9.65) (for n = 1) becomes two matrix 
equations. Writing them out is an ordeal, so analogous to the methods described in Section 8.2, introduce the simplifying 
notation for a 2 x 2 matrix M n (x) and a two-component coefficient vector |f) defined by 



(18.1) 


in terms of which the two matrix equations are delightfully simple, one for each side of the barrier: 


Al 1 (x 1 )|f 1 >=M 2 (x l )|f 2 > 

M 2 (x 2 )|f 2 >=M 3 (x 2 )|f 3 > 


(18.2) 


The elimination of |f 2 ) simplifies matters even more, giving 


If,) = {Af^x,}- 1 • M 2 (x,)} • {M 2 (x 2 )~ 1 ■ M 3 (x 2 )} |f 3 > 
= S,(x,) • $ 2 (x 2 )|f 3 > 


(18.3) 


where the matrix S n (x) = M n (x)~ l ■ M„ +1 (x) is defined. Thus, a factor of S n (x n ) is associated with each interface between 
adjacent regions where V(x) changes: the change in V{x) can be a change in slope, but also a discontinuous change in 
height, or both, but for now consider just discontinuous changes in height. The generalization to more than one barrier 
should be rather easy when the time comes. 

Unlike the step potential, two cases are now in play. Letting the k associated with the barrier height V 0 be defined 
by V 0 = h 2 k 2 0 /2m, then over the barrier transport is defined by k y > k 0 . Conversely, tunneling occurs when k x < k 0 . The 
following assumptions and conventions are used 

• The problem is one dimensional, and so the total energy is E = h 2 k 2 /2m. The case presently under consideration 
(Figure 11.1) is k x = /c 3 = k because in regions 1 and 3, Vix) = 0 whereas in region 2, V{x) = V 0 (but this need not be 
so generally). 

• The boundary between region 1 and 2 occurs at x, = 0; the boundary between 2 and 3 occurs at x 2 = L. 

• The quantity k = \/\k 2 0 - k 2 \ is so constructed that it is always real , 3 so that when E > V 0 , k 2 = k, but when E < V 0 , then 
k 2 = iK. 

Whether k 2 is imaginary (zV) or real (k) determines whether the transport is over (E > V 0 ) or under (E < V 0 ) the barrier, 
respectively. Consider each in turn. 

For over-the-barrier transport, using the simple inverse of a 2 x 2 matrix followed by matrix multiplication, the expres¬ 
sion for S y (0) is easily shown to be 



(18.4) 


Were the problem to end here, the step potential findings of Eq. (9.65) result, but the problem does not end here: there 
is the matter of S 2 (L), given by 



(18.5) 


3 An attempt to preserve this convention when linear fields are introduced is the motivation for the Zi functions in Section 19.2, along with the 
desire that Zi account for outgoing waves. 
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One might think the next step is to consider the product • S 2 , but that is a torturous path for the writing of equations: 
it is easier to make use of the coefficient vectors, where 


lfi>- 




(18.6) 


(i.e., t r = 1, r, = r(k), f 3 = t(k) and r 3 = 0) because of the ease of dealing with two-component columns rather than 
four-component matrices, especially when multiplications by 0 are involved, to wit 


( 1 \ _ t(k) ikL ( 4kic cos ( kL ) “ 2z'(k 2 + K ' 2 ) sin(*rL)' 
\r(k) J ~ 4kx e ^ -2 i(k 2 - k 2 ) sin(rL) 


where e ±lKl = cos(rL) ± i sin(vl) renders the components a bit more compact. Equating the components of the column 
vectors results in expressions for r(k) and t(k), or 


-i(k 2 - k 2 ) sin(K-L) 

2kic cos(zrL) - i(k 2 + k 2 ) sin(vL) 


(18.8) 


t(k) = 


_2 kK _ _ 

2kK cos (jcL) - i(k 2 + k 2 ) sin(zrL) 


(18.9) 


From the relations j iltc = Tik/m and j tram = ( hk/m)\t\ 2 , and Eq. (9.69), it follows that the transmission probability for a 
rectangular barrier D rec (k) is 


D rec {k > k 0 ) 


={ i+ i(!-l) sinVi) } 


For under-the-barrier transport, the problem could be started afresh by setting up the matrix relations at each 
interface again, but then it is far easer to realize that doing so means repeating all the same steps with the replace¬ 
ment k —► he. The result is simply to make the same changes in Eq. (18.10) and use the relation sin(zieL) = i sinh(vL) 
to obtain 


-ec(k <k 0 )= | 1 + + -0 sinh 2 (K-L) j- 


Therefore, the transition from under-the-barrier to over-the-barrier is occasioned by k zV, but this is equivalent to 
(k < k 0 ) ->■ (k> k 0 ), or, in other words, for all k, D rec (k) is given by 


D rec (k ) = 1 + 




(18.12) 


This form can be shoe-horned into the Kemble form of D(k) = 1/(1 + e°) if 0(k) is identified as 


0(k) = 2 In 



(18.13) 


with the realization that when k > k 0 , the argument of the sinh -function becomes imaginary and the sin function results 
so that e 0<k> goes from exponentially increasing to acquiring an oscillatory nature. 

It is difficult to see how the Gamow factor hides inEq. (18.13), but it is there. Recall that the Gamow factor for the rect¬ 
angular barrier, from Eq. (11.15), is given by 2 kL = 2Ly/k 2 - k 2 <-+ 6 rec (k), therefore as the argument of the sinh-function 
becomes large, ln(sinh(2 kL)) —► 2kL, and there it is. However, other factors complicate matters when k « k 0 or when 
k -* 0. The case k 0 L = 1 corresponds to a barrier that is as wide as 1 jk 0 is tall, and the case k/k 0 = 1 corresponds to an 
energy h 2 k 2 /2m equal to the barrier height V 0 . Then 8 rec (k)/(2k a L) = y/l — {k/k 0 ) 2 , regardless of the magnitude of k 0 L, 
against which 6(k)/ (2 k 0 L) may be compared, as in Figure 18.1. Thus, it is seen that when k 0 L is large, the Gamow factor is 
a reasonable approximation, but when k 0 L -> 1, it is less so. At the boundaries of the figure, 6{k -*■ 0)/(2 k 0 L) -*■ —2 ln(2x) 
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Figure 18.1 Comparison of 6 rlc (k)/(2k 0 L) (using Eq. (18.13)) to ic/k 0 = y/l - (k/k 0 ) 2 . 


becomes large whereas 6 rec approaches unity, and that 0(k -»• k 0 ) -*■ 2 \n(k 0 L/2) whereas 0 rec ->• 0. Therefore, the Gamow 
factor is best in the intermediate regime k/k 0 ~ l/y/l for high and thin barriers (k 0 L » 1). 

The relation of D rec {k) to the Kemble approximation is even more nuanced. Recall that the Kemble approximation 
is D(k)zi 1/(1 + exp(0(k))), which suggests that D(k 0 ) = 1/2 because 0(k o ) = 0. In contrast, Eq. (18.12) can be used 
to show 


DM 


-{'•(WF 


which gives the Kemble result D{k 0 ) =1/2 only for k 0 L = 2. 

For k > k 0 , more interesting behavior is revealed: not only is 0(k) and its first derivative continuous across E k = V 0 
(as assumed by the general thermal-field treatment and shown in Figure 17.4), but also Eq. (18.13) can be rewritten, 
equivalently, as 


8 rec {k > k 0 ) = 2 In 



(18.15) 


Now it is apparent that when kL = jn for integer j, then D-* 1. That is, the transmission probability is oscillating, with 
wider barriers experiencing more oscillations, as shown in Figure 18.2. Notice something else, though: the step potential 
barrier result of Eq. (9.69) (where k' there is k here) is lurking in the background, as shown in Figure 18.3, where 
the step function D(k) of Eq. (9.69) is compared to the n= 16 case as a function of ic/k 0 , with oscillations developing 



Figure 18.2 The transmission probability D rcc (k) for k> k 0 for the values k 0 L = h = 2,3,5,7, and 11, respectively. Observe the continuity of D(k) and 
d E D(k) across the k/k 0 = 1 boundary. 
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Figure 18.3 The transmission probability D (k) for k > k 0 for the values k 0 L = h = 16 compared to the step function transmission probability of 
Eq. (9.69). 


in the rectangular barrier case because of constructive and destructive interference of the wave function components 
that manifest themselves across a single barrier (a premonition of the truly bizarre behavior in store when the resonant 
tunneling diode (RTD) is considered in Section 19.2.4 below). As seen, the overall trend is anticipated by the step function 
results. 

Right about now, it appears that the previous embrace of the linearized Kemble approximation of Section 17.2, used 
by J gtf (F, T) and discussed near Eq. (17.10), is running afoul of features of the simple square barrier model. For the 
purposes of discussion, let h = k 0 L. Then (i) although D(k) is continuous across k = k 0 , the condition D(k 0 ) =1/2 only 
occurs for comparatively short barriers such that h = 2, (ii) D(k) = 1/2 for k > k 0 when h is larger, and (iii) for higher 
values of h = k/k 0 , an oscillatory behavior is distinctly visible but which has no analog in the linearized Kemble form. 
But the linearized Kemble approximation also assumed that d E 0(E(k)) = 6'(E) was continuous through E = V 0 (or k = k 0 ), 
which is the case for Eq. (18.13) but not so for 0 rec (E) <x \Je - V 0 for which 0' rec (E) <x -1 /\Je- V 0 and therefore singular 
at E= V 0 . 

The triangular barrier (as well as the quadratic barrier) is not singular at E = V 0 for the Gamow factors by comparison. 
It remains to be seen if that will make a difference. But it is rather clear already that one moral of the story is that simple 
models (such as those based on the Kemble approximation) are wonderful for their pedagogical insights, but as with any 
toy model, the differences with models that are more faithful to reality (more "essential" as Matisse might have meant 
it) matter. 


18.2 Triangular barrier: general method 

Joseph: There you are. It's clever. It's German. It's quality work. And there are simply too many notes. Do you see? 

Mozart: There are just as many notes, Majesty, neither more nor less, as are required. 

- Peter Shaffer 4 

The triangular barrier model, which formed the basis of Fowler and Nordheim's celebrated equation [36] given in 
Eq. (13.10), simplifies the rectangular barrier model. It does so by returning to two regions instead of three, as for the 
rectangular barrier. But that visual simplicity comes at the cost of greater mathematical complexity by making the wave 
functions of region 2 more difficult to describe: the potential is no longer constant and therefore plane waves can no 
longer be used. That is, whereas Schrodinger's equation in the rectangular barrier region 

[~^n^ + V ) ¥k{X) = Ek¥k{X) (18 ' 16) 


1 Peter Shaffer, Amadeus. New York: Harper Perennial, 2001, Act 1, Scene 8, p. 39. 
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implies y/ k (x) is a combination of exp (±iy/k 2 - kjx) with E k = h 2 k 2 /2m and V 0 = h 2 k 2 /2m (using the notation of the last 
section), the triangular barrier potential of 

(- £ J? + v ° - Fx ) ¥k(x)=EkVk(x) (18 - 17) 

makes y/ k (x) something a good deal more challenging. To see what it is, Eq. (18.17) should be simplified because in its 
present form it has "too many notes", and the functions that will arise because of that will only add to the cacophony. 

The first step is to prune Eq. (18.17) until Airy's differential equation is clearly visible [85, 173, 238]. Introduce the 
term/ = 2 mF/h 2 (which has units of/ = 1/nm 2 ) so that Eq. (18.17) can be written as 

-d 2 y/ k (x) + {k 2 0 - k 2 - fx)y/ k (x) = 0 (18.18) 

and then introduce the term w such that w(x) = f^ 2 ^(k 2 — k 2 —fx ): with that notational simplification, the cacophony of 
Eq. (18.17) is reduced to 

d 2 i/z - WV = 0 (18.19) 


or Airy's differential equation, which is a far simpler tune. Consequently, ys will be linear combinations of the functions 
Ai(in) and Bi(in). 

The sign of various terms creates issues. The first issue is that the field gradient has been taken so far as negative (i.e., 
F appears with an explicit - sign as in V(x) = V 0 — Fx). When more complex potentials are encountered, occasions arise 
where both positive and negative fields are encountered. There is advantage in keeping / itself a positive quantity, so 
introduce a factor s to account for the sign of the gradient of the field: s = -1 will correspond to the usual potential 
term (—Fx) but when the potential is rising, as in (+Fx), then s = +1. Thus, V(x) = V g — E k — Fx becomes, in this notation, 
k(x ) 2 = \k 2 0 — k 2 + sfx\ with s = -1, where k has been generalized to include a dependence on position x. Because s appears 
in Eq. (18.19) as s 2 = 1 (one should convince oneself of this), its consideration need not trouble the discussion till later. 

The second issue is that k 2 - k 2 + sfx can change sign, so careful bookkeeping for following the consequences of that 
sign through the calculations is important. Giving the sign associated with k 2 -k 2 + sfx its own name does that: call it c 
here. 5 In the asymptotically small/ limit, sines and cosines, as well as exponentially increasing and decreasing functions, 
that arise in the step potential problem of Section 9.2.2 should be recovered, suggesting that c corresponds to the four 
roots of the equation c 4 = 1, or e = ±1, ±z. Rather than dealing with w, which can be positive or negative, deal instead 
with z such that it is only positive and defined by w = c 2 z, with 


z(x) = f~ 2/3 \k% -k 2 +sfx\ 


k(x ) 2 

/ 2/3 


(18.20) 


where c 2 = 1 if (k 2 - k 2 + sfx) > 0 and c 2 = -1 if (k 2 - k 2 + sfx) < 0. Now terms such as / and k are always, by definition, 
positive, so that the value of c denotes the Airy function under consideration, of which there are four: Ai(c 2 z) = Ai(±z) 
and Bi(c 2 z) = Bi(±z). 

The literature on Airy functions is far too extensive and thorough to be improved upon. But it is not the behavior 
of Airy functions which is of interest. Instead, it is the behavior of decaying, growing, or propagating wave function 
solutions which command attention, and to force attention back to that key point, speak instead of a function Zi(c, z) 
which is a solution to the equation 

d 2 Zi(c,z)-c 2 zZi(c,z) = 0 (18.21) 


How does Zi (c,z) behave? There are two kinds of behavior that are sought: exponential and oscillatory behavior. 
Therefore, 


Zi(l, z) = Bi(z) 

Zi(-l,z) = Ai(z) 

Zi (i, z) = Ai(-z) - /Bi(-z) 

Zi(—z ,z) = Ai(-z) + zBi(-z) (18.22) 


5 c here should not be confused with the speed of light. 
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because the first grows exponentially, the second decays exponentially, and the last two correspond to traveling waves 
to the left or right. All can be expressed in a single equation as 

Zi(c, z) = - \(c - l)(c 2 + 2c + 3) Ai(c 2 z) 

4 

+ ~(c + 1)(3 c 2 -2c+ ljBi^z) (18.23) 

4 

Alas, such a response just begs the question 6 by failing to illuminate how any of the functions behave by defining them in 
terms of each other. Before one reflexively reaches for Watson [239], which any other time would be an excellent idea, 
make use of the fact that the Gamow factor for the triangular barrier is already known: 0 tri <x k 3 /F. One therefore has 
good reason to suspect that Zi(c,z) ~ exp(az 3/2 ). In the large z limit, insertion of that trial form into Eq. (18.21) suggests 
letting 9 a 2 - 4 c 2 = 0, so take a = 2c/3. Making use of Eq. (11.22) to hone intuition, a more realistic ansatz for Zi (c,x) is 

Zi(c, z) = exp (|cz 3/2 ) (18.24) 

2yfi V3 / 

where the coefficient z~ 1 ^ 4 comes from the factor k~ 1 / 2 in Eq. (11.22), and where G c (z) is (likely) a smooth function of z. 
Inserted into Eq. (18.21), the result is 

{ 16?2 (^) 2 + 8z(4c?3/2 ” ^ + 51 G c (z) = 0 (18.25) 

and is not evidently an improvement, but switching variables to z = «~ 2/3 and defining G c (z{u)) = H c (u) gives 

{36 u 2 dl + 24(3m - 2 c)d u + 5 }H c (u) = 0 (18.26) 


which is in the direction of simplicity. 

The convention used here, given that G c (z) and H c (u) are the same function, is that G c (z) will be used when z < 1 and 
H c (u) will be used when u < 1. The convention is therefore to facilitate the consideration of the limiting (small argument) 
case of H c (u): consideration of the function G c (z) is eclipsed by the availability of good polynomial expansions of Ai(x) 
and Bi(x) when |x| < 1, the forms of which will be taken up in Section 18.3. 

Observe that H c ( 0) (which is just G c (oo)) can be determined from the asymptotic forms of the Airy functions themselves 
[103], it is found that 

HA 0) = 2; HAO) = 1 

(18.27) 

H,i 0) = V2(l - 0; H_i( 0) = y2(l + 0 

The substitution of z(u) = ur 2/3 renders Eq. (18.24) as 

HAu) i / 2 c\ 

Zi (c,z(u)) = ~^—u l/6 exp ( (18.28) 

2\[n 

The introduction of the Zi(c, z) functions enables investigation of the smooth passage to the transmitted (e lkx ) and reflected 
( e~ lkx ) waves when the field vanishes and the electron energy is above the barrier (c 2 = — 1). That is, if Zi(c, z) is known, 
say at z = z 0 where x= 0, then how does it change a short distance away? It does so by 

« exp { 0 {(k 2 -k 2 0 - sfx) 3/2 - (k 2 - k 2 ) 3 / 2 } | 

« exp l-isik 2 - ^) 1/2 x] (18.29) 


6 "Begging the question" is a logical fallacy that is commonly, but incorrectly, thought to be failing to give an answer that pertains to the question. 
In the words of D.N. Walton, "To commit the fallacy of begging the question, two requirements must be met, in a given case: (i) there must 
be a circular sequence of reasoning, where the conclusion to be established is either identical to one of the prmises, or the premise in question 
depends on the conclusion, and (ii) the circular sequence of reasoning must be used illicitly in a context of dialogue (conversation) to escape 
the proper fulfillment of a legitimate burden of proof in that context." (see Chapter 16 of Hans V. Hansen and Robert C. Pinto, Fallacies: Classical 
and Contemporary Readings. University Park, PA: Pennsylvania State University Press, 1995, p. 230), which is what was done here. 
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as / -*■ 0, but that is the equation for a transmitted wave over a step barrier. Therefore, Zi(±z, z) are the analogs of e ±lkx when 
afield exists, making their usage, rather than the real valued Ai and Bi, a bit more useful for tunneling calculations in spite 
of the overhead required to understand them. 

The behavior of H c {u) for non-zero u is needed to progress. Differential equations with constant coefficients (what 
Eq. (18.26) resembles in the u -* 0 limit) motivates trying a solution of the form H c (u ) « H c (0)e~‘ p because terms of the 
form H~ l d u H suggest its utility. The leading order behavior of cp is then determined from the resulting equation 

36 u 2 ((d u cp) 2 - dl<p) + 24(2c - 3 u)d u cp +5 = 0 (18.30) 

The leading order behavior is obtained by neglecting terms of order u 2 and higher, giving cp « (5/72) ln[l - (3m/2c)|, but 
keeping only the linear order term in u from a Taylor expansion of cp does not work (try it). The next approximation 
just shy of a quadratic function is based on a Pade approximation, where cp is a ratio of polynomials in u, or, more to the 
point here, a function of the form <p(u) = Au/(Bu +1) (there is no constant in the numerator because of the expansion 
associated with the In term). Doing so, and neglecting terms of order u 2 , gives cp = 5zz/12(4c - 3m), or 

which is adequate for u small. 

Recall, though, that the definition of current in Eq. (9.60) involved more than just y/(x, t), it also involves d x ys(x, t ). 
Therefore, in addition to Zi(c, z), d x Zi (c, z) = Zi'(c, z) 1 is also required. The plane wave asymptotic forms e ±lkx , where d x e ±,kx = 
±ike lUx , suggest the analogy 

Zi'(c, z) = d x Z\(c, z) = Di(c, z)Zi(c, z) (18.32) 


such that in the limit f -* 0, Di(c, z) -*■ ±k (for c = ±1) or ±ik (for c = ±i). See how this is so. Observe that Zi(±l,z) is 
real because Ai(z) and Bi(z) are, and that Zi(-z,z) = Zi(z,z) t because Zi(—z, z)Zi(z, z) = Ai(-z) 2 + Bi(-z) 2 is also real. Lastly, 
because the Wronskian equation (Eq. (A2.27)) is real, Di(—z, z) = Di(z,z) t . These properties are useful. From the definition 
of z(x) in Eq. (18.20), it follows 

Then 




= 7 ( - ) (4c - m - 6 u 2 d u \n{H c (u)}) 


Because terms of order u 2 are neglected, the factor containing c) u \n{H c (u)) can be neglected because of the dependence 
of cp{u), resulting in the approximation 

sf'f* 

-ri — J. _c_f/ j. aa n a 


Because the Di(c, z) functions can also be expressed directly in terms of the Airy functions, this means the following 
approximations, valid for u 1, to combinations of the Airy functions results, the first two for c 2 = 1, the second for 


4 — m _ Bi'(z) 

4mV 3 ^ Bi(z) 

-4- m ^ Ai'(z) 

4m j /3 ~ Ai(z) 

_ 1 ^ 2/3 w 2 Ai(-z)Ai'(-z) + Bi(—z)Bi'(—z) 
2 ~ Ai(-z) 2 + Bi(—z) 2 

_ 1 _ (\/n) 

m i/3 w Ai(-z) 2 + Bi(-z) 2 


(18.36) 

(18.37) 

(18.38) 

(18.39) 


7 Although such a convention invites confusion with respect to the similar Airy functions Ai'(z) and Bi'(z) where the prime denoted differen¬ 
tiation with respect to argument, whereas for Zi, it denotes differentiation with respect to position x, the convention was established earlier 
[85, 202] and is retained for continuity. 
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In these four equations, the left-hand side is proportional to the ratio with s/ 1/3 of Di(l,z) and Di(-l,z) (for c 2 = 1), and 
Di(z',z) + Di(-z',z) and Di(z,z) - Di(-z,z) (for c 2 = -1), expressed using Eq. (18.35). In all, the right-hand side is the exact 
relation in terms of Airy functions. The limit that u -*■ 0 for c = i is an important case, giving 


lim Di(z, z) = 


£ 2/3 

mV 2 


(18.40) 


(where s = -1) or ik! in the parlance of Eq. (9.65), or itc( 0) here. The leading order approximations are compared to the 
exact Airy function relations in Figure 18.4, and the agreement is reasonably good. 


18.2.1 Over the barrier: D(k > k a ) 

Although Eq. (9.66) almost immediately suggests the form of t(k) for / > 0, walking through the evaluation explicitly is 
useful. The generalization of Eq. (9.65) becomes 


1 1 \ /l\/zi(z,z 0 ) Zi 

ik -ik)\r)~\n\i,z 0 ) Ziz„) 

where the onset of the barrier is at z 0 = v(0) 2 // 2/3 . The transmission coefficient t is therefore 



(18.41) 


m = 


2k 

{k — zDi(z, z 0 )) Zi (i,z 0 ) 


(18.42) 


The remaining piece of the effort to find the transmission probability D(k) is the evaluation of the outgoing cur¬ 
rent j tram , as needed in Eq. (9.69). As the outgoing wave yr k (x) corresponds to t(k)Zi(i,z), this means that the quantity 
Zi(— i, z)Zi'(i, z) - Zi(z, z)Zi'(-i, z) must be found (recall that for the Zi, the prime indicates d x , not derivative with respect to 
argument). Invoking Eq. (18.22), and bypassing even thinking of Di(c,z) by using the fortuitous Wronskian relation of 
Airy functions of Eq. (A2.27), it is found that 


(h/2mi) (Zi(—z, z) Zi'(z, z) - Zi(z, z) Zi'(-z, z)) _ s/V 3 
hk/m jrk 


(18.43) 


where s = -1 because a barrier that is increasing to the right has no transmitted waves. This elegant result, along with 
u. =//x 3 , enables finding D(k) = (f l / 3 /nk)\t(k)\ 2 to be 


D(Jc > k 0 ) = 


_ 16 kK _ = 16 kK 

(|Di(z, z )| 2 + k 2 )\Hi{u)\ 2 + 8 kK ~ n,(z) - &skic 


(18.44) 


n c (zz) = {k 2 + |Di(c, z(z<))| 2 } \H c (u)\ 2 


(18.45) 
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for transmission over the triangular barrier of Fowler and Nordheim. The introduction of the real function n c (u) should 
be noted: although apparently just a convenience here for D(k > k 0 ), it returns when considering tunneling in D(k < k g ) 
in an interesting way. 


EXAMPLE: In the limit that the field F -*■ 0, show that Eq. (9.69) is recovered from Eq. (18.44). 

SOLUTION: A triangular tunneling barrier entails s = -1. Vanishing field implies f -*■ 0. Because z is evaluated 
at x = 0, z = (k 2 - k 2 )// 2/3 = if 2 // 2 / 3 . Because u = z~ i/2 , u = //x 3 -> 0. As a result, Di(±/, z) -> ±ix and |H ! (0)| 2 = 4. 
Inserting the components. 


lim D(k > k 0 ) - 


\6kK 

4 (k 2 + k 2 ) + 8kx 


4kx 

(k+K ) 2 


(18.46) 


18.2.2 Under the barrier: D(k < k 0 ) 

The treatment of tunneling involves moving from above the barrier (k> k 0 ) to below the barrier k < k 0 , but that is not as 
simple as changing c in the previous analysis, although it would amount to simply changing z to -z in the arguments of 
the Airy functions if they were used as the basis states. In fact, usage of Ai and Bi for the particular case of the triangular 
barrier is a bit simpler [240]. The Zi approach will not reveal its utility until more general potentials that suffer small 
values of / (and therefore large arguments of Airy functions), as well as shifting locations where ys k (x) emerges from 
the barrier, are considered. The Zi approach, however, requires accommodation that the Airy approach does not when a 
wave function passes from under the potential to over it, or vice versa. The reason is because Zi(/, ?) corresponds, like e nx , 
to a traveling wave and therefore contains both Ai and Bi components, the coefficients of which have to be matched. That 
is, for the wave function going from under the barrier to over the barrier, the wave function changes from exponential 
growth/decay to oscillatory (and the reverse when going from over the barrier to under it). The transition between the 
two behaviors occurs where z = 0. 

For the triangular barrier (s = - 1) the relationship between the coefficients to the left (under) to those on the right 
(over) can be found using the same methods of Eqs (18.2) and (18.3) but with the Zi functions instead of the exp(±/foc) 
functions: 

M 1 (x 0 )|| 1 > = M ! .(x„)|| ! ) (18.47) 


where the subscript refers to the value c takes on, and because x 0 is such that k 2 — k 2 — sfx 0 = 0, then 


M c (x 0 ) = 


/ Zi(c, 0) 
\Zi'(c, 0) 


Zi(—c, 0) \ 
Zi'(-c, 0)/ ’ 


\Q = 


o 


(18.48) 


where the circumstance shown in Eq. (18.47) is for under the barrier to over (the wave function emerges from the usual 
triangular barrier), as in Figure 18.5. Were a wave function to enter a triangular barrier (s = +1), then the 1 and/ subscripts 
would swap places. For the simple triangular barrier, t 1 = 1, r, = r(k), t, = t(k ) and r, = 0, analogous to Eq. (18.6). 


Figure 18.5 Schematic of 1 
the red diamond. 



function emerging from under a barrier to over it (left) and over a barrier to under it (right). The change in c occurs at 
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For the under-to-over case (s = -l, c= 1 on the left, and c = i on the right), shown in the leftmost diagram of 
Figure 18.5, and in the notation introduced in Eq. (18.3), S(x 0 ) = so then 


/ Zi(l, 0) Zi(— 1,0) \ ' / Zi(z, 0) Zi(-z,0)\ 
\Zf(l, 0) Zf(—1,0) J \Zi'(z,0) Zi'(—z, 0) J 


(18.49) 


This apparently entails dreadful matrix inversions and multiplications, but there are several simplifications behind it. First, 
the inverse of the first matrix contains a Wronskian and can use Eq. (A2.27). Second, although matrix multiplication 
results in superficially difficult factors (e.g„ [Sf^)] 3 I = Zi(z, 0)Zi'(-l, 0) - Zi(— 1,0)Zi'(z, 0)) the arguments of the Zi and Zi' 
are 0 so that when converted to Airy functions using Eq. (18.22) there is a good deal of cancelation and what remains is 
a Wronskian itself! This means that all the elements of S(x 0 ) are either ±1 or ±z. In fact, direct computation shows that 
the transition matrices that must be introduced whenever c = 1 -» z (under to over) or c = z -*• 1 (over to under) are 


S(x„) = 



(under -*■ over) 


(over -* under ) 


(18.50) 


and yes, the factor of (1/2) is correctly attached to the second form (work it out). Thus, whenever a change in c occurs 
in the Zi functions (either 1 -*■ z or i -* 1), a transition matrix S(x 0 ) must be inserted, the form dictated by whether the 
wave function goes into a tunneling region or emerges from it. 

Incorporating the transition matrix 5(x 0 ) of Eq. (18.50) into Eq. (18.41) so as to treat the case when k < k 0 gives 


ik -ik)\r)~ \Zi'(l,z„) Zi’(-l,z 0 )) X \l 1 

Although the manipulations proceed analogously to the case k > k 0 , there is a bit more bookkeeping involved because 
Zi(±l,2„) is real. Not surprisingly, however, the final result, using Eq. (18.45) for fl ±l (zz), is 

D(k < k ) =-—- l6kK ^ - (18.51) 

0 nozzle 4 / 3 " + n_ 1 (zz)e- 4 / 3 « - 8skx 

where, remember, zz =//x 3 . Notice how n c (zz) changes when c=i-+c = ±l: the arguments of the exponentials in the 
underlying Zi functions become real, resulting in the ? ±4 / 3k terms. 

For the simple triangular barrier, it may be that the Zi approach feels unnecessarily busy, but when dealing with 
potentials having several regions, particularly those for which the arguments of the Airy functions become uncomfortably 
large (e.g., as occurs when the field is small), the use of the Zi functions makes for easier calculations, as shall be seen 
when potentials composed of several linear segments are considered. Still, we will explore the one case where Airy 
functions themselves are simpler next. 



18.2.3 Airy function method 


Nothing is more simple than greatness; indeed, to he simple is to be great. 


- Ralph Waldo Emerson 8 


Using the Airy function definitions behind the Di and Zi functions gives an equivalent form to Eq. (18.45) for the 
case of the triangular barrier where s = -1, which has the virtue of being simple and therefore compact [196, 240] in 


s Ralph Waldo Emerson, An Oration delivered before the Literary Sodeties of Dartmouth College, July 24, 1838, http://www.emersoncentral 
.com/litethics.htm (09/03/2009 18:36:13). 
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that there are only two regions and one site of matching the wave function and its derivative, so that ±z is enough to 
determine which Airy function is used. The straight Airy function version makes use of 


|Zi(z',z)| 2 = Ai(-z) 2 + Bi(-z) 2 (18.52) 

|Zi'(z,z)| 2 =/ 2/3 (Ai'(-z) 2 + Bi'(-z) 2 ) (18.53) 


Zi(l,z) 2 + Zi(— 1, z) 2 = Ai(-z) 2 + Bi(-z) 2 (18.54) 

Zi'(l ,zf + Zi'(-l.z) 2 =/ 2/3 (Ai'(-z) 2 + Bi'(-z) 2 ) (18.55) 


so that 


D(k) = 


_ 4 k / 1 / 3 _ 

;r[k 2 (Ai 2 + Bi 2 ) + / 2 / 3 (Ai' 2 + Bi' 2 )] + 2k/ 1 / 3 


(18.57) 


where, for compactness, the arguments of Ai(±z) and Bi(±z) and their derivatives are suppressed, but positive argu¬ 
ments are associated with under the barrier (k < k 0 ) and negative with over (k > k 0 ). Now the continuity at the bar¬ 
rier maximum, where k = k 0 and z 0 = 0, is fairly obvious: from the values of the Airy functions for zero argument 
(Section A2.6), 

4 k ./ 1 / 3 

D(k = k„) = -r--- (18.58) 

1.5839k 2 + 2/ 1 / 3 k„ + 0.84179/ 2 / 3 


With Eq. (18.58), the Kemble approximation can be appreciated for the simplest imaginable barrier: the behavior of 
D(k 0 ) is shown in Figure 18.6 for the usual metal parameters of p = 7 eV and <t> = 4.5 eV. The thin gray line therein 
corresponds to D = 1/2: in the range of field typical of field emission, D(k 0 ) conveniently hovers around the Kemble 
assumption (D(Jc g ) « 0.5) for the triangular barrier treated by Fowler and Nordheim [36], Its doing so bodes well for the 
general thermal-field equation of Eq. (17.61) in the Q -*• 0, or triangular barrier, limit. 

The reasonableness of the Kemble approximation for the triangular barrier D tri is consequently of a different charac¬ 
ter than for the rectangular barrier D rec : at the barrier maximum, the transmission probability D tri {E) smoothly passes 
from E</i + ® to E>f4 + <£> because the underlying Airy functions in the combinations 9 Ai 2 + Bi 2 and (A/) 2 + (Bi') 2 are 
well-behaved, smooth, and finite as their argument passes from negative to positive values, as apparent in Figure 18.7 
for metal-like parameters at several fields, generated using the algorithm of Section A3.10. In the tunneling regime. 


Figure 18.6 The transmission 
n = 7 eV and ® = 4.5 eV are 



presumed. The thin gray line corresponds to D- 1/2, as suggested by the Kemble approxij 


9 The argument x is being used for the general case, as is done in Section A2.6, to allow for positive and negative values, in contrast to z, which, 
in the present section, is only positive. 
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Figure 18.7 The 

and® = 4.5 eV< 



presumed. The thin dashed gray line corresponds to E = a + ®. The black line labeled Step corresponds to Eq. (18.46). 


the Gamow-like exponents of ±(2/3)z 3 / 2 likewise approach the origin z = 0 in a smooth way, so that the linearization 
of 0 tri (E) there is well-behaved. The approximations made in the development of the general thermal-field equation of 
Chapter 17 if applied to a triangular barrier would therefore be reasonable, as they should be given that the triangular 
barrier is the Q -*■ 0 limit of the image charge barrier. 

18.2.4 A Fowler-Nordheim refrain 

The day returns, but nevermore 
Returns the traveller to the shore, 

And the tide rises, the tide falls. 

- Henry Wadsworth Longfellow 10 

A refrain is a poetic device of repeating a line or phrase at the end of stanzas (as "and the tide rises, the tide falls" 
is in Longfellow's poem of the same name) in order to draw attention to an idea or notion through its repetition, 
sometimes to emphasize differences between sections of the poem, or to make the content of the refrain take on 
more meaning. The technique is likewise powerful in the development of methodologies to explore particularly 
useful problems in physics by tackling the same problem with different approaches, that is, through repetition with 
variation. The original Fowler-Nordheim equation for the triangular barrier, first encountered in Section 13.1, is a 
grand example of the same problem that can be considered in multiple ways. Treating it now will be through limiting 
forms of the components of D(k < k 0 ) because of the magnitude of the quantities involved, in order to ascertain 
the form of the coefficient of J PN (F) that eludes identification using only the Kemble approximation and Gamow 
factor. 


EXAMPLE: Determine the magnitude of the parameters k, k a , f, x, z and u for metal-like conditions (ft = 7 eV and 
O = 4.5 eV) under the assumption that the incident electron is at the Fermi level, the slope of the triangular barrier 
corresponds to 6 eV/nm, and x 0 occurs at the left edge of the barrier for which x 0 = 0. 

SOLUTION: Use 2m/Ti 2 = 26.247 (1/eVnm 2 ). Then/ = 2mF/Ti 2 = 157.48 nm“ 3 , k 2 F = 2mn/h 2 = 183.73 nm ~ 2 ,k 2 = 
2 + <&)/h 2 = 301.84 nmr 2 , and K 2 = k 2 0 -k 2 F = 118.11 nmr 2 . From z = x 2 // 2 / 3 = 4.0501, it follows u = z~ 3 / 2 = 
//x 3 = 0.12269. 


10 Henry Wadsworth Longfellow, "The Tide Rises, the Tide Falls" in M.L. Rosenthal, Poetry in English: An Anthology. New York: Oxford University 
Press, 1987, p. 1880. 
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The smallness of u (equivalent to the largeness of z) entails that H(±l, u) m2 and Zi( 1, z) » Zi(— 1, z). The leading order 
approximation of Di(l,z) m ~(f/u) l/3 = -k, as obtained from Eq. (18.35). And so, for Cu-like parameters, 


4 k / 1 / 3 


4 he 

= (k2+^) C 


(18.59) 


But k 2 =k 2 0 - k 2 when k = k F , and so, using k 2 = (2m/h 2 )(p + O), k 2 = 2mpi/Ti 2 , and/ = 2mF/Ti 2 , it follows, as anticipated 
inEq. (13.9), that 


4 

» tB= ' ,) = i^ exp 


/ 4 V2m® 3 \ 

V ) 


(18.60) 


where the argument of D{E) has returned to energy (E = h 2 k 2 /2m) in a one-dimensional problem, to facilitate the 
correspondence to Section 13.10. Recall that in the development of the triangular barrier Fowler-Nordheim equation 
(Eq. (13.10)), - lnCDfE)) is taken as linear in E, so the weak variation of the coefficient of Eq. (18.59) is neglected, leaving 
only the exponential of 0 tri (E = p) of Eq. (13.5). 


18.3 Triangular barrier: numerical 

What a fine creed that is! So far as I can see, your religion consists of arithmetic. Men, I must say, can get some weird notions in their 
heads, and people who've studied a lot aren't always a lot wiser. 

-Moliere" 

Generating D(E) for a triangular barrier using Eq. (18.57) and the code in Section A3.10 relies on the numerical evalu¬ 
ation of the Airy functions Ai(x) and Bi(x) and their derivatives. These function calls are generally a part of programming 
languages such as MATLAB or the IMSL libraries used by FORTRAN. Two problems arise. First, one may not have access 
to either the programming languages or libraries that possess such capabilities, and, second, even if one does, the argu¬ 
ments of the Airy functions required (as can be the case for the more complex potential barriers composed of multiple 
segments) are such that numerical sensitivity or errors are introduced that can be overcome by reliance on the Zi func¬ 
tions, or rather H c {u) and Di(c, z) if they were available. There are ways to obtain these functions, and there is advantage 
to doing so: raw computational evaluation using Airy functions themselves will require exponentially large (or small) 
terms, but clever recasting using H c and Di allows for a more intuitive understanding using functions that do not become 
unmanageable, and finding them becomes a good exercise in how to deal with functions that are rather good at hiding 
their behavior. 


18.3.1 Large argument evaluation 


Life is what the least of us make most of us feel the least of us make the most of. 


- Willard Quine 12 


The form of Eq. (18.26) indicates that H c (u) may be represented as a power series in u and that the coefficients of the 
power series can be related to each other, a technique known as the method of Frobenius [60]. That is, substituting 


H c (u) = ^ bjii (18.61) 

into the differential equation and collecting terms associated with various powers of u requires that each coefficient of 
each different power of u separately be zero because the value of u is arbitrary. Of course, that is easier said than done. 


"Moliere, Don Juan (trans. Richard Wilbur). Harcourt: San Diego, 2001, Act 3, Scene 1, p. 73. 

12 Willard Quine, as quoted in John D. Barrow, The Constants of Nature: From Alpha to Omega - the Numbers That Encode the Deepest Secrets of the 
Universe, 1st American edn. New York: Pantheon Books, 2002, p. 141. 
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Proceeding slowly, insert the power series of Eq. (18.61) into Eq. (18.26) and consider all terms containing a particular 
by it will be found that 

367(7 “ 1) dbj + 24 j (3 u - 2c) lJ^bj + 5 u’bj = 0 (18.62) 

Each one of these terms is in a summation over ;, and so each summation can be re-indexed so that the powers of u are 
the same rather than the index of bj. Doing so, one finds 

(36 j 2 - 36 j + 5)b j _ 1 tj- 1 = A&cjbjU!- 1 (18.63) 


an equation from which bj can be found if is known [85, 241]: 

, -(6J-5X6/-1), 


or, using the asymptotic forms [103] to find b 0 as done in Eq. (18.27), 

1 V 2 ' 
b 0 = H c ( 0) = i {(c ■+ 3)(c 2 + 1) + 2\/2(c - l)(c 2 - 1) } 


(18.64) 


(18.65) 

(18.66) 


which corresponds to Eq. (A2.34). This is fairly nice until one thinks about it: for large;', bj ~ (3;/4)^_ 1 , which means bj 
for; large behaves as (3/4 yj\. That's not good. Eq. (18.61) is a devilish divergent series, reminiscent of those that Niels 
Abel inveighed against in Section 6.3. It does not mean that a Taylor series expansion about u = 0 with only the first few 
terms kept is precluded, but rather it means that the infinite series for finite u to find H c (u) should not be done because 
it will not converge. 

If the truncated Taylor expansion leaves one ill at ease, there are other ways. The values of H c { 0) and H c { 1) are known 
(the latter from the next section) and so it is possible to use finite difference methods [242] to make headway in the 
solution of the differential equation for H c (u ) [85], In the general case, let f(x) be defined on N equi-spaced points 
Xj = ;Ax such that f(x } ) = Further, let the boundary conditions f 0 and f N+] be known. The differential operators d/dx 
and d 2 /dx 2 are then approximated in a second-order central differencing scheme discussed in Section Al.3.2, by 

< 18 - 67 > 

- 2 <-+W ( 18 ' 68 > 

Proceed now in two steps. First, use the finite difference representation of d/du and d 2 /du 2 in Eq. (18.26). Second, 
group terms according to which term of H c (Uj. t ), H c (Uj), or H c (u j+l ) that they multiply. Third, since 0 < u < 1, therefore 
Am = 1 /(IV + 1). An equation of the form 


DjH c (x hl ) + CjH c (Xj) + UjH c (x j+1 ) = 0 


(18.69) 


results, where the notation D, C, and U refer to "down", "center", and "up", in anticipation of the diagonals they will 
occupy in the matrix equation to be developed. Equations of the sort that Eq. (18.69) is can be put into a matrix equation 
where the two known values, H c (x 0 ) and H c (x N+l ) serve as the boundary vector terms. The equation will resemble 

0\H) = \B) (18.70) 


where 


c j $4 0 ■ ■ ■ 0 

d 2 c 2 u 2 ■ ■ ■ 0 

0= ■ Dj Cj Uj 


0 ■■■ D n _ 

0 ■ ■ ■ 0 


(18.71) 


j jv- 1 






224 Chapter 18 



'H c (u ,)' 


-DyH^oj 

1 H) = 

H c (Uj) 

; 1 b) = 

0 


H c (u n ) 


-C n H c (1) 


where u 0 = 0 and u N+l = 1 are used in the boundary value vector \B). The following relations hold for Dp Cj and Up 

Uj = 12{—2c(N + 1) + 3j(j + 1)} (18.73) 


Cj = -72/ + 5 

Dj= \2{2c(N + 1) + 3j(j - 1)} 


(18.74) 

(18.75) 


where any constant factor (such as a denominator (N + 1)) can be eliminated from both sides of the matrix equation 
given by Eq. (18.70). 

Efficient algorithms exist to solve tridiagonal matrix equations, that is, to find \H) = 0~ l \B) by using an algorithm 
akin to Gauss-Jordan elimination [236], a procedure well known from elementary linear algebra [91], The process 
can be made even more elegant by noting that because there is no integer j for which Cj vanishes, numerical checks 
designed to prevent a divide by zero situation, which would otherwise arise if one (or more) of the Cj were to equal 
zero, do not have to be included. The streamlined tridiagonal matrix algorithm unfolds in the following sequence of 
steps: 


Downward 

Require: C, U, D and B are vectors of length N. X is a scalar. 
Bj = 0 ;B 1 = -DyH'. (0); B N = -U N H C ( 1) 
for = 2 to N - 1 do 

X=Cj- DjUj _! 

Cj = Cj/X 
Bj = -DjBj_jX 

B n = (B n - D n B n _\)/(C n - Dj.C^ ) 

end for 

B n = (B n - D N B N _^)/(C N - D n C n _ j) 


Upward 

h n = b n 

for j = 2 to N - 1 do 

Hj = Bj - CjH j+l 

end for 

where, first, in the downward elimination, the instruction Bj = —DjBj_ l /X is used instead of the general Bj = (Bj — 
DjBj_ x )/X because all Bj are initially zero except at; = 1 or N, and second, the dummy term X need not be saved after it is 
used to update Bj. The labeling of "downward" and "upward" reinforces that the lower diagonal D is being eliminated in 
a Gauss-Jordan procedure followed by the upward elimination of the modified upper diagonal U: performing the same 
operations on the boundary vector B is what solves the matrix equation. The code is implemented in Section A3.11. The 
behavior is graphically shown in Figure 18.8. 

As a consequence of the above numerical algorithms, H c (u) may be obtained for u < 1. Values of H c (u) for u between the 
Uj points, an example of which is in Table 18.1, are obtainable through interpolation [243], and finite difference methods 
may be employed to calculate derivatives of H c (u) so as to obtain Di (c,z(u)) from Eq. (18.34). The details of doing so are 
where the pain is [85, 197, 199], If the Uj are evenly spaced (u J+l - Uj = A u), and / = H c (Uj), then, for example, a simple 
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Figure 18.8 The functions Hfu), H_fu), and S(H_.(m)), where 91 and 3 denote real and imaginary parts. 


Table 18.1 Values of H c (u), where 91 and 3 denote real and imaginary parts. 


U, 

H,(u) 

(u) 

«[H,-(o)l 

S[H,(u)] 

0.00 

2.0000 

1.00000 

1.4142 

1.4142 

0.10 

2.0229 

0.99031 

1.3985 

1.4276 

0.20 

2.0524 

0.98178 

1.3817 

1.4382 

0.30 

2.0895 

0.97411 

1.3648 

1.4463 

0.40 

2.1260 

0.96711 

1.3482 

1.4523 

0.50 

2.1555 

0.96067 

1.3324 

1.4567 

0.60 

2.1767 

0.95468 

1.3172 

1.4597 

0.70 

2.1901 

0.94907 

1.3029 

1.4618 

0.80 

2,1971 

0.94381 

1.2892 

1.4630 

0.90 

2.1992 

0.93884 

1.2763 

1.4636 

1.00 

2.1975 

0.93413 

1.2640 

1.4637 


quadratic interpolation can make use of 13 

KUj + P^u) = 1 - p)hj _, + (1 - p 2 )hj + |p( 1 + p)h j+l (18.76) 

which has the required behavior that for p = -1,0,1, then m -» u j _ l , Uj, u j+l and h{u) -+ h j _ ] , hj, h j+l , as they should. 

18.3.2 Small argument evaluation 

Logic, n. The art of thinking and reasoning in strict accordance with the limitations and incapacities of the human 
misunderstanding. 

- Ambrose Bierce 14 

One might suspect that a comparable ordeal to that of evaluating H c {u) awaits for the calculation of G c (z), but instead 
existing small argument series expansions of Airy functions [103] that are suitably truncated can be used [85], The 
algorithms here preserve the accuracy required to perform numerical tunneling calculations when the time comes. For 


13 See the Lagrange three-point interpolation formula in Davis and Polonsky, Chapter 25 in Numerical Interpolation, Differentiation, and Integration 
[103]. Although more complex formulae may be used with different orthogonal polynomials, the least tiring way to achieve greater accuracy 
is simply to decrease A u by increasing the number of Uj. 

14 Ambrose Bierce, The Devil's Dictionary. New York: Dover Publications, 1993, p. 79. 
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small argument such that -1 < x < 1 (where x = ±z) 


Ai(x) = 


Bi(x) = 


P(x) _ S(x) 

3 2 / 3 r(2/3) 3*/ 3 r(l/3) 

P(x) | -S(x) 

3 i/6f( 2 /3) 3“ 1/6 r(l/3) 


The truncated polynomials P(x) and S(x) are then given by 




(18.77) 

(18.78) 


(18.79) 

(18.80) 


In both cases, five-digit accuracy is obtained by setting N = 4. Derivatives of the Airy functions are then obtained by 
finding P'(x) = d x P(x) and S'(x) = d x S(x), which in turn enable the evaluation of Di(c, z), or the functions G c (z) of Figure 18.9 
and Table 18.2. 



Figure 18.9 The functions G^z), G_y(z), 9tand 3(G_,(z)). The value of the curves as they exit the right-hand side of the graph correspond to the 
values of the curves exiting the right-hand side of Figure 18.8. 


Table 18.2 Values of G c (z). 

z 1/3 6(1,*) G(-1,z) »t(G(-/,z)) 3(G(-/,z)) 


0.00 0.00000 

0.10 0.38791 

0.20 0.65542 

0.30 0.89836 

0.40 1.1353 

0.50 1.3736 

0.60 1.6113 

0.70 1.8350 

0.80 2.0220 

0.90 2.1471 

1.00 2.1975 


0.00000 0.00000 

0.22365 0.22396 

0.37438 0.37828 

0.50160 0.51764 

0.61006 0.65120 

0.70072 0.78123 

0.77434 0.90574 

0.83231 1.0202 

0.87667 1.1199 

0.90981 1.2014 

0.93413 1.2640 


0.00000 
0.38736 
0.64831 
0.86777 
1.0523 
1.2012 
1.3133 
1.3899 
1.4357 
1.4577 
1.4637 
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CHAPTER 19 

Transfer matrix approach 


"This earthquake is nothing new” replied Pangloss, “the town of Lima in America experienced the same shocks last year. The same causes 
produce the same effects. There is certainly a vein of sulphur running under the earth from Lima to Lisbon ." “Nothing is more likely, ” 
said Candide, “but oil and wine, for pity's sake !" "Likely! * exclaimed the philosopher. “1 maintain it's proved ." 

- Voltaire 1 

The 8.5-9.0 magnitude earthquake on All Saints' Day (November 1, 1755) some 200 km southwest of the coast of 
Portugal, and the subsequent tsunami it generated, all but obliterated Lisbon and caused the death of several tens of 
thousands. Voltaire used the occasion to savagely lampoon the adage of the German polymath Gottfried Leibniz (the 
co-inventor with Newton of calculus), that this was "the best of all possible worlds" by making Dr. Pangloss, who pon¬ 
tificated about such hokum, an object of scorn. A fatal defect of Pangloss (aside from his insufferable optimism) was a 
tendency to draw conclusions on flimsy grounds. That is something to be avoided in the claims about how D(k) behaves, 
particularly for rounded barriers. 

Specifically, it may seem reasonable to expect that the qualitative behavior of the transmission probability for a 
parabolic barrier lies between the rectangular barrier (Figure 18.2) and the triangular barrier (Figure 18.7) results, 
because as E(k) approaches the barrier maximum the parabolic Gamow factor 0 decreases faster than for the square 
barrier but slower than for the triangular barrier. It may then seem likely that the parabolic barrier transmission prob¬ 
ability is described by the Kemble approximation. But one should not end there with such a Panglossian claim, even 
though the relation between the parabolic barrier and the Kemble approximation was behind the general thermal-field 
equation of Eq. (17.61), largely because of the similarity of the image charge potential (at its apex) to a parabolic potential. 
Although Froman and Froman [244] show that the parabolic potential transmission probability is given by the Kemble 
form (their Eq. 9.12b), the mechanics to do so may leave one crying for more wine than oil. What can be done without 
suffering through an analysis more onerous than that deployed for the Airy functions? 

Numerically, there are some simple ways to proceed, and they will be referred to collectively as the transfer matrix 
approach (TMA) methods. There are two flavors: the more palatable approach [183, 245-248] uses a generalization 
of the plane wave basis behind Eq. (18.3), and is suggested by Figure 19.1. The Airy function generalization [85, 196, 
197, 249-252] is, however, better for general potentials, particularly those for emission into vacuum under an applied 
field, and require far less discretization of the potential for regions where the potential can be approximated by a linear 
function. Thus, the Airy method is more powerful and accurate, but takes more work to set up. The numerical results 
of these methods will add gravitas to the otherwise Panglossian "proof" based on the rectangular and triangular barrier 
results that the Kemble approximation for a parabolic barrier is good. It will become even more useful and even obligatory 
when more complex ion and resonant potentials are undertaken. 


19.1 Plane wave transfer matrix 

Paradox though it may seem - and paradoxes are always dangerous things - it is none the less true that Life imitates art far more than 
Art imitates life. 

- Oscar Wilde 2 

Representing a potential region as closely packed rectangular barriers of various sizes enables the wave function in 
those regions to be represented as plane waves, which are easy to manipulate, allowing for straightforward reasoning 
about the nature of tunneling and resonances. Proceed systematically. As in Figure 19.1, let a general potential V(x) be 
defined at the points Xj (not necessarily equi-spaced) such that Vj = V(Xj). Let the potential between the points x ; - and x j+] 
be constant and equal to Vj. Schrodinger's equation then has solutions of the form ip k (x) = exp(±ix(k)x) or exp(±v(L)x), 


1 Voltaire (trans. John Butt), Candide. Penguin Books, Middlesex, England, 1947. page 34. 

2 Oscar Wilde, The Complete Collection (the Picture of Dorian Gray, 14 Short Stories, 9 Plays, All Poems, Selected Essays and Letters), Kindle eBook, 2013, 
Essay: The Decay of Lying (1889). 
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Figure 19.1 Labeling assodated with a two-region potential barrier with constant potential segments. 
The If.) correspond to the two-component vectors containing the transmission (t,) and reflection (r,) 
coefficients. The blue squares identify the potential points Vj = V(x ; ). The red diamonds represent where 
the matrix equations M||) are matched by matrix equations analogous to Eq. (18.2). Compare to 
Figure 19.9. 


for electrons with energy E > Vj or E < Vj, respectively, and the wave function will be a combination of them such that 
the wave function and its first derivative are elegantly expressed in the matrix equation of Eq. (18.1), with the coefficients 
inside |£„) being such that the yr(x) and d x yr(x) are continuous at x = Xj. Although plane waves are simple, that is not the 
only choice: two other linearly independent solutions are sines and cosines [253], which entail different advantages but 
are otherwise employed in much the same way. 

If there are n flat segments that approximate V(x), then the generalization of Eq. (18.3) will be (using a more attentive 
notation) 

if 0 >={n%^ +1 )} i4+i> s s «i4+i> (i9.i) 


where x J+1 is the location where the matching of the wave function and its first derivative is occurring, r J+] refers to k to 
the right of it, and Kj to the left, where 

hKj = yjlm (E-V (x<x < x j+l ) - yjlm (E - Vj) (19.2) 

When E < Vj, then x ; - -*• iKj. Although the notation c = ±l,±z might be introduced as for the Zi functions, for the plane 
wave approach, such extra bookkeeping turns out to be unnecessary because k(x) does not switch from real to imaginary 
(or vice versa) between x j+l and when there is no field term. Consequently, S can be expressed in terms of the center 
and difference wave numbers K = (k a + k b )/2 and A k = (k a - k b )/2, in terms of which 


S(k a ,k b ;x)= 


{ Ke~ iAkx 
yAA: e ,Kx 


Ak c“ !70t \ 
Ke tAkx ) 


(19.3) 


The calculation of the matrix elements is straightforward (an example of the numerical algorithm is in Section A3.12.1). 
Once the product of matrices that defines S„ is computed, then [196] 


1 k tram 
[S„] hl k mc 


(19.4) 


where the notation [M]y denotes the z'th row, jth column of the matrix M, k inc is the incident wave number (left-hand 
side) and k trans is the transmitted wave number (right-hand side) if both left and right are at constant potential. It is 
convenient to choose the potential of the left- and right-hand sides to be the same (y(±oo) = 0), so that k tram = k inc = k, 
and so D{k) = \t{k)\ 2 . If the potential is under bias (as it would be, for example, for a resonant tunneling diode (RTD) 
through which current is flowing), then the factor k tram /k inc would have to be reinserted. 

To evaluate the matrices at Xj, multiply them in the correct order, and use the appropriate boundary conditions 
expressed in the |£) vectors (an example of the numerical algorithm is in Section A3.12.2). Applied to the rectangu¬ 
lar barrier leading to Eqs (18.10) and (18.11), a single barrier is specified by two points (where it begins and where it 
ends) and one potential (how high it is). The numerical evaluation of D{k) then requires three Xj and a corresponding 
number of Vj, as in Figure 19.2. Using Eq. (19.4) and the algorithm of Section A3.12.2 results in Figure 19.3, showing 
that the rectangular barrier results are recovered. 
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circles designate the other (rightmost) positions and potentials of each constant potential segment. A point prioi 
location is needed, and the potential after the barrier is chosen to be 0, although it need not be. 



Figure 19.3 The transmission probability D(k ) associated with the potential profile of Figure 19.2 for mjm = 0.25 (where m n is the effective mass) and 
V„ - 0.3 eV. 


19.1.1 Kemble approximation revisited 


Let your reason serve/To make the truth appear where it seems hid... 


- William Shakespeare 3 


The single barrier can now be altered by putting several barriers alongside each other and tailoring their heights for 
some desired profile. When their heights vary quadratically with how far they are from the center of the barrier, the 
parabolic barrier behind the Kemble approximation is modeled. Thus, let the barrier envelope be a parabola given by 

y(x) = y„ jl-(0 2 j (19.5) 

for a barrier of width 2 L at the base. Assuming that 16 barriers are placed adjacently, the left boundary of each barrier 
being at Xj and the height of the barrier being Vj = V{x } ), the barrier constructed of flat segments will resemble Figure 19.4. 
Of course more segments could be used, and therefore greater fidelity to the parabolic barrier achieved, but 16 segments 
is adequate for present purposes. 

The D{k) evaluated using the TMA is to be compared to the Kemble approximation using the quadratic Gamow factor 
9 qua (E) of Eq. (11.19). The present formulation differs slightly (Eq. (11.19) measured the barrier height from p and took 
the base width as L 0 , whereas now the barrier height is V 0 and the base width is 2 L). Those small changes result in 6 qua (E) 


3 Ref. [37]: Measure for Measure, V. 1.65-66. 
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Figure 19.5 Comparison of D(k) calculated using the TMA applied to V(x,) of Figure 19.4 and the Kemble approximation of Eq. (17.9) using Eq. (19.6) 
for 9 for r = mjm = 0.25, L = 5 nm, V 0 = 0.3 eV. Left, wide range of k; right, focus on k a k 0 . 


becoming 

<W*)= ' (V 0 -E) (19.6) 

where the electron mass m -* m n is an effective electron mass, if transport is occurring in bulk material rather than in 
vacuum. Evaluating D(k) using the TMA and comparing it to the Kemble approximation 1/(1 + exp(0 qua {E))) results in 
Figure 19.5. The agreement is fairly good, even for the lumpy, ill-tempered potential shown in Figure 19.4, which gives 
fairly reasonable agreement with the Kemble form. Increasing the number of rectangular barriers that make up V{xf) 
improves the correspondence. Therefore, without having actually proven the conjecture, the grounds for believing that 
the Kemble approximation works well for parabolic potentials are substantially strengthened. 

One matter that creates apprehensions is why, after long using metal work functions of several eV and a free space 
electron rest mass, would one switch to smaller effective masses and shorter barriers? Superficially, such changes help 
spread out the D(k ) that results, making it more appealingly rounded near k a k 0 and less step-like as one crosses from 
under the boundary to over it. But there is another reason. The TMA, as used here, owes much to the very interesting 
study of superlattices by Tsu and Esaki [183], the two-barrier version of which is the RTD. 4 RTDs typically have barriers 
and wells composed of, for example, GaAs/AlGaAs layers, for which mjm, V 0 and L are small. Generic values of those 
parameters for semiconductors are used here. The interest in RTDs is because they exhibit negative differential resistance, 
wherein an increase in voltage across the barriers results in a seemingly paradoxical decrease in current through it: the 
increase is caused by energy levels in the well region for which Dfk) approaches unity for k< k a [126, 130, 255, 256], 
A two-barrier potential is shown in Figure 19.6, created using the methods of Section A3.12.4. A barrier such that V 0 = 
0.3 eV and a rest mass of m n /m = 0.25 corresponds to k 0 = 1.403 1/nm, so the sharp peak (D(k) —>■ 1) at k = 1.3609 1/nm 


4 Esaki shared the Nobel Prize in 1973 for his work on tunneling and resonance behavior. Tellingly, his Nobel lecture [254] begins with the 
triangular barrier of the Fowler-Nordheim equation. 
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k [1/nm] 

Figure 19.7 D(k) calculated using the TMA applied to Vty) of Figure 19.6 for r = mjm = 0.25, L = 5 nm, V 0 = 0.3 eV. Top, linear y-axis scale to show 
the oscillatory behavior for k > k and the suggestion of bands, or regions, where D(k) drops precipitously; bottom, log-scale showing the resonance 
states for k < k 0 , where narrow spikes in the transmission probability appear. 


indicates a resonance level in Figure 19.7. Others are visible in the second (semi-log plot) figure: the resonances appear 
as narrow Lorentzians (Section A2.7). The peaks at the lower k resonances are generally larger than shown, a conse¬ 
quence of the coarseness of the spacing of points chosen in the figures. The behavior as parameters are varied exhibits 
a fascinating evolution, particularly when the number of barriers, the heights, and the right-hand boundary value are 
changed. 

So far, D(k) has been investigated under no bias conditions (the right boundary potential is at the same level as the 
left boundary potential). An RTD under zero bias does not have a net current flowing through it. In contrast, emission is 
enhanced when an adsorbed atom or incident ion affects a field emission site by providing a resonance level and leading 
to increased tunneling effects [257-259]. The RTDs differ from field emitters in that there is a supply function from both 
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k [1/nm] 



Figure 19.8 A comparison of the two-barrier and four-barrier superlattice potentials. Top, for k<k observe that 0 4 (&)P 2 a Z> 2 (k); bottom, for k > k a 
observe that regions, or bands, appear to be forming, an effect that becomes more pronounced as the number of barriers increases. 


sides of an RTD (recall the Tsu-Esaki formula of Eq. (11.2)), and so at zero bias, the same current flows in each direction 
(k > 0 flows to the right, k < 0 to the left). Under bias, the right-hand side is brought to a lower potential energy level, 
and less current flows through the double barrier from right to left. As long as electrons from the left-hand side are at an 
energy such that they can resonantly tunnel, they will contribute significantly to the current. When the resonant level 
drops below the left-hand side conduction band minimum, the contribution of the resonant level declines and so the 
current versus voltage declines, that is, the differential resistance (change in current with applied bias) becomes negative 
rather than positive as it normally is. 

More unusual behavior can be brought about. For a three-barrier potential, if the wells are such that the resonant 
levels align, then the transmission through the resonant level will continue unabated, but the addition of more barriers 
means that the sequential tunneling (first through one barrier, then the next, and so on) will be increasingly suppressed. 
Doubling then quadrupling the barriers and watching the behavior of D(k) demonstrates the effect nicely [85], but a simple 
case catches the effect. Taking the barrier of Figure 19.6 and doubling it (accomplished by setting N b = 4 in Algorithm 
A3.12.4), it is expected that D 4 (k) (the subscript indicating the number of barriers) would be comparable in magnitude 
to D 2 (k) 2 or, equivalently, D 4 (k ) 1/ ' 2 « D 2 (k) for k < k 0 and away from the resonance values, as seen in Figure 19.8. 

But there is more. Above the barriers (k > k 0 ), greater structure is put on D(k) as the number of barriers increases. 
Transmitted and reflected waves interfere with each other so that for some values of k, D(k) drops, as shown in the 
bottom plot of Figure 19.8. When many barriers are present, the size of the drop increases, and regions, or "bands", 
open up where transmission occurs between regions where it is suppressed, and anticipates the Kronig-Penny model of 
Section 21.3. 


19.1.2 Gamow factor revisited 

.. .there is a pleasure in recognizing old things from a new point of view. 


- Richard P. Feynman [260] 
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Bizarre quantum effects should not overshadow the relationship between the behavior of D(k) revealed by the TMA 
compared to the fluid-like description of Section 11.2 introducing the Gamow factor. It may not be clear how they are 
related, but there is a simple way to see it. Reconsider the TMA for the single rectangular barrier. Let the left side of the 
barrier at x = 0 be labeled by a 1 index, and the right at x = L by a 2 index. Then, writing out the matrix equation in its 
entirety without the aesthetic simplifications of Eqs (19.1) and (19.3): 



but instead of grouping the first two matrices into 5j and the second two into S 2 , consider the middle two matrices and 
call them B, or 


"Ci)C 


When kL » 1, then cosh(xL) « sinh(xL) « e Kh jl, and so 


( cosh(xL) -k 1 sinh(xL) 
-k sinh(xL) cosh(xL) 


/ cosh(xL) -k 1 sinh(K-L)\ ^ e K,< ( k -1 
l — k sinh(xL) cosh(xL) J 2k l — k 2 k 


(19.8) 


(19.9) 


Consequently, the term e Kl appears as an overall multiplier, and so, apart from the k and k odds and ends from the 
other matrices, D r (k) = \t(k)\ 2 oc e~ 2lcL , where the index on D is for the number of rectangular barriers from which it is 
composed. But then, a barrier represented by two adjacent rectangular barriers would have a similar factor for each, and 
so D 2 (k) oc Generalizing to N rectangular barriers. 


D N (k) 


exp 



(19.10) 


In the limit that all the Lj -*■ dx and the summation moves over to an integral as per Eq. (A1.10), then the argument of 
the exponent simply becomes the familiar Gamow factor of Eq. (11.14). Thus, the Gamow factor naturally emerges from 
the TMA, where it was hidden all along. 


19.2 Airy function transfer matrix 

... mathematicians may be completely repelled by the liberties taken here.. .just as a poet often has license from the rules of grammar and 
pronunciation, we should like to ask for “physicists' license " from the rules of mathematics in order to express what we wish to say in as 
simple a manner as possible. 

- Richard R Feynman [261] 

The Airy TMA of Eqs (18.41) and (18.51) is an improvement on the plane waves used in the S matrix in Eq. (19.3). 
This is because not only can trapezoidal segments better represent a general potential barrier than rectangular segments, 
but also the presence of a field at the right-hand boundary can be accounted for. The cost of that fidelity is that attention 
must be paid to when passage from under the barrier to over it occurs, or vice versa, as in Figure 19.9. When using the 
Zi functions this necessitates the introduction of the transition matrix of Eq. (18.50). The transition matrices and their 
bookkeeping, coupled with the greater effort required to numerically evaluate Zi(c, z), seems to tilt preferences to the 
plane wave TMA for the work-adverse. Those with greater fortitude realize that the plane wave approach requires many 
potential segments to obtain only reasonable correspondence for curved potentials, but with the Airy approach, a few 
segments are generally enough to achieve good accuracy. As a trivial example, the number of segments for an RTD under 
bias for the plane wave TMA requires many potential segments as it approximates linear regions as a staircase potential, 
but the Airy approach need not require any more segments than the unbiased case and remains exact because all the 
segments of the potential are linear in field. 

The task is therefore to find the matrix replacement using Airy functions to take over for S in Eq. (19.3). The approach 
will make full use of the Zi functions of Eq. (18.24). S a will be the product of two matrices, or S a = M, ■ M r , where l 
(for left) indicates the linear region before Xj (the; - 1 region) and r (for right) the linear region after x ; - (the; region). 
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Figure 19.9 Labeling associated with a two-region potential barrier with linear potential segments. 

The 1| ; .) correspond to the two-component vectors containing the transmission (f ; .) and reflection (q) 
coefficients. The blue squares identify the potential points that define k a , f, c, and s for each region. The 
red diamonds represent where the matrix equations M||) are matched by matrix equations analogous to 
Eq. (18.2). The orange diamond represents where a transition matrix (Eq. (18.50), in this case for 
under-to-over) must be inserted. Compare with Figure 19.1. 


Consider the components for the; region (for the (j - 1) region, the indices are simply altered by j -<■(/- !))■ Introducing 
the nomenclature (designed to facilitate coding): 

• fj = 2mFj/h 2 ; as a practical matter, if Fj < 0.005 eV/nm, then fj -*■ 0 and the plane wave representation is used 

• 9=1 if (k 2 + Sjfj(Xj +] - Xj)) < k 2 and i otherwise. 

•*, = tf 73 

• Zj = \k 2 +Sjfj(x j+1 -Xj)\/f* /3 

• Rj = exp[(4/3 )CjzJ /2 ] 

• Cj = In/iCjS^d - 3c 2 )) 
in terms of which 


M, =C hl exp (-|c,- i 
M r =exp(|c ; z ) 3/2 ) ^ 


3/2 \ v ( Qj-i Gi' (—Cj-i . 9 -i) 
i- 1 > \-qj. x R hX Gi\c hl ,z h {) 

Gi(cj, Zj) RjGi(-Cj, Zj) \ 
QjGi'iCj, Zj) qjRfrH-Cj.Zj)) 


—Gi(—Cj_ l , Zj _{) \ 

R j-vGi(cj_ l ,Zj_ l ) J 


where the "reduced" Gi(c, z) functions are defined by 

Gi(c, z) = Zi(c, z ) exp ( -1 cz i/2 ^ 


(19.11) 

(19.12) 


(19.13) 


Lastly, should c = 1 -*• i (under to over) or c = i -»• 1 (over to under), then M r -> M r ■ S(x 0 ), where S(x 0 ) is the matrix 
of Eq. (18.50). As a consequence of using the "reduced" Gi(c,z) functions, an overall multiplicative exponential term 
appears in front of each matrix, the form of which is instrumental in leading to the Gamow factor in the approach of 
Sections 19.1.2 and 19.2.2 below. 

Assuming that the matrix evaluations are performed and the matrices multiplied with care to include instances where 
over to under (or vice versa) crossings are accounted for, then an IV-component potential barrier will give rise to a 
transmission coefficient t N (k). Now, though, a field f N exists on the right-hand boundary. The evaluation of D(k) thereby 
becomes more involved compared to the plane wave version of Eq. (19.4). Evaluating;' (ra „ s using Eq. (9.61) and comparing 
it to j inc for the case where y/ N (k) is a sum of Airy functions corresponding to an outgoing wave as in Eq. (18.22), the 
transmission probability is now [199] 


m = 


\t N m 2 


(19.14) 


When applied to the parabolic potential or, more importantly, the image charge potential, to take full advantage of 
the economy of the Airy approach requires an artful placement of where the segments are and how wide they should 
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be made. And it is an Art. A few general guidelines encapsulate the aesthetics involved: (i) the Xj points should be more 
densely placed when the potential varies strongly (e.g., near the origin for the image charge barrier), (ii) placed sparsely 
when the potential is either constant or linear, and (iii) placed carefully where the potential undergoes a maximum 
or a minimum (ideally on the maximum or minimum). Although the {Xp V'} could be placed mechanically, densely, 
and uniformly without an unacceptable burden on modem desktop computers, it sacrifices elegance and economy of 
execution. 

In the case of the image charge potential of Eq. (11.25), the zeros and maximum are analytically known from Eq. 
(11.26) and its cohorts. For a more general potential, finding the zeros of a potential or field can make use of the methods 
of Section A3.7. A method employed below for the image charge is to find the [xj] such that the potential is reasonably 
linear between each Xj and x J+l . If the artistry of the placement is well considered, then even a small number of points is 
enough for a good account of D(k ) and therefore J(F, T). 

Consider the Airy method applied to the field emission problem represented by Figure 19.10: for metal-like and typical 
environmental parameters, a calculation of D(k) and its numerical integration with the supply function f(E k ) of Eq. (11.5) 
results in Figure 19.11. As far as the big picture is concerned, the Fowler-Nordheim (FN) equation of Eq. (13.25) gives 
reasonably good correspondence, but a comparison of the numerical values reveals that the FN equation is consistently 
smaller in the range of 20-50% (performing better at higher F). This is significant for an odd reason: much has been 
written about the Nordheim functions v(y) and t(y) and how to obtain them to great precision, but such efforts are 
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misguided when the discrepancy between the numerical solution of the field emission current density using TMA differs 
as much as it does from the FN equation itself. The very good approximations of Eqs (13.23) and (13.24), accurate to 
better than 0.4% and trivial to use in the guise of Eq. (13.25), are more than adequate for most considerations of field 
emission from metals. Therefore, efforts to "improve" the FN equation are better spent on what impact the pre-factors 
have [262], or identifying how the equation changes as emission near the barrier maximum occurs in thermal-field 
conditions (Chapter 17). 


19.2.1 Kemble approximation revisited, again 

Help me then to draw out the conclusion which follows from our admissions; for it is good to repeat and review what is good twice and 
thrice over... 

- Plato 5 

The ease of the plane wave TMA formulation stands in contrast to the more involved Airy function TMA so that 
affection for the Lear approach ("seek thine own ease". Section 9.1.1) might encourage exclusive usage of the former. 
Yet one must be careful in doing so, as there are times when the planar TMA causes unphysical artifacts to appear if and 
when the segment widths approach the inverse wave numbers \/k. If that is opaque, an example clears up the nature of 
the problem, and revisiting the Kemble approximation is a good laboratory to explore what that problem is, particularly 
because it is already by now so familiar. 

Therefore, consider the parabolic potential again, where the form of V(x) as seen in both the planar and Airy 
TMA is shown in Figure 19.12, and where the x axis is normalized to the base width of the parabolic potential. The 
potential is partitioned into 10 equally thick slabs: because of the manner in which the partitioning is numerically 
executed, in which Xj = L(j - l)/(N- 1) and V) = V(Xj), the first partition for the planar model has zero height. 
For m n = m and L = 2 nm both the planar and Airy TMA are in reasonable agreement, as shown in the left plot 
of Figure 19.13. However, when only L is changed to 10 nm, then the right plot of Figure 19.13 results. Wave 
function interference now has the opportunity to occur because the width of the partition is too large, and the 
consequences on the evaluation of D(k) are stark. In contrast, the Airy TMA approach performs well, in that the 
results do not change appreciably when the number of partitions is increased (N = 20) whereas the planar TMA, 
although much improved, shows interference (rippling) for k> k 0 even though double the number of partitions 
is used. 


19.2.2 Gamow factor revisited, again 

This act is as an ancient tale new told 
And, in the last repeating, troublesome... 

- William Shakespeare 6 



Figure 19.12 The planar (dashed blue line) and Airy (solid red line) representation of the quadratic 
potential barrier. The x, are indicated by the solid black dots. When the x axis is scaled by L, then the 
L-2 nm and 10 nm representations coincide. 


5 Plato, The Dialogues of Plato: Gorgias (trans. Benjamin Jowett), Great Books of the Western World, ed. Robert Maynard Hutchins, Vol. 7, Plato. 
Chicago: Encyclopaedia Britannica, 1952, p. 280. 

6 Ref. [37]: The Life and Death of King John, IV.ii.18-19. 
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sufficiently large that interference effects become manifest and undermine the accuracy of the planar TMA method for a low number of partitions, but 
not the more robust Airy method, for which the results do not change appreciably as the number of partitions increases. 


Reconsider the Gamow factor using the Zi functions. As in Eq. (19.7), the center two matrices describe the barrier itself, 
and so when the potential V(x) varies linearly between the left- (where z = z 0 ) and right- (where z = z L ) hand sides, and 
the energy of the electron is well below the barrier height, the matrix B is given by 


(Zi{l,z 0 ) Zi{-l,z 0 )\( Zi(l,z L ) Zil-1,Z L )\ 
\Zz'(l,z 0 ) Zi'(— 1, Z 0 )) \Zi'(l,z L ) Zi'(-Lz f )J 


(19.15) 


where no transition matrix is needed if the energy is such that the electron does not pass from under to over the 
barrier in the section. If the left-hand side is at the origin and the right-hand side is at L, then z 0 = (k 2 - k 2 )/f 2/3 and 
z L = (k 2 0 - k 2 + sfL)/f 2 ^. The asymptotic approximations H(l, u) -*■ 2, -»■ 1, and Di(c,z(u)) = csf l ^yfu can be 

made. Effort shows 


cosh[|(zf -4 /2 )J 
x(z L z 0 yi^ 


v / 3 v^ 1 \ 

K f 2/i y/hT 0 


(19.16) 


where, for large argument, cosh(x) « sinh(x), as was done in Eq. (19.9). Notice what has happened (again): the cosh term, 
being large, emerges as a coefficient to the matrix, so that for more general potentials constructed of trapezoidal slabs 
sandwiched together these pre-factors are going to cluster together in the form of a product. Moreover, the argument of 
the cosh term is seen to be 

| (z] ,2 -zl l2 )=s j yj k 2 - k 2 + sf xdx (19.17) 

The s on the right-hand side seems to be a problem, but is not: the potential is either increasing (s = 1) or decreasing 
(s = —1), making z L > z 0 or z L < z 0 , respectively, and therefore making the argument of the cosh term positive or negative, 
respectively. 

When the argument of cosh(x) is largish, then cosh(x) -> e*/2, and so, as with the plane wave approach, the product 
of exponentials results in the exponential of a summation, which will be of the form 


Yj [ J k ] ~ k2 + S /M ~ x l) dx [ *■(*)<& 

i—o Jx i v A) 


(19.18) 


where x(x) is the smooth curve that passes through all the kj - k 2 , hkj = ^j2mV- y and the property of integrals, namely 
that 


/* 




(19.19) 


has been used. With that last step, the Gamow factor is recovered as twice the expression given in Eq. (19.17), a necessary 
but not surprising result. 
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The behavior of multiplicative transmission probabilities can finally be seen in the next most simple barrier possible, 
namely, a rectangular (r) barrier pushed up against a triangular (t) barrier, or an rt barrier for short, which can also be 
(more or less) evaluated analytically. If the rectangular region is of width L, then it can be shown [188] that to a good 
approximation, 

D rt (k) « D r (k)D t (k) = e~ K(k)L D FN (k) (19.20) 

with j c(k) 2 =k 2 0 -k 2 , which recapitulates Eq. (19.18) rather well. 

19.2.3 Schottky deviations 

Next...come the philosophers...they still boast that they can see...things which are all so insubstantial...they bring out their triangles, 
quadrilaterals, circles and similar mathematical diagrams, piled on top of each other and intertwined like a maze, and then letters of the 
alphabet which they marshal in line and deploy hither and thither. 

- Erasmus 7 

An interesting usage of the Gamow factor analysis is to shed light on the deviations visible in a Schottky plot for thermal 
emission, wherein a plot of ln(J(F, T)) vs F 1 ^ 2 is mostly linear. The simple model motivated by Eq. (19.20) will exaggerate 
(by quite a bit) the magnitude of the departures from linearity, whereas the actual deviations are so minute that Erasmus 
would have lampooned their investigation. A plot of experimental data reveals them [135, 164], an example of which is 
shown in Figure 19.14 (based on Figure 6 of Haas and Thomas [135]) to give a sense of their behavior and magnitude: the 
periodic nature visible is a consequence of the variations in the average reflection coefficient term (1 — r) that normally 
attends the RLD equation, first encountered in the discussion following Eq. (12.3). 

The theoretical derivation of the deviations is involved, with the work of Cutler and Gibbons [263] being notable, but 
rather than reproduce it, a simple model and Eq. (19.20) are enough to justify why those deviations are plausible. The 
image charge potential can be very crudely treated as a rectangular barrier pushed up against a triangular barrier (a "slab 
+ triangular barrier" [188], shown in Figure 19.15), on which Eq. (19.20) is applied. The trick is to determine the height 
and width of the slab, which, as Figure 19.15 demonstrates, is field dependent. 

Some artistry is involved in setting the slab parameters. Choose the smaller zero of V{x) - p = 0, as the image charge 
barrier methods for calculating x ± (p) via Eq. (11.26) are available. Thermal emission is characterized by comparatively 
small fields, and so the small field limit is x min « Q/<t>. As for the right-hand side of the slab, its height should be the 
same as the height of the image charge barrier, and its location should change as the location of the barrier maximum 



sqrt(4QF exp ) [eV] 

Figure 19.14 Deviations from the Schottky relation for a molybdenum filament at a temperature of 1950 K. F^ refers to the experimental field (the 
field at the surface is larger if field enhancement is present). The deviations are revealed by removing the linear portion of a plot of In (J) as a function of 
yfi. The data was digitally extracted from Figure 6 of Haas and Thomas [135]. 


7 Desiderius Erasmus (trans. Betty Radice), Praise of Folly; and, Letter to Maarten Van Dorp, 1515. London, New York: Penguin Books, 1993, passage 
[52], p. 151-152). 
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Figure 19.15 The image charge barrier and the approximate slab + triangular barrier representation, shown for fields of 0.1 eV/nm (blue) and 
2.0 eV/nm (red) for it = 7 eV and <6 = 4.5 eV. The rectangular barrier spans from x mjn = Q/4> to x e - t/Q/F. The triangular barrier is such that it 
coincides with V lMle (x ma ^ = /i. The slab is the rectangular barrier to the left of the triangular region. 


x 0 = ■s/q/F changes. A convenient choice is x 0 itself. Between the two, the width of the barrier is approximately 



(19.21) 


where the symbol W for the width of the slab is because L is already equated to the width of the image charge barrier 
itself. The slab therefore mimics the thin portion of the potential with dimensions that vary in response to the applied 
field. 

Approximating a smoothly varying potential region by an abrupt rectangular slab comes at a cost. The price is 
anticipated by the smooth behavior of D{E) for the quadratic potential (Figure 19.5) compared to the rectangular barrier 
potential (Figure 19.5), that is, oscillatory behavior for the rectangular barrier is much larger than for the quadratic 
barrier. Insofar as the thickness variation of the rectangular (or slab) barrier region affects the current density, it does 
so to leading order as J as D(k m )J RID , where k m is the value of k that maximizes the current integrand ( hk/m)D(k)f(k ). 
The expectation is that k m will satisfy something like h 2 k 2 m l2m = n + </> + nk B T, where n is of order unity, that is, the 
integrand peak will be about a thermal energy above the barrier maximum, which is intuitively reasonable. 

The deviation is obtained by removing the part of ln(J) that is linearly dependent on \J4QF. That is difficult, but because 
ln(J WD ) varies linearly with yfAQF as shown by Eq. (12.3) (remember that $ = ® - \JaQF), and because the triangular 
barrier part captures most of J RLn , consider the ratio ln(J RLD /J) as a reasonable proxy. From Eq. (18.10) it is 



(19.22) 


with Ti 2 k^ n /2m = n + + Ti 2 K 2 /2m, where Ti 2 K 2 /2m = nk B T is a small energy, and h 2 k 2 m /2ni is about the average energy 

of an electron just emitted over the barrier. The smallness of k B T as 0.168 eV at T = 1950 K compared to n and r/> means, 
then, that to leading order 



(19.23) 


which itself varies weakly for fields of the magnitude shown in Figure 19.14 (assuming n « 1). That means the deviations 
can be traced to the sin term of Eq. (19.22), which causes similar oscillations to those observed in the experimental data, 
albeit on a far grander scale. In fact, assuming that n = 1, and that field enhancement due to surface roughness and/or 
curvature is about a factor of 4 (that is, the field at the emission site is 4 times greater than the experimental field 
F = AF exp , the later of which is used in Figure 19.14), then for n = 7 eV, ® = 4.5 eV, and T = 1950 K, calculation shows 
that Eq. (19.22) is as shown in Figure 19.16. Consequently, the oscillations in the deviations of thermal emission data 
from a linear Schottky relation are plausibly related to the wave nature of the electron. 
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19.2.4 Resonant transmission 

...I would like to point out that many high barriers exist in this world: Barriers between nations, races and creeds. Unfortunately, some 
barriers are thick and strong. But I hope, with determination, we will find a way to tunnel through these barriers easily and freely, to 
bring the world together... 

-Leo Esaki [254] 

The RTD barrier of Figure 19.6 is shown at zero bias, that is, V(x -*■ -oo) = V(x —► +oo). When a bias is applied, then 
the right-hand side of the barrier is brought down to a level —V b . The simplest approximation is that the bias causes a 
linear drop across the potential barrier, as shown in Figure 19.17, starting a tad before the first barrier and stopping a 
tad after the second. The barriers of 0.3 eV are characteristic of an RTD with AlGaAs barriers sandwiched by regions of 
GaAs, for which relevant parameters are m n = 0.0667 m, a doping density of 10 18 cm -3 (p « 0.08 eV), k F « 0.374 nm' 1 , 
T = 300 K, and k B T = 0.0259 eV [264, 265], although the barrier and well widths in the present example are chosen 
more for dramatic effect than for technological relevance. Lastly, effective mass changes from one heterostructure layer 
to another are an important modification [266, 267] and make changes to the matrix equations defining the transmission 
probability, but for simplicity now, treat the effective mass as the same throughout. 

The transmission probability D(k) associated with the potential profile shown in Figure 19.17 is shown in Figure 19.18, 
where the supply function f 0 {k) for the aforementioned GaAs parameters is shown by the gray dashed line overlay, 
normalized to 1 at k= 0, and labeled by the biases associated with Figure 19.17. A resonance level (sharp peak) is 
accessible by the supply function (i.e., they overlap where f(k) is not negligible), but as the bias increases, the peak 


0.3 


0.2 

0.1 

1 0.0 
0- 

-0.1 

-0.2 

0 2 4 6 8 10 12 

Position [nm] 

Figure 19.17 Potential barrier profile for an RTD under bias. Black line (V t = 0.0) corresponds to an unbiased state (Figure 19.6). The beginning and 
end of the linear slope lie a short distance before and after the barriers, respectively, just for the fun of it (although band bending would cause an 
analogous behavior). 
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k [1/nm] 

Figure 19.18 The transmission probability for each of the biases shown in Figure 19.17 for a barrier modeled after an AlGaAs/GaAs RTD. The resonant 
peaks occur at lower and lower k values until by V h = 0.2 eV the resonant peak disappears altogether. Gray line is the normalized supply function for 
comparison. 



in Figures 19.17 and 19.18. When 
resistance. 


shifts to lower k until it vanishes, at which point there is no resonant peak accessible until the bias becomes substantially 
higher. As a result, the current density goes through a maximum but then declines as voltage increases, as in Figure 19.19, 
creating a region where an increase in voltage results in a decrease in current, that is, negative differential resistance occurs. 

The bias across the potential barriers was linear for simplicity, but in real RTDs the well region can fill up or deplete 
depending on conditions, and band bending (Section 24.2) occurs before and after the barriers. As the presence of charge 
density leads to space-charge contributions, that well density has a consequence. Numerical simulations to investigate 
it reveal that the charge in the well region (and in the regions just before and after the barriers) is enough to bring the 
resonant level into and out of the range where resonant tunneling can occur. As a result, oscillations in the current 
density arise with a time scale (in the THz) dictated by how long it takes the well to repopulate and deplete. The I(V) 
characteristic that results therefore develops a hysteresis, that is, ramping the voltage up gives rise to a different I{V) curve 
than ramping the voltage down in the negative differential resistance region, although revealing the hysteresis requires 
more formidable approaches than TMA methods [127, 268, 269], Such odd circumstances lead to unexpected intrinsic 
bistability and current oscillations, and bear on the thorny problem of quantum mechanical tunneling times [255], 

Now consider what seems to be a minor modification to a well-examined potential barrier: the triangular barrier of 
height fi + ® = 11.5 eV and subject to a field of F = 5 eV/nm with a modification suggested by Eq. (19.18), namely, 
remove one of the slabs from it. Visually, what is being done is shown in Figure 19.20: a trapezoidal piece of width 
L = 0.6 nm centered at x c = 0.7 nm is excised from the triangular region, making the width of the first trapezoidal barrier 
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: red dashed outline. The depth of the well is 9.5 eV below the triangular barrier. The field is 5 eV/nm 1 


f well at 0.7 nm), leaving the potential 
ir both the barrier and well regions. 


L 0 =x c - L/2 = 0.4 nm. The potential well is made of a depth such that the right edge of the first barrier is at V = 0, so the 
well is of a depth of fi + ® - FL 0 = 9.5 eV. The height of the triangular barrier at x c is V c = 8 eV. If the slab were not excised 
from the triangular barrier, then D(k ) would be given by Eq. (18.57): call this solution D tri (k) to serve as a reminder that 
it is the triangular barrier transmission probability. Now the slab is excised. From Eq. (19.18) it is therefore expected that 
the transmission probability becomes 


D(k) * exp ^2 j '^ 



D tri (k) 


where k c (E) 2 = (2m/h 2 )(jA + ® — E — Fx c ). Taylor expanding the integrand, the linear term vanishes because only even 
powers of s (i.e., s 2j and not s 2J+l ) survive over the integration region. To leading order, therefore. 


D(k) » exp(2 K c {E k )L)D tri {k) (19.24) 

The current density J seq due to the sequential tunneling terms is then seen to be (more or less) comparable to 

J sec {F) » exp(2K c (n)L)J PN (F) (19.25) 


where J fn (F) is familiar from Eq. (13.10). The approximation is better than the off-hand treatment here, as can be shown 
more rigorously [199], but accuracy is not the goal, rather, seeing how the sequentially tunneling component stacks up 
to the resonant tunneling term is more interesting. 

The form of Eq. (19.24) is enough to suggest that if exp(-2 ic(E)L)D(k)/D PN (k) is plotted, any unusual deviation from 
a line of order unity should be indicative of a resonant energy level within the well. Sure enough. Figure 19.21 reveals 
two of them. In general, near a resonance, D(E k ) behaves as [255] 


D(E k « E res ) a 


! + {(]?*- E res )/AE} 2 


(19.26) 


where E res is the energy of, and A E is the width of, the the resonant level. 8 At the resonances, the peaked behavior evokes 
the Lorentzian functions of Section A2.7, and the temptation to treat 8 n (E) (where n = 1/A E) as a Dirac delta function is 
irresistible. One finds (in the zero temperature limit) 


J res 




in - E res )D 0 nAE8„{E - E res )dE 


(19.27) 

(19.28) 


®The notation is being handled a bit cavalierly: D(E) is shorthand for D(E k ) = D(k), where k in the latter corresponds to y/2mE k /h in the former, 
as introduced in Section 11.2. 
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Figure 19.21 The ratio exp(-2ir(B)L)D(f:)/P In .(/r) for the dashed line potential of Figure 19.20. The red line is a best fit Lorenztian term for the higher 
energy resonant level associated with the barrier. 



For \i = 7 eV, ® = 4.5 eV, and the parameters of Figure 19.21, direct numerical evaluation shows that J res is about 28x 
larger than J seq , and only the larger E res was being considered in the comparison. Resonance terms can be substantial. 

An account of why the form of the resonance is like a Lorentzian relates to the concept of an "attempt frequency" / 
in a tunneling formalism by Price [270, 271], although the present treatment owes more to the groundwork laid out in 
Section 11.2. In the Bohm approach, for which y/ = Re lS as in Eq. (11.7), the quantity (fi/m)d x S was a velocity. 9 Abound 
state for an infinite potential well [44] entails that y/(x) is real, so that S = 0 and no current flows because Eq. (9.60) is 
identically zero. For practice, consider how that changes for transport past a potential barrier by introducing the quantity 
y(x) = yf(xy l d x yr(x). Using the Bohm form, y is 

y(x) = d x ln(R) + id x S (19.29) 

Assuming that ln(R) varies slowly, the second term becomes the more important one, and is known from Eq. (11.13) 
(where the quantum potential £2 « 0 in the same approximation). Clearly, y is a function of E as well, and so can be 
expanded around the resonance energy E 0 according to 

y(E) « (E - E 0 )y'(E 0 ) (19.30) 

where the prime indicates derivative with respect to argument and the absence of a first term y(E 0 ) is a reflection of 
the vanishing of y near a "quasi-level" (which acts like the resonance level). If y retains a semblance of d x S, then y' will 
resemble 

y'(E 0 ) ~ - VT 1/2 ~ -J- (19.31) 

2 n nv 

where v 2 = 2(E 0 - V)/m acts like a velocity in the form of Sx/St. If St = 2// is a transit time (where / is an "attempt 
frequency"), then <5x = 1 Jk is a characteristic length for which the wave number k is appropriate, and so the expectation 
is that Eq. (19.30) should be something like 10 

/(£*)-§ (19.32) 

«/ 

which means that 

2k 

y(E) * (E - E„)j£ (19.33) 

Compare this to transport problems, where the incident and reflected waves are combined into y/ as 

yr = N ( e ikx + re~ ikx ) (19.34) 


9 This is trivially seen for a constant potential region, for which d x S = k and v = Hk/m. 

10 Compare Eq. (6) of ref. [270]: Price's treatment is more rooted in Gamow's tunneling theory, and therefore may have greater appeal for the 
more fastidious reader. 
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where r is the reflection coefficient. If that t/r is used in the definition of y = y/ l d x y/ and evaluated at x = 0 


the inversion of which, plus Eq. (19.33), results in 

_ _ hf - 2i(E - E 0 ) _ (hf - 2i(E - E 0 j) 2 
r ~ hf + 2 i{E - E 0 ) ~ (hf) 2 + 4(E - E 0 ) 2 
the denominator of which has the structure of a Lorentzian 

where the width of the Lorentzian is governed by the tunneling rate factor /. 


it results in 

(19.35) 

(19.36) 

(19.37) 
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CHAPTER 20 

Ion enhanced emission and breakdown 


...the Chimaira...a thing of immortal make, not human, lion-fronted and snake behind, a goat in the middle, and snorting out the 
breath of the terrible flame of bright fire... 

- Homer 1 

Emission mechanisms due to thermal, field, photoemission, secondary emission, and now resonant tunneling have 
been up till now as treated as distinct processes, not often seen operating together in the explanation of a phenomenon. 
Their segregation is seemingly natural, but wildly different parts grafted together need not be horridly monstrous. One 
circumstance where all of the emission mechanisms play a part makes for an interesting bit of physics [272]. This happens 
in the gaseous breakdown across gaps [273, 274], otherwise known as discharge. Even more interesting here is when 
the gaps are micro-scale and so-called "micro-plasmas" are generated. Conditions can be pushed to the point where the 
discharge becomes self-sustaining, and an avalanche breakdown occurs. Interest, initially motivated by concerns over 
spark discharges and erosion of material in microelectromechanical systems, has since grown due to possible applications 
for gas sensing, lighting, energy conversion devices, and micro-plasma physics [275], but ambitions here are a bit more 
circumscribed, being more towards how the emission mechanisms manifest themselves. 


20.1 Paschen's curve 

Our doubts are traitors 

And makes us lose the good we oft might win, 

By fearing to attempt. 

- William Shakespeare 2 

Electrical breakdown or arcing is so complex, dependent on the gases involved, the current levels present, the voltages 
that exist, whether the surface of the cathode is rough or smooth, and what the origin of the initial emission mechanism 
is, that the prospects of fashioning a reasonable account of it may seem doubtful. Electrons in the anode-cathode (AK) gap 
generally always exist, having been created by cosmic rays, radioactivity, or photoemission, as suggested in Figure 20.1. 
Likewise, different regimes exist in the current versus voltage relations: as the voltage increases, the current increases 
but plateaus, continuing to raise the voltage and at some point the current starts to increase again, until it takes off 
and becomes self-sustaining (even if the initial electron generation mechanisms are suppressed). When that happens, 
breakdown occurs [12]. The conditions for the onset of breakdown depend on pressure P, AIC gap separation D, and 
the anode voltage at breakdown q<p b = V b , and the curves that result depend on the nature of the gas involved. The 
characteristic V b versus D (or the product of gap distance and background pressure PD) curves for each gas are called 
Paschen characteristics, the shape of the curve being dictated by Paschen's law. The relation for some typical gases is 
shown in Figure 20.2. 

A bit of reflection shows, however, that the behavior of the curves is qualitatively explicable: to the left of the curve 
minima, the breakdown voltage V h increases as the product of pressure P and AIC gap separation D decreases because 
the separation between the gas molecules becomes larger, making the probability of another ionizing collision between 
generated electrons and neutrals less likely. Conversely, to the right of the minimum, V b increases with the PD product, 
and reflects the fact that when sufficient opportunity arises for electrons to have collisions with neutrals, a higher voltage 

Corner, Thellliad (trans. Richmond Lattimore). Chicago: University of Chicago Press, 1951, Book 6, lines 179-183. Hesiod, in Theogony (lines 
319-325), gives a similar description of the beast. 

2 Ref. [37]: Measure for Measure, V.4.78-80. 
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© 2018 John Wiley & Sons Ltd. Published 2018 by John Wiley & Sons Ltd. 


245 



246 Chapter 20 


ANODE 



Figure 20.1 Steps leading to breakdown: incident photons, cosmic rays, or ionized gas molecules strike the cathode surface and liberate electrons. 
Electrons accelerate in field F = VjD and when their energy is sufficient, they can collide with other gas molecules and ionize them. Ions are drawn to 
cathode, and liberated electrons are further accelerated, both of which lead to more electrons. 



P D [Torr-cm] 


Figure 20.2 The Paschen relation for breakdown voltage versus the product of the background pressure P and the AK gap separation D for the gases 
hydrogen, oxygen, ammonia, and carbon dioxide. The noble gases have a similar curve (see Figures 7 and 8 of ref. [274]). 


is required so that the electrons have a chance of being accelerated to a high enough energy to ionize neutrals when a 
collision occurs. 

The mechanism behind the exponential growth in generated electrons (Figure 20.1) begins with the creation of a 
free electron. When that accelerated electron acquires enough energy and collides with a neutral, it liberates n electrons 
(n = 2 in the figure), and each of those electrons typically accelerates again until they too have enough energy, E « VJ/D, 
to liberate electrons from the neutrals they strike. The growth is exponential because if there are N x electrons after the 
first collision, there will be N 2 = nN r after the second, and by iteration, Nj = a 7 (V,. If the electrons travel a mean free path 
/, pointed mostly towards the anode, then the iteration index goes as the distance from the cathode. This suggests that 
dN/dx = aN(x) (in preference to Nj +1 = nNj), which has the solution 

N(x) = N 0 exp(ax) (20.1) 

The number of electrons generated per unit time, or the current density of electrons, follows the same behavior, or 

J e {x) = J e { 0)exp(ax) (20.2) 

with the total number of generated electrons being 

J e (D) - J e ( 0) = J e ( 0)(e aD - 1) (20.3) 

The generation of the ions J ; (x) goes in the opposite direction, increasing towards the cathode. The total number of 
generated ions is the same as the number of generated electrons, or 

J,(0) - J t (D) = J e (D ) - J e (0) (20.4) 
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where no ions are generated at the anode, or J t (D) = 0. When the ions impact on the cathode, each ion creates secondary 
electrons, the number of secondaries per ion being y, or J e ( 0) = yJjifi). By joining Eqs (20.3) and (20.4), and using 
Jj(D) = 0, it follows that when the current of electrons and ions is self-sustaining then 

j.(0) = yJ i (0)(e aD - 1) a = I In (1 + y~ l ) (20.5) 


Recall, though, that a is the number of ionizations per unit length, that is, a should be something like a ~ AN e /l, where 
A N e is the number of created electrons, and l is the mean free path, or how far the electrons travel on average before 
they endure their next collision. But A N e should go as the number of electrons N e present and the probability of their 
collision P aU , and so 


AlV e _ N e ( NjAj \ _ N e NjAj 

~T = ~T lOTOJ = 


(20.6) 


where N t is the number of neutrals that will become ions after a collision, A t is the cross-section of the neutrals, V/l = A 0 
is the unit cross-sectional area, and the probability of collision behaves as P cott as N i (A i /A 0 )? 

From the ideal gas law, the density p = NJV of the gas of neutrals is simply P/k B T = pp = p, which, when coupled 
with Eq. (20.6), results in 

N e (x) = N 0 exp (—pPAjX) (20.7) 


but that implies 


a = A Njl « a 0 pPA t exp (-pPAfi 


(20.8) 


Because / is the distance over which the electron is accelerated in a field characterized by F = q£ until it has sufficient 
energy to cause an ionizing collision, it can be related to the ionization energy E t of the neutral atoms by E t = El, and so 


a « aJPA l exp (-pPAfiJF) 

= AP exp (-BP/F) (20.9) 

where the constants A and B are specific to the species of gas atoms or molecules which make up the background gas in 
the gap. The field in the gap is simply F = V b /D at breakdown, and that, coupled with Eq. (20.5), finally allows for the 
breakdown voltage V b to be related to the gap separation D (or the product of pressure and gap separation PD, as shown 
in Figure 20.2) by 

Vb = In [APD/ln(l + y -1 )] (2 °- 10 > 

where from the derivation it is clear that other factors matter as well, such as temperature and the dimensions and 
ionization energies of the neutral gas molecules involved. 

The form of the breakdown voltage in Eq. (20.10) can be rendered more generally as y(x) = x/ In (ax), and from it one 
sees the general contours of Figure 20.2 emerge: for large values of x (or PD), y (or V b ) appears to be increasing roughly 
linearly with x, but when x approaches (1 / a), y rapidly increases due to the singular behavior of 1/ In (ax). In between, 
y(x) goes through a minimum at y = x= e/a, as shown in Figure 20.3. Augment the curves by a factor to change scale, 
and obtaining agreement with experimental data becomes quite easy. 


20.2 Modified Paschen's curve 


She knows there's no success like failure 
And that failure's no success at all. 

- Bob Dylan 3 4 


3 Be alert to the perils of notation: V is a volume, but V b = q<p h is a breakdown energy as <p b is a breakdown voltage. The factors A t and A 0 are 
areas here, but in Eq. (20.9) A is a factor in the Paschen relation. Context determines content. 

4 Bob Dylan, lyrics to Love Minus Zero, No Limit, 1965. 
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Figure 20.3 The Paschen relation y = x/ ln(ax), based on Eq. (20.10), for various values of a. The minimum occurs aty m =x m - e/a and a singularity 
occurs for x= 1/a. 



Figure 20.4 Based on Figure 4 of Torres and Dhariwal [276] (data digitally extracted from a line). The Paschen curve is given by V h = 4,6.73D/ln (0.38P) 
for V in volts and D in microns. 


The Paschen relation works well until the gaps narrow to about 15 pm or smaller, at which point Eq. (20.10) fails 
to account for the experimental breakdown voltage and another model is needed to successfully account for the failure 
mechanism. Consider the data from Figure 4 of Torres and Dhariwal [276], reproduced in Figure 20.4, where the AK gap 
separation is in microns and the voltage across it is in the hundreds of volts. Where there should be singular behavior, 
there is instead a plummeting of the breakdown voltage towards the origin. At the minimum of the Paschen curve (about 
D = 7.2pm), the breakdown voltage is 334 V, corresponding to V b /D = 46 MV/m (or about two orders of magnitude lower 
than standard metal-like fields leading to field emission). As Torres and Dhaliwal ruefully note, the plummeting V b as D 
approaches microscopic spacings bodes poorly for the fabrication of micro-devices, and so its causes are more than just a 
challenging physics problem. 

Hypotheses on the origins of the modified Paschen curve of Figure 20.4 speculated as to the role ions play as they 
approach the surface. At different distances from the surface, the manner in which the ion potential modifies the emis¬ 
sion barrier changes. Field emission (assisted by field enhancement effects due to surface roughness) [275, 277, 278], 
thermo-field emission [279], and resonant mechanisms [272] (similar in behavior to the resonant states of a single 
atom on a field emitter tip [258]) can all contribute. Numerical simulations that include such emission physics match 
the small-gap behavior [280], Therefore, before applying the transfer matrix approach (TMA; Chapter 19), consider an 
approximate treatment first. 

A natural place to start is reconsidering the factor y in Eq. (20.5) by allowing field emission to affect it [275], That is, 
let 

Y~>Yo + Yf (20.11) 
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where y 0 is analogous to the large-gap, high-voltage term of Eq. (20.10), and j y accounts for field emitted electrons and 
therefore will be dependent upon the field that exists at the surface at the point of breakdown. Therein lies the first 
problem: machined metal surfaces always have some level of roughness [281, 282], surfaces which endure arcing or 
discharge can become pitted [283], the presence of particulates or micro particles can induce field emission sites, and 
local heating giving rises to protrusions [31, 284], The field on the apex of a protrusion can be treated using a field 
enhancement factor p g , as was encountered in Section 13.3 and will be considered in Section 30.1. For now, simply 
extrapolate from elementary results. It is well-known from elementary electrostatics that the field enhancement factor 
of a hemisphere causes the field at the apex to be three times greater than the background field (Eq. (30.10)). Couple 
that with the conjecture of Schottky [138], and others [285, 286], that field enhancement factors are multiplicative. If 
so, a bump on a bump on a bump (a reasonable approximation of a surface emission site) would endure a field at the 
apex comparable to 3 x 3 x 3 = 27 times larger than the background field. These simple hand-waving numbers will be 
improved upon in Chapter 30, but the conjectured magnitude is about right. Notice that the discussion has been about 
the field at the emission site compared to its background value F 0 = V b /D, where field enhancement and beta factor refer 
to the dimensionless ratio of the apex field to the background field. 

The field dependence of the probability of emission is dominated by the exponential factor in the Fowler-Nordheim 
equation of Eq. (13.25). A reasonable expectation is then that y f will manifest a similar behavior, or 

/ b ®V2 \ 

r f {F„) ~ C exp l--j m Ce~ b ' F ° (20.12) 


where C is a constant to be determined and, for convenience, b = 4\j2m<& 3 /(3h(S). Because departures from Paschen's 
curve in Figure 20.4 occur at about a gap spacing of 7.2 pm and a breakdown voltage of 335 V, there is where y 0 is likely 
comparable to y f , and so choose C=y 0 exp (b/F c ) so that 

r/ (E^^exp{,(i-i)} (20.13) 


where F c = (335/7.2) eV/pm = 0.05317 eV/nm (corresponding to an electric field of about 53 MV/m). Given the rapid 
decline of field emission current density when the field declines, for F 0 < F c the influence of field emission will fade 
rapidly and so Eq. (20.10) holds with y replaced by y 0 . Conversely, once F 0 surpasses F c , the same observation means 
field emission will rapidly dominate, or yy » y 0 . Since 1 /yy is then small, the approximation of ln(l + x) « x for x small 
means that 


V b (V a > F C D ) 


BPD 

In (y f APD) 


BPD 

[In (yy) + In (APD )} 


(20.14) 


Inserting yy from Eq. (20.13), 


V b = 


_ BPD _ 

In (y 0 APD) + ( b/F c ) - (b/F 0 ) 


(20.15) 


After recalling that F 0 = V b /D, then, surprisingly, isolating V b results i: 
equation, or 


i equation much like the original Paschen 


In ( CAPD ) In (y 0 APD) + (b/F c ) 


with the first form related to Eq. (10) of Radmilovic-Radjenovic and Radjenovic [287] (but see also Rumbach and Go 
[278]), who observe that the coupling of PD in the standard Paschen relation of Eq. (20.10) has been altered: pressure P 
and gap separation D now appear separately. The factor b = (4/3A/?) \/2m<i> 3 tends to dominate all the other terms with 
which it appears, therefore, to leading order, V b « F C D, although the neglected terms in Eq. (20.16) cause a mild visible 
curvature in Figure 20.5. 

And yet, a bit of mystery remains: for small values of D, the breakdown voltage in Figures 20.4 and (20.16) falls even 
faster than governed by V b « F C D, which indicates additional physics. Rumbach and Go [278] demonstrate that at very 
small gap spacings the ion concentration near the cathode can increase by orders of magnitude as the anode potential V a 
increases. The manner in which they change the emission barrier needs investigation. As ions approach the surface, the 
Coulomb field they have adds to the surface field and therefore the Schottky barrier reduction (contributing to thermal 
emission), and when the ion is close enough, tunneling can be increased (contributing to field emission). When adjacent 
to the surface, resonant tunnelling becomes a possibility. All intermingle as the ion approaches. 
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Figure 20.5 The aluminui 
approximately goes as V. 



20.3 Ions and the emission barrier 


Leave your crisp channels, and on this green land 
Answer your summons... 

- William Shakespeare 5 


How the ions induce a path by which electrons answer the call of freedom can be investigated using the Airy function 
TMA. Much like electrons and the image charge potential, ions too induce an image charge, but the emitted electrons 
need not be on the axis defined by the ion and its image, but rather are generally off-side by an amount p (in cylindrical 
(p,z) coordinates). That is, the potential an electron emitted from a metal experiences that is modified by the presence 
of an ion of charge +Zq resembles (after [288], but see also Eq. (4) of ref. [279]) 


V(z) = p + 0-Fz- 
V^z) = 4Z Q {A- 


■m) 


(20.17) 

(20.18) 


For an ion located at the cylindrical coordinates (/>,-, z ; ), then r ± (z) are defined by 
r ± (z) = {(.z±z i ) 2 + p 2 } 1/2 


(20.19) 


where p t is the distance of the electron from the axis defined by the ion and its image. That this is an approximation 
is not obvious, but is important to realize: for general three-dimensional potentials, the tunneling probability must be 
calculated along lines of steepest gradient [289], and an electron plowing through the ion potential along a line parallel 
to the ion-image axis is not following that path. Nevertheless, the one-dimensional potential given by Eq. (20.17) is both 
reasonable and unambiguous, and so it is used. 

How V ion {z) alters the usual image charge potential depends on just how far from the surface z t that the ion is, and what 
value of Pj is being used. Consider as a representative the image charge plus ion potential as the location of z ion moves 
progressively closer to the surface, as in Figure 20.6. As the ion approaches, first the barrier is lowered (thermal emission 
increases), then the barrier thins (field emission increases), after which well energy levels can facilitate transmission for 
energy levels well below the Fermi level (resonant transmission), and lastly when the ion is at the surface, the overall 
barrier is greatly reduced such that both over and under barrier emission occurs (thermal-field emission) whether a 
resonance level contributes or not. 


EXAMPLE: If an argon atom at rest is singly ionized 5 nm from the surface, crudely estimate the number of 
electrons emitted if the average current density across an area equal to twice the cross-section of the atom is 


5 Ref. [37]: The Tempest, TV. 1.130-131. 
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2 [nm] 


Figure 20.6 The image + ion potential given by Eq. (20.17) for z { = 2, 1, 0.8, and 0.3 nm for values of p t = 0.01, 0.05, 0.10 and 0.20 nm. The values of 
p = 6.77 eV and F = 3 eV/nm were used. The gray line in background is the image charge potential without the ion potential. 


J = 2.2 x 10 8 A/cm 2 during the time the ion approaches the surface. Let the surface F be 3 eV/nm. Such numbers 
are for illustration only. 

SOLUTION: The mass of the argon atom is 39.948 g/mole = 3.72 x 10 10 eV/c 2 . The acceleration is therefore F/M. 
The ion will travel a distance l = 5 nm in t = yJllM/F = 1.18 ps. If the radius is R = 0.07 nm, then the cross-sectional 
area is n(2R) 2 = 0.061575 nm 2 . The crude estimate for the number of emitted electrons is then Jn(2R) 2 t/q « 1. 


In the example above, the choice of 5 nm was not entirely random: the first ionization energy of argon is 1520.4 kJ/mole 
= 15.758 eV. An electron accelerated in a field of 3 GV/m would have to travel 5 nm or more to acquire enough energy 
to ionize an argon atom after a collision. Applied to the estimate of y, the emission of electrons as the ion approaches the 
surface represents a more credible description of how y f contributes to y, as discussed by Tirumala and Go [277], where 
emission was integrated over time and area during the ion's approach. Their resulting analytical model compared well 
with simulation and experiments dealing with modified Paschen's curve behavior using only a field emission contribution. 

The potential barriers of Figure 20.6 change dramatically for small changes in the position of the ion, and therefore 
those potentials do not enable a convenient analytical solution for either a Gamow factor or a current density equation. 
However, the vales of D(k') can be evaluated using the TMA of Eq. (19.14). Assume that the potentials have been artfully 
represented with linear segments densely placed in regions of rapid variation, further apart where long linear segments 
suffice, and having made sure that over/under and under/over matrices are properly placed, a program to evaluate D(k) 
returns an array of values D(kj). The supply function can likewise be specified at the same kj values to give f(kj), and from 
these two the current density through and over the potential barrier is found as a summation using Eq. (A 1.17). If k min is 
chosen such that D(k min ) <sc 1 and k max so that f(k max ) «: f(k F ), then the corrections due to the boundaries are negligible, 
and so, with kj = k min + (k max - k min ){j - 1 )/(N - 1) for; from 1 to N, 

\v ^ /hkj\ 


(20.20) 
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Figure 20.7 The current density J(F, T) (in A/cm 2 ) as a function of the distance z t (in nm) an ion (Z = 1) is away from the surface. Logarithms are in 
base 10. The current density peaks at z t - 0.525 nm. 



Figure 20.8 The current density J(F, T) (in A/cm 2 as a function of the distance z t (in nm) an ion (Z = 1) is away from the surface. Logarithms are in base 
10. The current density peaks at z t = 0.525 nm. 


where A k = {k max - k min )/(JSI - 1). For ease, a uniform spacing was assumed, but that can be modified, for example spacing 
k quadratically results in a summation equivalent to an energy integration. 

If applied to the potentials shown in Figure 20.5 and letting p, = 0.01 nm (so that the incident wave is approaching 
the ion core potential almost dead center), repeated evaluations of Eq. (20.20) as moves from far away to in close give 
the J values shown in Figure 20.7. Position can be converted to time by the relation z(f) = l - ( F/2m)t 2 . As a result, the 
total emitted charge, or the sum over the product of J(f ; ) with the time increment f ;+1 - tj = Af ; -, makes for an estimate 
of the number of field emitted electrons as a function of time, as shown in Figure 20.8, which is related to the value of 
Yf. The field F = 3 eV/nm was intentionally chosen low for the purposes of this example: a small increase in field, charge 
state Z, or dwell time of the argon atom on the surface will substantially increase the current density, and thereby the 
emitted charge. 

If the origin of the current density shown in Figure 20.7 were due to standard field emission processes from a 
copper-like metal with the effect of the ion to simply reduce the effective work function, then the Fowler-Nordheim 
equation itself could be used to back out what the presumptive work function would be. From Eq. (13.25), a reasonable 
quadratic fit gives ®(J) [eV] « 4.9193 - 0.11691n(J) - 0.010228(ln(J)) 2 for J in units of A/cm 2 . Consulting Figure 20.7, 
the maximum J is 1.3342 x 10 9 A/cm 2 , for which <t>(J) = 2.0115 eV, and therein lurks a problem. The Schottky 
barrier-lowering factor for F = 3 eV/nm is y/4QF = 2.0784 eV: the barrier maximum has been dragged below the Fermi 
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Figure 20.9 The tr; 

the Fermi level pi. 



probability D(E k ) (solid line) alongside the supply function ratio f(E k )/f(0) (dashed line). The resonance level is just below 


level n, and the Fowler-Nordheim equation under such circumstances is not an adequate representation. Something 
else is involved, and that something else might be resonances. 

To see if the resonances hypothesis is reasonable, return to the Airy TMA values of D(k ) for when the ion is 0.525 nm 
away from the putative surface. Graphing D(E k ) alongside the supply function f(E k ), as in Figure 20.9, shows that a 
resonance level has been brought to just below the Fermi level (at room temperature, f(E) appears to be linear and cross 
the axis at E = n, after which it vanishes quickly). Creating similar graphs for z t further out, or closer in, shows that the 
resonance becomes less favorable to emission. Therein lies the power of the Airy TMA, as the resonances would not 
otherwise be revealed. The modified Paschen relation is therefore interesting because a variety of emission mechanisms 
act in tandem and therefore the physics is rich. 




PART IV 

The complexity of materials 


The underlying physical laws necessary for the mathematical theory of a large part of physics and the whole of chemistry are thus 
completely known, and the difficulty is only that the exact application of these laws leads to equations much too complicated to be soluble. 
It therefore becomes desirable that approximate practical methods of applying quantum mechanics should be developed, which can lead 
to an explanation of the main features of complex atomic systems without too much computation. 

- Paul A.M. Dirac [290] 
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Metals 


The individuals see the good they reject; the public wills the good it does not see. All stand equally in need of guidance. 


- Jean Jacques Rousseau 1 


... men that are free, well-born, well-bred, and conversant in honest companies, have. ..honor. Those same men, when by base subjection 
and constraint they are brought under and kept down, turn aside from that noble disposition... 


- Francois Rabelais 2 


Many treatments of energy bands in metals and semiconductors start with how electrons move through the bulk 
material [82, 86], but an equally good place to start is how the atoms behave as they are brought together to form that 
bulk [ 104]. Each illuminates in a unique way the different behavior of electrons when free or bound, how these behaviors 
are manifest as collections of atoms are brought together to form the bulk material, and how electrons are then induced 
to differentiate between where they can go and where they cannot: whereas a physics approach focuses on how the 
Bloch waves which roam over the (perfect) lattice behave, a materials approach considers the linear combination of atomic 
orbitals (LCAO) and the chemical bonds they enable. 

Plane waves are useful basis states, given their simple relationship between k and energy E(k), and especially because 
of their utility in treating tunneling and emission. Nevertheless, inside the metal (or semiconductor), freely roaming elec¬ 
trons are from atoms that have lost their outermost electron, and what is left behind are ions that give rise to long-range 
Coulomb potentials. The free electrons that migrate long distances over the bumps and lumps of the underlying ions 
gives an understanding of how the band gaps behind the Kronig-Penny model arise. 

The opposite focus on the atoms that are brought into close proximity considers using basis states characteristic of the 
Coulomb potential, such as those behind the hydrogen atom of Section 9.1. Those potentials interact, and how they do 
so gives another way of appreciating how band gaps originate, which is the viewpoint behind the LCAO approach. 

21.1 Density of states, again 

I have told thee often, and I retell thee again and again... 


William Shakespeare 3 


A potential well with infinite walls located at ±L/2 constitutes a straightforward and simple quantum mechanical 
problem in the determination of its energy levels. Within the well, Schrodinger's equation relates energy to momentum 
as E = h 2 k 2 /2m, so the most general state of energy E is of the form (using the notation of Section (8.1.1)) where particles 
travel to the right (+k) or left (—k), or \y/) = a + \ k) + a_ \-k), where a ± are coefficients to be determined. As in Section 
9.2.2, the normal procedure is to match the wave function and its first derivative at boundaries, but for infinite walls 
there is nothing beyond them. This makes the problem not one of finding the coefficients, but of finding restrictions that 
are placed on the values of k. Because the wave function vanishes at both boundaries, the two equations for the two 


unknown a values are 



(21.1) 


3 Jean Jacques Rousseau (trans. G.D.H. Cole), Montesquieu/Rousseau, Great Books of the Western World, ed. Robert Maynard Hutchins, Vol. 38. 
Chicago: Encyclopaedia Britannica, 1952, "The Social Contract", Book II, Chapter 6, p. 400. 

2 Francois Rabelais (trans. Sir Thomas Urquhart and Peter Motteux), Gargantua and Pantagruel, Great Books of the Western World, ed. Robert 
Maynard Hutchins, Vol. 24, Rabelais. Chicago: Encyclopaedia Britannica, 1952, Book 1, Chapter 57. 

3 Ref. [37]: Othello 1.3.361-362. 


Introduction to the Physics of Electron Emission, First Edition. Kevin L. Jensen. 
© 2018 John Wiley & Sons Ltd. Published 2018 by John Wiley & Sons Ltd. 
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where the top row holds for x = L/2 and the bottom for x = —L/2. From linear algebra theory, if the determinant of 
the matrix for a homogeneous equation (one of the form y \a) = 0) vanishes, then non-trivial solutions are possible 
(a trivial solution is where all the elements of | a) are zero). Setting the determinant of the matrix in Eq. (21.1) to zero 
(i.e., det(M) = 0) therefore requires 

e ikL - e ikL = 2 i sin (kL) = 0 (21.2) 


which is satisfied for k -*■ kj = jn/L where j is an integer. Putting that result into the equation from the top row of Eq. 
(21.1) (or the bottom, just pick one) shows that a + = —a_ = a jF and so the general solution is 


y/j(x) = cij sin 


C¥) 


(21.3) 


where x' = x — (L/2). The energy levels for the one-dimensional well with infinite sides is consequently 


E j 


(21.4) 


Elegant, but so what? Well, remember that in developing the canonical equations of electron emission in Part n, it was 
useful to switch over from an integration over dk to one over dE, as done in Section 7.3. When dealing with electron 
transport with scattering in bulk materials, energy integrations are advantageous. Consequently, if there is a restriction 
on what values k can take, better to know how the distribution of energy states, or rather the density of states (DOS) 
function T) l (E), is affected, 4 where, yes, the subscript refers to the number of dimensions of the well and, yes, higher 
dimensions entail other difficulties. 

The DOS function has already been encountered for the continuum case in Section 7.3, but the circumstance now is 
how discrete phase space points are handled. That makes the energy levels discrete, and counting the number of levels 
within a given energy range A E is the task. Graphically, there is a very simple way to do that by using a histogram, exactly 
as was done with incomes in Section 4.1 or for velocities in Section 5.1. Because Ej goes as;' 2 , the number of levels is 
expected to decrease in each bin of size A E as E itself increases (at least in one dimension). The calculation is simple: let 
the highest filled energy level in the well be p (the Fermi level at zero temperature, where there are no unoccupied levels 
below it as per Fermi-Dirac statistics (see Section 6.2)). Calculate the values Ej and put them into bins of size A E so that 
the energy of the last electron in just fits into the last bin for the one-dimensional case. For N b bins, then A E = E 20 /N b , 
for example. "Filling a bin" is taken to mean bin n holds all the energy levels for which (n - l)AE < Ej < nAE. With the 
filled bins, consider the resulting histogram that counts the number of energy levels Ej in each bin. If the number of bins 
is 20, then the number of levels in each bin looks like Figure 21.1 if JV = 2000 and L is cleverly chosen so that Ej = j 2 . The 
data seem to fall on a familiar curve, and indeed comparing it to a power law shows that the curve is well represented 
by the form y(E) tx 1/yjE. The suggested curve gets better as bins for larger n are considered. 5 


EXAMPLE: For an electron in an infinite well with a ground state energy of E 0 = 1 meV (so that Ej = j 2 ), find the 
width L of the well. 

SOLUTION: Let E 0 = E(j = 1) = n 2 h 2 /2mL 2 . Using m= 510999 eV/c 2 and he = 197.327 eV-nm, then L = 
19.3915 nm. When L is large, the energy levels are very finely spaced. 


What happens if the number of dimensions is now 2 and the well is of width L in both dimensions? Now the wave 
function is [43, 44] 

V nj (x,y) = a nj sin ^ ^ sin ^J|0 (21.5) 

(here, ignore the prime on the x' =x —(L/2) and / = y — (L/2): the discussion is more or less the same) and so 
E n j = (h 2 /2m)(n/L) 2 (n 2 +j 2 ) with the restriction being that the sum n 2 +j 2 must be such that E n j < iu. By straightforward 
extension, in three dimensions, this would become E njl = (h 2 /2m)(n/L) 2 (n 2 +j 2 +1 2 ) such that E n]l < iu. When the 


4 Calling the DOS D(E) like ICittel [86] does would have been pleasant, but that name has already been taken by the transmission probability. 
Ziman [82] favored but that looks like a number function. The differing notation of different authors is irksome. So D(E) is chosen, 

following the convention of Section 7.3. 

5 Such findings are not a surprise for those familiar with Eq. (7.13) and the examples which follow it. 
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y(n) ~ y(20) (20/n) 1/2 

°0 5 10 15 20 

bin (n) 


Figure 21.1 A histogram with 20 bins of the one-dimensional energy states for E, = (h 2 /2m)(jx /L) 2 for the infinite well with a width of L = n‘h/^j2mE 0 
with flj = E a being the ground state energy. The black dots represent the number of energy levels E. in each bin. A curve of the form y(20)sj20/n is also 
included to guide the eye, where y(20) = 51. 



Energy Bin 

Figure 21.2 A histogram with 20 bins of the one-, two-, and three-dimensional energy states E, cx j 2 , E, k cx n 2 +j 2 and E n „ oc n 2 +j 2 + l 2 , respectively. 
The curves to guide the eye behave as yfix) oc x~'i 2 , y 2 (x ) <x x° and y 3 (x) cx x 1 ^ 2 , respectively. 


histograms for two and three dimensions are plotted alongside the one-dimensional case (with all of the curves normal¬ 
ized so that they can appear on the same graph) and compared to best-fit power laws, it is found the two-dimensional 
case is well-fitted by y 2 (fi) = constant, andy 3 (f?) oc \[e, as shown in Figure 21.2. 


21.2 Spheres in d dimensions 

The World is Spherical...whether because this figure is the most perfect of all, as it is an integral whole and needs no joints; or because 
this figure is the one having the greatest volume and thus is especially suitable for that which is going to comprehend and conserve all 
things; or even because the separate parts of the world i.e., the sun, moon and stars are viewed under such a form; or because everything 
in the world tends to be delimited by this form.. .And so no one would hesitate to say that this form belongs to the heavenly bodies. 

- Nicholaus Copernicus 6 


6 Nicholaus Copernicus (trans. Charles Glenn Wallis), On the Revolutions of the Heavenly Spheres Great Books of the Western World, ed. Robert Maynard 
Hutchins, Vol. 16, Ptolemy/Copemicus/Kepler. Chicago: Encyclopaedia Britannica, 1952, p. 511. The "World" referred to not only the Earth, 
but also the heavens. To the modern ear, attributing divine status to geometric figures sounds strange, but the paradigms of that time were 
different, for example 76 years after Copernicus, Johannes Kepler related the orbits of the planets to musical scales in his work Harmonices Mundi 
(Harmony of the World), which also sounds (no pun) strange. Concurrently, Kepler also had to defend his mother against serious accusations of 
witchcraft. His times had peculiar notions, as perhaps all times do. 
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Clearly something deeper is going on, and that something is geometry: the number of discrete states in a d- dimensional 
phase space that lie in a particular energy bin, where energy is proportional to the square of the magnitude of a 
^-dimensional momentum vector k, is now a counting problem, but if the phase space points are equally spaced on 
a grid, then the count will be proportional to the volume of phase space given by product of the surface area of a hol¬ 
low sphere and the thickness of its shell. Knowledge of this was presumed in Section 7.3, but consider the problem 
in greater detail. Borrowing the "bin" analogy of above, the bin holds all the energies that lie in the "element" dQ. In 
one dimension, the differential element dQ. is a line, in two dimensions it is an area, and in three dimensions a vol¬ 
ume: each case corresponds to a d- sided object. The construction of the d-sided differential element dQ d = dk ] dk 2 ■ ■ ■ dk d 
proceeds by iteratively finding the differential element dk d that is orthogonal to all the other d - 1 elements. The pro¬ 
cess symbolically represented in Figure 21.3 shows how this is done for up to three dimensions, and the method can 
be generalized to higher dimensions: algorithmically, dQ 2 = dk 2 dQ l = (kdO)(dk) = kdkdO, and similarly, di 2 3 = dk 3 dQ 2 = 
(k sin Od<p)(kdkdO) = k 2 sin OdkdOdcp. For d dimensions, then. 


dQ d = l^-' dk (sin otfdBj (21.6) 

i =o 

The problems of interest constrain d < 3 where d8 i = 0 and 6 0 = cp are the usual polar and azimuthal angles, respectively, 
in spherical coordinates. 

The important point is that E(k) depends only on the magnitude k in the form of k 2 . Therefore, all the angular 
integrations can be performed to give: 


dQ l -*■ dk 
dQ, -► kdk 


J dq> = 2nkdk 
dQ 3 -* k 2 dk J sin Odd J d<p = 4 nk 2 dk 


(21.7) 

(21.8) 


The final step defines dQ d = (dQ d /dE)dE. From E = h 2 k 2 /lm and dQ d — D d (E)dE, it is then straightforward to show 
(observe the factor of 2/(2 n) d in each equation) 


(2 nY 2 h \ E ) 


(21.10) 

( 21 . 11 ) 

(21.12) 


from which the behavior of Figure 21.2 makes perfect sense. 



Figure 21.3 Spheres in d dimensions. Left, a one-dimensional "sphere" is a line k, and its element is dk. Middle, a two-dimensional "sphere" is a circle, 
and its "volume" element is kdkdtp. Right, a three-dimensional sphere really is a sphere, and its volume element is k 2 dksmf)d0d<p. 
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21.3 The Kronig Penny model 


The principal idea of Bloch's theory is the assumption that the interaction of a given electron with the other particles of the lattice may 
he replaced in first approximation by a periodic field of potential. With this model an interpretation of the specific heat, the electrical and 
thermal conductivity, the magnetic susceptibility, the Hall effect, and the optical properties of metals could be obtained. 

- R.D. Kronig and W.G. Penney [291] 

A very wide well bounded by infinite walls makes for simple expressions for T)(E), but whether such an idealized 
potential describes a bulk material needs consideration. Matter is complicated. In preparation for how atoms cause that 
complexity, consider the next simplest potential well with infinite walls, for which half of the well is at a higher potential 
than the other half. That is, instead of V(x) = 0 for -L/2 < x < L/2 (in one dimension), let the potential for x < 0 remain 
unchanged, but for x > 0, let V(x > 0) = V 0 = h 2 k 2 0 /2m, akin to a step barrier stuck into a well with infinite walls, as 
schematically suggested in Figure 21.4. Now what happens to the energy levels? 

How the energy levels change depends on whether k is smaller or larger than k 0 . Consider the case k > k 0 first, as in 
the limit k 0 —► 0, the findings of Section 21.1 should be reclaimed. Let the real k be defined by k = -\Jk 2 — k 2 . Finally, let 
region 1 (deep well) be for x < 0 and region 2 (shallow well) be for x > 0. The wave functions are specified by 

y/, (x) = Ae ,kx + Be~ ikx (21.13) 

y/ 2 (x) = Ce lKX + De~ iKX (21.14) 


where the coefficients and hj are to be determined by the boundary conditions. As is now routine, one equation comes 
from y/ x (0) = y/ 2 (0), and one from the derivatives y/,'(0) = \p' 2 (0). The third equation is from the left wall, i/x n {-L/2) = 0, 
and the fourth from the right wall, yr 2 (L/2) = 0. The conditions on the coefficients can now be put in a matrix equation 
Ml | u) where the matrix y arises from the boundary conditions and the vector | u) arises from the coefficients, or 


1 1 1 1 (A' 


ik -ik 

e -ikL/2 e ikL/2 

. 0 0 




(21.15) 


where each row encodes one of the aforementioned equations. To have non-trivial solutions of the equations, the deter¬ 
minant of the 4x4 matrix must equal zero. An evaluation of the determinant followed by collecting terms gives 


k ( e ikL + 1) {e iKL - 1) + x (e ikL - l) (e irL + l) = 0 (21.16) 

for which a clever use of the half-angle formulae for sines and cosines results in 

(k + x) sin [(k + x)L/2] = (k — x) sin [(k-x)L/2] (21.17) 



Figure 21.4 Representation of how a well with infinite side walls is modified to make a split level well such that V(-L/2 < x < 0) = 0 and 
V(0 < X < L/2) = V 0 . 
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In the trivial limit that kjk -» 0 (i.e., k -*• k) the right-hand side vanishes and the usual relation k = jn/L results, as it did 
in Section 21.1. The values of allowed k may be obtained graphically, solutions to the problem being where the rapidly 
oscillating function on the left-hand side crosses the much more slowly varying function on the right. 

On the other hand, the case k 0 > k results in the substitution k -> Ik in the above analysis. Not using the half-angle 
formulae this time results in 

- =tan(£L/2)coth0rL/2) (21.18) 

Now consider the limit when k g -*■ oo, which drags k with it, and makes \/k 0 and tanh(K\L/2) —► 1. In that limit, 

solutions exist for k = 2jji / L, that is, only every other energy level survives as one would expect for a well half the size. 
For general k 0 , the number of energy levels is between the two limits and decreases as k 0 increases. 

When the infinite walls are removed and the deep well/shallow well is repeated over and over, something akin to the 
one-dimensional potential shown in Figure 21.5 results (the labeling of "superlattice" to draw attention to the similarity 
of the potentials of Section 19.1, and because "ion cores" associated with the bulk material atoms have not yet been 
introduced). The general problem, in which the well regions and barrier widths are of different size, is known as the 
Kronig-Penny model [52, 86, 291]. A flavor of what is to come has already been hinted at in Figure 19.8: not only will 
discrete energy levels in the wells (k < k 0 ) arise, but regions where there are no allowed solutions for (k> k 0 ) arise as well, 
and the latter will come to be called "bands." 

Imagine the number of wells significantly expanded to N wells: these will come in time to be associated Coulomb poten¬ 
tials of the ion cores but first, rectangular wells look nothing like that, and second, the problem at hand is one-dimensional 
and a lattice of atoms in a bulk material is very much three-dimensional. Nevertheless, keeping the destination in mind 
makes the journey more comprehensible. If N is large, the first well and the Nth well have very little influence on the 
center wells, and most of the wells can be visualized as existing in similar circumstances, but to really remove any dis¬ 
tinguishing feature between them, impose periodic boundary conditions so that the N + 1 well is taken as the same as 
the first well. Now all the wells are equivalent with respect to the physics of interest, such as energy levels. 

A translation (or rotation, if periodicity is obtained by attaching the Nth well to the first well to form a ring of N 
links) that does not change the energy levels means that the wave function v P(x) describing the ring is the same as for the 
rotated ring up to an overall phase factor. If the distance from center to center of two adjacent wells is a distance 1 = a + b, 
where a is the well width and b is the barrier width, and Nl = L, then it is hopefully apparent that V(x + f) = V(x). But 
this relation also has to be reflected in the wave function itself, with the only difference being that if the wave function 
is shifted a distance a, that is only reflected in an overall phase factor. Therefore, the behavior of V(x ) demands 

¥(X + a) = e'v'Vix) (21.19) 

Periodicity of the boundary conditions (i.e., the identification of the N + 1 well with the first well), however, means that 


>P(x + Nl) = »F(x) 


but that simply means that exp(/N<p) is one of the N roots of 1, or 

_ 


Coupled with Eq. (21.19), that entails 



'V(x) = e a,z]x/N, u k {x) 


(21.20) 


(21.21) 

(21.22) 


Figure 21.5 Representation of how the split level well of Figure 21.4 is repeated to make a superlattice (the infinite side walls are now removed; the 
faint white lines at the barrier edges are only to guide the eye). The wells and barriers need not be equally wide. 
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where u k (x) is a function with the periodicity of the lattice, or u k {x + l) = u k {t). Identifying kj = 2nj/Nl = 2nj/L, where 
L » l is now the dimensions of the box containing all the wells, results in 


<P(X) = e lk > x u(x) 

(21.23) 

which is the one-dimensional version of the Bloch theorem: when the potential is periodic, the eigenfunctions are 
the product of the plane wave with a function u(x) that has the periodicity of the lattice [82, 86, 104], As usual, let 
V 0 = h 2 kl/2m and k 2 = \k 2 0 — k 2 \. When Eq. (21.23) is inserted into Schrodinger's equation, two equations result: the first 
(call it u a (x)) corresponds to a well and the second (call it u b (x)) corresponds to a barrier. The equations they satisfy are, 
for k 0 < k, 

|d x 2 + 2 ikjd x + k 2 - k 2 j u a (x) = 0 


| d 2 x + 2 ikjd x -k 2 - k 2 1 u b {x) = 0 

(21.24) 

Solutions are found by trying u = e lwx in each region and finding what w is. For each region. 

it is found 

w 2 a + 2 kjW a + k? - k 2 = 0 

(21.25) 

wl + 2kjW b + k 2 -K 2 = 0 

(21.26) 

where it is seen that the role k plays for w a is the role that /V plays for w b , much like for the rectangular barrier of Section 
18.1. From the quadratic equation, two solutions exist, and it is straightforward to show that they are w„ =kj±k and 
w * = kj ± ix, with the most general wave function being a combination, or 

u a (x) = A exp [~i(kj - k)x\ + B exp [—1(% + k)x\ 

(21.27) 

u b (x) = Cexp [(x - ikj)x] + Dex p [-(ic + ikj)x] 

(21.28) 

The matching of the wave function and its first derivative at x = 0 where the first well meets the first barrier specifies 
two equations. Two more are specified by the periodicity conditions that relate the u a and u b wave functions and their 
derivatives. Because these equations are set to 0, they can be multiplied by any convenient factor that will make taking 
the determinant of the matrix y simpler. Therefore, consider the four equations given by 

k«(0) - u b { 0) = 0 

(21.29) 

d x {u a (x) - u b {x)} x=0 = 0 

(21.30) 

e'V {u a (a) - u b (-b)} = 0 

(21.31) 

e ikja &{»M} x=a ~ Mu b {x)} x= _ b ) = 0 

(21.32) 


In matrix evaluations, there is no substitute for cleverness, and in creating y from the above four equations, the combina¬ 
tions A±B and C±D appear, so that rewriting the equations in terms of A' = A + B, B' = A - B, C' = C + D, and D' = C— D 


simplifies the resulting matrix to 

' i 

0 

-1 

0 


-ikj 

ik 

ikj 

_ 

y = 

cos {ka) 

i sin (ka) 

-cp cosh(xi;) 

cp sinh(K-fe) 


k h x 

h-2 

<ph 3 

—<ph 4 ) 


where, for compactness, the following abbreviations are used: cp = e lk > a and 


hy = -ikj cos (ka) - k sm(ka) 
h 2 = kj sin(fat) + ikcos(ka) 
h 3 = ikj cosh(icfc) + k sinh(*-2>) 

h 4 = ikj sinh(xi>) + k cosh (jcb) (21.34) 
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Figure 21.6 The behavior of the right-hand side of Eq. (21.36) (called ytk)) for parameters such that bk 0 = k/2 and a = 2b (red), and a = 4b (blue). 
Because the left-hand side of Eq. (21.36) is < 1, the greyed regions in the figure correspond to values of k that are forbidden. As a result, there are gaps, 
or bands, between allowed values of k, and therefore E(k). 


The advantage of the relabeling of the coefficients is to render the top row of Eq. (21.33) easier with respect to the 
evaluation of determinants by the method of cofactor expansion [91]. Even with the simplifications, it takes a bit of 
effort to show that setting the determinant of M to 0 results in the relation for k<k 0 : 

cos (kjl) = ( ^2^ ^ sin(kfl)sinh(icfr) + cos(ka) cosh(jrfr) (21.35) 

where, recall, 1 = a + b. When k > k 0 , then k —> ire and so 

cos (kjl) = - ( * 2 Jck ) sin(K-fc) + cos (ka) cos(rcb) (21.36) 


The limits of these equations are illuminating. For the case where the barriers between the wells are removed, the 
energy should be given by that of the plane wave states, and indeed as k 0 -* 0, then rc -* k in Eq. (21.35), or 


cos (kjl) = cos {ka) cos (kb) - sin (ka) sin (kb) = cos (kl) 


(21.37) 


from which E(k = kj) = h 2 k 2 /2m, as expected. In the opposite limit, where the height of the barriers becomes large, or 
k. -* oo, then 


sin(ka) _ _2k 
cos (ka) k 0 


(21.38) 


or k = (jnja) for integer; (compare the implications of Eq. (21.2)), that is, the energy levels of the single well with 
infinite walls of Section 21.1 are recovered. Such levels are very narrow, but when k 0 is of finite size, some wiggle room 
is recovered and the levels broaden, particularly for energies near the barrier height V 0 . 

For the general k > k 0 case, though, the left-hand side of Eq. (21.36) oscillates between +1 and -1, but the right-hand 
side can take on values outside those limits. Those values of k that are associated with the right-hand side being greater 
than 1 or less than -1 are therefore disallowed: in other words, bands occur as anticipated by Figure 19.8, and correspond 
to gaps at finite intervals in the continuous energy spectrum of the electron as it moves through the lattice. Graphically, 
this is seen in Figure 21.6, where the right-hand side of Eq. (21.36) is shown as a function of (k — k 0 )/k 0 . The regions 
for which |y(k)| < 1 (within the band of white) represent where allowed k values exist. By contrast, the lines in the gray 
region correspond to regions where no k value satisfies the relation of Eq. (21.36). A close-up of the band regions is 
shown in Figure 21.7. 


21.4 Atomic orbitals 

"Modern materials science is at least as concerned with defects in crystals and noncrystalline materials as it is with the properties of perfect 
crystals. (In a phrase often attributed to F.C. Frank “crystals are like people - it is the defects in them that makes them interesting.") 
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Figure 21.7 Close-up of Figure 21.6, showing the region of bands (pale green) for the a = 2b case. 


Band theory and free electron theory have very little to say about such things, because they rely on perfect translational symmetry and 
Bloch's theorem. To understand the properties of defects...we have to take an altogether different viewpoint and set the problem up in 
real space. * 

- Adrian P. Sutton [104], Chapter 1. 

A different outlook to envisioning free electrons roaming over the entire bulk material is to consider their behavior near 
two interacting atoms. At some point, the differing views will need reconciliation, but for now consider two hydrogen 
atoms (the simplest type) and how they interact. Quantum mechanically, a state vector I 1 ?) will describe them both, and 
the Hamiltonian H acting on the system of both atoms will give the total energy H |*P) = E I 1 ?), or specifically. 


li{Ci ki) + C 2 \ip 2 )}=E{C l \ Vl ) + C 2 \y, 2 )} 


(21.39) 


If the hydrogen atoms are far apart from each other, then both yq(?) = ( r\y / l ) and i p 2 (f) = (r\w 2 ) are going to look very 
much like the hydrogenic wave functions of Eq. (9.33), and (yq |yr 2 ) « 0. It is convenient to retain this approximation and 
take y/j(j) to be those hydrogenic wave functions to leading order and to take their overlap as (yq |y/ 2 ) = 0. In actuality, 
their overlap is not zero, but it does fall off exponentially with r, and it can therefore be regarded as negligible, simplifying 
the problem considerably. For both atoms. 


Hj k> = {H + V i (?) } K> = E » \vj) ( 21 - 4 °) 

Compare that to the full Schrodinger's equation, which is 

#!¥> = { T ^ + V l (r)+V 2 (r)\ |T> = EPF) (21.41) 


That means there are two equations for the two unknown Cj in Eq. (21.39) to be determined by (yr ; |ff| v P) = ECj for 
j = (1,2). For convenience, take the hydrogenic wave functions to be in their ground state (Is), so that the form of V(r) is 
the same for both. In writing out the equations for Cj, terms of the form (i/q | V(r) \y / 2 ) arise and represent the influence 
of one nucleus on the other. Because the hydrogenic wave functions are real for s states. 


(¥i\V(r) \v 2 ) = (y/ 2 \ V(r) |yq) = -8E 


(21.42) 


and so E = E 0 - 8E, the sign convention chosen so that 8E itself is positive (the potential is attractive and the energy 
change therefore negative). The two equations for the C ; encapsulated in {wj\H - F’I'P) = 0 can therefore be put into a 
matrix equation of the form 



(21.43) 
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For solutions to exist, the determinant of the matrix of coefficients must equal zero, and so 

(. E o -E) 2 -8E 2 =0^-E=E o ±SE (21.44) 


As in Section 8.1.2, the eigenvectors can be found depending on whether the + or - sign is chosen for the eigenvalue. 
The two eigenstates are determined to be C 2 = ±C, for E = E 0 ± 8E, and so the normalized states are 


k±) = ^|{ki)±|i 


a)} 


(21.45) 


corresponding to a symmetrical (+) and anti-symmetrical (-) state when |yz,) and \y/ 2 ) are interchanged. In terms of 
charge density, the antisymmetrical density vanishes at the point equidistant between the two nuclei (think 

of the antisymmetry of sinx about x = 0, for example). In contrast, does not. Physically, the antisymmetric 

state pushes electron charge out of the center region and is therefore identified as an antibonding state, compared to 
the symmetrical case where charge is piled up between the nuclei and pulls them together, identifying it as a bonding 
state [104]. The point here, however, is that a consideration of the local interaction between the atoms has resulted in a 
discrete energy shift between one state and the other in a way that is a prelude to how bonding and antibonding between 
dissimilar atoms introduces the notion of "electronegativity" (how much different atoms seem to crave an electron they 
share) in Section 21.5, and anticipates an analogous difference in the behavior of the electron density for energy states 
just above and below what will be called the "band gap" for a sinusoidal potential of Section 21.6. 

Regarding the bonding and anti-bonding states, these eigenstates are the analogs of the "stable strategies" discussed in 
Section (8.1.3): a system set up in such a state persists and its density p ± = ( y ± |y/ ± ) is time independent. On the other 
hand, what if the electron were confined initially to one of the atoms (say, atom 1, or QfO) = 1, C 2 (0) = 0). How then 
would the density associated with atom 1, or |Ci(f)| 2 , evolve over time? Such an initial state is a superposition of the 
symmetric and antisymmetric states in equal measure. From the time-dependent Schrodinger's equation ihd t \y/(t)) = 
H |y/(0), it follows that 

W(t)l = ~ {e iE + t/h k+> + e iE ~ t/h |vO) (21.46) 

V2 

Then, knowing C x (t) = (y/, |i/z(r)) leads to the conclusion 

Q(0 = | (?""+' + e m -‘) = e' E « t/h cos (^t) (21.47) 

The same logic shows that C 2 (f) is the same form with cos -*> sin, so that the probability density of the electron on atoms 
1 and 2 is 

|C,(0l 2 =cos 2 (yd); |C 2 (0|; 2 = sin 2 (^f) (21.48) 


respectively, and |C,(0l 2 + |C 2 (f)| 2 = 1, as it must. The location of the electron oscillates back and forth between atoms 
1 and 2 over time by tunneling through the potential barrier between them. When the atoms are too far apart, then 
the overlap factor SE is small and is an indication that the barrier between the atoms to the electron's migration is large, 
making the tunneling probability small; conversely, when the atoms are close, the overlap is significant and the passage 
back and forth more likely. The language of discourse is comfortably recognized from the discussion of the tunneling 
probabilities associated with a barrier in Section 9.2.3. 


21.5 Electronegativity 

I will pick up the hook. 

You -will see something new. 

Two things. And I call them 
Thing One and Thing Two. 

- Theodor Geisel 7 


7 Theodor Geisel (aka Dr. Seuss), The Cat in the Hat. New York: Random House, 1957, p. 33. 
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The most obvious generalization to the two hydrogen atoms problem of the previous section is two dissimilar atoms, 
called 1 and 2 in deference to convention. When the atoms are isolated, H 0 1 Wj) = Ej \vj) for each atom. This changes 
matters because now the ground state energy of each atom is different. The modification to Eq. (21.43) is now 


(e.-e 

l -8E 



(21.49) 


Although visually not much of a change, it results in the eigenstate energies for the bonding and antibonding states 
acquiring an interesting twist. Setting the determinant of the multiplying matrix to zero as before gives 


(£, - E)(E 2 - E) - 8E 2 = 0 


(21.50) 


With the introduction of the average energy E 0 = (E ] + E 2 )/2 and the difference energy A E = (E l —E 2 )/2, E is then 
(compare Eq. (21.44)) 

E ± = E 0 ± \/8E 2 + A E 2 (21.51) 


As before, the eigenstates are obtained by putting each eigenenergy E ± into the matrix equation of Eq. (21.49) and finding 
the relationship between C, and C 2 . Those relationships are 


C 2 (E ± ) _ A E _ I (AE\ 2 _ 1 

q(i? ± ) " 8E + V V 8E ) ~ ^TT^± r 


(21.52) 


with the (+) sign on E ± associated with the bonding state |i/r + ) and the (-) sign associated with the antibonding state | y/_). 
The factor r = A.E/8E is the ratio of the difference in energies {E } - E 2 )/2 factor to the overlap factor 8E. The behavior 
is shown in Figure 21.8: when r -* 0, both atoms are identical and the values of the coefficients are equal in magnitude 
(C 2 = ±C,) whereas r » 1 signifies one atom has a much deeper (in terms of well depth) ground state, and the atom 
with the higher electronegativity starts to dominate the time the electron spends with it. In the limit of a purely ionic 
bond r will be very large: when it is infinite, the "shared" electron ceases to spend any time in the vicinity of the lower 
electronegative atom. In short, one atom attracts the shared electron more strongly than the other. Pauling empirically 
described the electronegativity E P of an atom as a measure of its ability to pull electrons to itself from its neighbors: atoms 
with a greater electronegativity therefore hold on to the electron more of the time. In Pauling units, 8 fluorine is 3.98 and 
cesium is 0.79. The idea of an electronegativity of an individual atom is unusual because generally atoms are in molecules 
and electronegativity refers to how atoms behave there. An alternate definition due to Mulliken defined electronegativity 
E u in terms of the energy required to rip an electron off an atom (I A , or ionization potential) and the energy added to an 



r = AE/8E 


Figure 21.8 Ratio of the coefficients of the states 11) and |2) for dissimilar atoms, as per Eq. (21.52), for the antibonding (solid circles; E_) and bonding 
(open squares; E + ) states of Eq. (21.51). 


s The units have been revised since Pauling's introduction of them: Pauling had set fluorine to be 4.0. 








268 Chapter 2 ] 


atom when an electron joins it (/, or electron affinity): it is therefore not predicated on an arbitrary scale and is defined 
by 

IA + / 

Em = (2L53) 

with units of eV (or kJ/mole, depending on preference). The relationship between E P and E m is reasonably linear and 
given by 

E P = 0.336 (e m — 0.615) (21.54) 

The aforementioned values for fluorine and cesium follow the general trend that moving up and to the right on the 
Periodic Table corresponds to an increase in electronegativity, but moving down and to the left corresponds to a decrease, 
trends seen in Figures 21.9 and 21.10. Intuitively, this makes sense: the alkalis (first column of the Periodic Table) have 
low electronegativities because they readily abandon their outermost electron to other atoms. Cesium being so large 
would have the lowest electronegativity, and the alkali earth metals such as barium would not be far behind. 

Unexpectedly, tungsten has a strong electronegativity compared to other metals near it on the Periodic Table. The 
passing comment that early work on thermionic emission, discussed in Section 10.1, investigated the effects on emission 
due to coating tungsten with cesium and barium in particular, takes on a whole new significance. Tungsten covets the 
electron that cesium or barium contributes more than they do, resulting in a more substantial lowering of the work 
function barrier. A simple argument shows why, one that will be significantly expanded when Gyftopoulos-Levine theory 
is introduced (Section 31.8). A flat surface of tungsten upon which a layer of cesium rests will have the top layer of 
tungsten atoms dominate the shared electrons that the cesium top layer offers. A layer of cesium atoms will therefore 
acquire a positive sheet charge density and the topmost layer of tungsten atoms a negative sheet charge density on 
average, much like a capacitor (a visualization is shown in Figure 31.29). The potential drop across a capacitor of plate 
separation D is given by AV = q(jD/e 0 , where a is the (number) charge density on one of the capacitor plates. Assuming 
that the charge density is a fraction / of an electron (reflecting the reduced amount of time the electron spends with the 



Figure 21.9 Pauling electronegativities of the elements, excluding the noble gases and the lanthanum and actinium series. 



Figure 21.10 A graph of the electronegativities of the alkali and alkali earth elements (Groups 1 and 2, period indexed by;), and transition metals 
(Periods 4, 5, and 6, group indexed by;'). The background color corresponds to the color scheme used in Figure 21.9 to denote E p magnitude. Light 
green halos identify particular elements of interest, namely hydrogen (Pl-Gl), tungsten (P6-G6), barium (P6-G2), and cesium (P6-G1). 
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cesium atom) per unit area (taken to be (2 a c ) 2 , where a c is the radius of the cesium atom, or 0.225 nm). Let D « 2a c . 
Then the potential energy drop across the fictitious capacitor standing in for the dipole layers is 


AO ^ q 2 o 
~D*~^ 


AO = 8nQ 


(21.55) 


EXAMPLE: Assume the radius of a cesium (Cs) atom in a surface monolayer is its covalent radius of a c = 0.253 
nm (a covalent bond exists between atoms with an electronegativity difference smaller than 1.7). Assume one Cs 
atom occupies a square area of (2 a c ) 2 at full monolayer coverage. Assume that the average fractional charge on the 
Cs atom is fq. Using the empirical value of AO = 2.9 eV (tungsten (W) has a work function of 4.5 eV; the minimum 
work function for W partially coated with Cs is 1.6 eV.), and knowing that the minimum work function occurs for 
a partial monolayer of 60%, find/. 

SOLUTION: Let d = 2a c . Taking the number density of the Cs monolayer to be a =fq6/(la c ) 2 then from Eq. (21.55), 


/ = 


a c AO 
8 nQO R 


(21.56) 


EXAMPLE: Although the capacitor model of Eq. (21.55) is crude, use Eq. (21.56) of the previous example to 
provide a guestimate of r. 

SOLUTION: When C, = ±C 2 the magnitude of both is 1/2. Therefore, let / be the amount in excess of 1/2. The 
two equations to be solved are therefore 

C 2 + C 2 = 1 
Cf-C 2 = 2f 

which is solved by C, = y/(l/2)+f and C 2 - s/{\/2)-f. Using Eq. (21.52), it follows r = 2f/yJ\ - 4f 2 » 0.281. 


21.6 Sinusoidal potential and band gap 

I am death, shatterer of worlds, 
annihilating all things. 

- Bhagavad Gita 9 

Although a ring of atoms can be used to illustrate further how band gaps and changes to the electron density arise as a 
consequence of the overlap integrals in the LCAO framework (as in ref. [104]), consider instead a potential closer in spirit 
to the Kronig-Penny model, in which V(x) = —V 0 cos(Apc) with k l = 2nl/L and L is the width of the one-dimensional box. 
The negative sign insures that x = 0 is the location of a minimum. The problem is best handled using time-independent 
perturbation methods [41, 43, 44], The present treatment (based on ref. [85]) side-steps much of a discussion that 
would be entailed were a general potential V(x) being considered by considering instead a cos variation, which sim¬ 
plifies the treatment by allowing the utilization of methods and notations introduced in Section 8.2.2, along with the 


9 Stephen Mitchell, Bhagavad Gita: a new translation. New York: Harmony Books, 2000, verse 11.32, p. 138. The passage is spoken by Krishna 
to Prince Arjuna when Krishna reveals his divine form. Robert Oppenheimer recalled the passage when the first atomic weapon test was 
conducted at Alamogordo (July 16, 1945): another passage describing Krishna (verse 11.12) had an uncanny resemblance to the Trinity test 
itself, and its wording was used in the title Brighter than a Thousand Suns: A Personal History of the Atomic Scientists by Robert Jungk and James 
Cleugh. Krishna, though, is not associated only with annihilation, but also with creation. 
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creation and annihilation operators of Section 9.1.2. The notation gets difficult quickly, so to simplify it, instead of refer¬ 
ring to momentum eigenstates as | k n ), represent the momentum states instead as \n). Thus, for example, Eq. (8.46) is 
generalized to 

X±\n) = \n±\) (21.57) 

where X <-»■ X_ and A 1 * *+ X + , which mimics the creation/annihilation operators of the harmonic oscillator in Eqs (9.85) 
and (9.86). Switching to a (±) subscript is convenient to avoid entanglements with powers of the operators, and because 
a dagger notation is intrusive. The potential V(x) can be cast in terms of X ± by 

VQc) = ~V 0 (e ik >* + e~ ik ' x ) = -±V P (x\ X l + + X l _ |0> (21.58) 

The utility of the notation is powerful when coupled with the usual formulation of the perturbation technique. Because 
(n\l) = 8 nl , then (n\X l ± \0) = 8 n±l , and Kronecker delta functions make summations rather easy. The perturbation 
methodology expresses the potential matrix elements, and changes to the wave function, in terms of the unperturbed 
Hamiltonian eigenstates. The "unperturbed" Hamiltonian is taken to be H 0 such that 

h 2 k 2 h 2 kl 

H 0 |„) = —|,) = ^|n) (21.59) 

The question then becomes, what are the energies and eigenstates of (H 0 + V) |i //) = E\y/)? It would be nice to express 
the perturbed parts of the eigenvectors using a notation much like | n), but doing so is not so simple. A workable but not 
altogether satisfactory notation is to introduce numerical subscripts to n which denote the "order" of the perturbation. 
The perturbation order can be understood as follows. Suppose the potential operator is turned on mathematically by 
increasing a factor X from 0 to 1, where H%= H 0 + XV and \n x ) = \n 0 ) + X \n x ). It is reasonable to then expect that the 
modifications to the wave function are turned on in a similar way, or 

(H 0 + XV) (|« 0 ) + 2 K)) = K) + XE, \n v ) (21.60) 

so that now the \n 0 ) are the eigenstates of H 0 . But this seems odd - there are factors on the left-hand side that go as 
X 2 , without a corresponding term on the right. This is fixed by adding on higher order perturbation terms to both the 
energy and the wave function, but the process is unending, and so 

(H 0 + W) Jj X s | nj) = £ X 2 Ej \nj) (21.61) 

To reiterate, the n in |n ; ) refers to k n , the; subscript to the order of XX. The point is, the value of X is arbitrary, and so the 
terms associated with each X> on the left must correspond to the same X> on the right. The first few, for X°, X 1 , and X 2 , are 

H 0 \n 0 )=E 0 \n 0 ) (21.62) 

Ho K) + * |«o) = E 0 | n x ) + E, \n 0 ) (21.63) 

H 0 \n 2 ) + y\ni)=E 0 \n 2 )+E l \n l ) + E 2 \n 0 ) (21.64) 

where E 0 = Ti 2 k 2 /2m. A second requirement is that the wave function be normalized, or (n\n) = 1, and this, too, creates 
a set of conditions for each power of X 1 encountered because | n) = £ X |n ; ). The first three for X°, X 1 and X 2 are 

1 = ( n 0 \n 0 ) (21.65) 

0 = <Wol«i) + <«il«o) < 21 - 66 ) 

0 = («ol« 2 ) + («il«i) + <» 2 l«o) (21-67) 

and so on for the higher order terms, but of the batch the first two are particularly important. The first of the equations 
entails that the unperturbed wave functions corresponding to |« 0 ) are already normalized. The second equation entails 
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that (n 0 \n t ) = 0, that is, to order 2 1 the perturbed wave functions are orthogonal to the unperturbed wave functions. 10 
However, that means if Eq. (21.63) is multiplied on the left by (n 0 \ then 

(n 0 \v\n 0 ) =M« 0 |n 0 ) = E i (21-68) 


which is a standard result. But when Eq. (21.58) is taken into account, it is seen that the left-hand side is 0 for the 
sinusoidal potential, and so the first order energy change for the sinusoidal potential perturbation vanishes. That is rather 
important. It means that to leading order, the energy is still given by the friendly relation E = h 2 k 2 /2m. 

Having found that the energy does not change to first order, the question turns to the wave function, and from it the 
density profile. To zeroth order, the density is constant, as 


Z/ro (W) ( w oK) = Z/ro OW) (21-69) 

is position independent. To find the first-order perturbation | n x ), return to Eq. (21.63) and rewrite it in light of E y = 0 
for the sinusoidal potential, to obtain 

(E 0 (k n )-H 0 ) \n x ) = V\n 0 ) (21.70) 

where the explicit dependence of the energy terms is shown. Multiplying all this by the identity matrix I does not 
change anything, but let I be given by 

7 =X bo) Ool < 21 - 71 > 


(compare Eq. (8.30)). Observe that the terms for j = n on both sides vanish, and so the summation can be constrained 
by the restriction; n. Equating the coefficients of the remaining |/ 0 ) then gives 




E 0 (n) — E 0 (j) 


(21.72) 


where for convenience the notation is modified so that E 0 (k n ) -*■ E 0 (ri) (freeing up k to be used as a summation index 
below) and so 


p n (x) = |(x|«>| 2 = 1 +E! +S 2 

s y (x\jo)(jo\v\n 0 )(n 0 \x) + c.c. 

1 £ E 0 (ri) — E 0 (j) 

^ _ yi yi ( x \ k o) (K\ V \ n o) ( n o\ V bo) (7 qI x ) 
2 " i+nkn (£ 0 («) - E 0 (j)) (E 0 (n) - E 0 (k)) 


(21.73) 

(21.74) 

(21.75) 


where \{x\n 0 )\ 2 = 1 and the notation c.c. means complex conjugation. 11 Now the consequential form of V in Eq. (21.58) 
can be used to great advantage because 


<; 0 |2il« 0 ) = 5 jin±l 


(21.76) 


which makes the summations much easier. The undaunted shall find that 


4V 0 cos fax) 
E 0 (/)-E 0 (2n) 


E 0 (l ) (E 0 (l) - E 0 (2n)) 


^ cos(2 k t x) + 


E 0 (l) + E 0 (2n)} 
E 0 (l) — E 0 (2n) J 


(21.77) 

(21.78) 


10 Well, not quite: the second equation simply says the real part of (k 0 I«i) vanishes, but because the wave functions are specified up to an 
overall phase factor, the wave function can be rotated until only the real part remains, and so the statement is essentially correct (see ref. [41 ]). 

11 This equation is less complex than the more general perturbation equation because properties of the sinusoidal potential V(x) have been 
invoked. 
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Analogous to E 0 (j) = (2 k 2 Ti 2 /ml?)j 2 , let V 0 = (2n 2 h 2 / mL 2 )v 2 , which allows S, and S 2 to be more compactly written as 



(21.79) 



(21.80) 


where the denominators are the consequence of simplifications like E 0 (n ±1)- E 0 (n) <x 1(1 ± 2 n) when processing Eq. 
(21.72) and the combination of the fractions in which they reside. 

When their denominators are looked at closely, Eqs (21.80) are rather remarkable, given that they give rise to a sinu¬ 
soidal perturbation in the electron density that matches the period of the potential. Remember that the lattice periodicity 
matches that associated with the ionic cores that exist in a real bulk material, thereby making l a rather important index. 
As a result, both Sj and S 2 , going as l/l 2 and l// 4 , respectively, will be small except when 2n = l ± 1 (2 n ± l because of 
the restriction imposed by Eq. (21.70)). As a result, when the n of k n jumps the value n = 1/2, the whole behavior of 
the density term p n (x) = |(x|k„)| 2 switches from being more piled up over the ion cores corresponding to the troughs 
of the sinusoidal potential to being less piled up over those cores, in analogy with the bonding/antibonding charac¬ 
ter of the two hydrogen atoms (see discussion following Eq. (21.45)), and as would be seen for a ring of atoms in an 
LCAO model [104], This follows because as n = (1/ 2) - 1 goes to n = (1/2) + 1, the denominators l 2 - An 2 change from 
4(1- 1) to -4(1+ 1), and so the sign of the denominators changes: what was a trough in the density now becomes a 
peak. 

More realistic potentials are not sinusoidal, but they do have periodicity to them, and so the potentials can be 
represented by their Fourier transforms V(x) = cos (k,x). As a result, as with the Kronig-Penny model, band gaps 
open up in the E(k) relations. When that happens, E(k n ) is not parabolic with k n near those edges where n « 1/2, 
but away from there the density p(x) is reasonably constant and the relationship between E(k) and k is reasonably 
parabolic. 

21.7 Ion potentials and screening 

...the effects of [electron interactions] would be mollified by electron correlation, tending to keep electrons apart, and surrounding any 
given electron by a virtual hole in the charge cloud. This hole has the effect of a positive charge, which screens out the electric potential 
of the electron at all but very short distances. Two electrons could then only interact if they came very close. Such a screened potential 
occurs naturally in any system where a cloud of electrons is available to spread itself out about a Coulomb source. 


- John M. Ziman [82], p. 159. 


The extension to three dimensions complicates the description of materials and crystals, requiring the introduction of 
Bloch states, Brillioun zones, and reciprocal lattice spaces [53, 78, 86, 292]. It shows that bands can overlap: for a metal, 
the valence and conduction bands do so, so that instead of the valence band filling up completely, the conduction band 
becomes partially filled. For an intrinsic semiconductor (a semiconductor that is without any dopant species), a band gap 
between the conduction and valence band exists with the (negative) Fermi level equidistant between them, and electrons 
in the conduction band are a consequence of thermal excitations from the valence band. Doped semiconductors can have 
positive Fermi levels measured from the bottom of the conduction band, but the carrier density is still well below that of 
metals. This will have consequences for how external fields are shielded compared to metals, as taken up in Chapter 22, 
but more immediately it affects how the Coulomb potentials associated with the ionic cores are shielded. 

Mobile electrons in the conduction band will migrate so as to surround the bare ionic charges of the lattice atoms and 
shield their influence. Changes in the density of the electrons are reflected in changes in the chemical potential p, or 
Sp = 5([), and governed by (see also Section 25) 



(21.81) 
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21.7.1 Low temperature/high density 

How frozen and faint I then became, ask it not, Reader, for I do n 


% because all speech would be little. 


- Dante Alighieri 12 


For conditions such that pp » 0, such as are characteristic of high carrier concentration and/or low temperature (see 
Section 6.4 for p > 0), then 


&4> Jo (i + 


(1 +e iKE- li ) 


as a consequence of Eq. (7.24), where 1 3(E) = Z3 3 (E) is the three-dimensional density of states of Eq. (21.12) (the 3 index 
is dropped for simplicity), and the relation holds for Pp » 1. Thus, 


Sp (r) « V(p)ScP (r) 


(21.83) 


The Coulomb potential is rotationally symmetric, so from Poisson's equation, 

-T-7- (r^^-Stp) = —Sp = 16 xQD(p)6(p (21.84) 

r 2 dr \ dr ) e 0 

The right-hand side is of the form k 2 S<p. Equivalently, one can show that a solution of the form cp = (4Q/r) exp (—kr) is a 
solution, 13 but whichever path one favors, it follows that 


k=(\6nDtp)Q) l ' 2 = 


Vi 


k TF 


(21.85) 


where a 0 is the Bohr radius, and k TF is the Thomas-Fermi screening constant. 


EXAMPLE: Find k TF for copper, where p = 7 eV. 

SOLUTION: Evaluate k F = sjlmp/h = 13.5546 nm _1 . Then, in units of eV, nm, fs, and q. 


k TF - 


4- 13.5546 
k- 0.0529177 


1/2 

I = 18.0592nm _1 


21.7.2 High temperature/low density 


Till swoll'n with cunning, of a self-conceit, 

His waxen wings did mount above his reach 
And melting, heavens conspired his overthrow! 

- Christopher Marlowe 14 

For conditions such that -Pp » 1, which are characteristic of low carrier concentration and/or high temperature 
(Section 6.4 for p < 0), then the leading order electron density is given by the first term in Eq. (6.37). Take the background 


l2 The Divine Comedy of Dante Alighieri (trans. Charles Eliot Norton), Great Books of the Western World, ed. Robert Maynard Hutchins, Vol. 21, Dante. 
Chicago: Encyclopaedia Britannica, 1952, Canto XXXIV.16. In Dante's Inferno, the ninth circle was a frozen, hardened wasteland. 

13 If one is unhappy with the ansatz approach here, the problem is revisited in Section 26.6, where it is put on a better footing. The point here 
is to show how the equations introduce length scales that identify relevant terms like k TP without having to solve equations like Eq. (21.84) 
explicitly. 

14 Christopher Marlowe and S. Barnet, Doctor Faustus. New York: New American Library, 1969, Prologue, p. 23. The passage refers to the Greek 
myth of Icarus, who flew on contrived wings in the rarefied upper atmosphere where the heat of the sun melted the wax holding the feathers 
in place, causing Icarus to plunge to his death. The story has long been a metaphor of hubris. 
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positi ve charge as fixed in position. In a Jellium-like model, where the background charge is p 0 , then the electron density 
near the charge will be p = p 0 exp(/h/>), so that Poisson's equation becomes 

<‘"-‘1 (21 - 86) 

If the electrons are not large in number so that shielding is weak, and if the temperature is sufficiently high, /?<£ will not 
be large, then with the approximations e?* « 1 + fitp and 4> = (4 Q/f) exp (—kr). 


J_ d_ 
r 2 dr 



\6nQp 
k B T ^ 


k D - 


/ \ 6nQp 0 \ 1/12 

V KT ) 


(21.87) 

(21.88) 


where k D is Debye's screening constant. 15 A modification for semiconductors is that the dielectric constant K s of the 
medium (e.g., K s = 11.9 for silicon) affects the factor Q by making the replacement Q —► (K s - l)Q 0 f[K s + 1) where now 
Q 0 = ahc/A = 0.359991 eV-nm [99], 


EXAMPLE: Find the Debye screening constant for a semiconductor doped such that the density of electrons in the 
conduction band is 10 18 cm -3 . Assume the temperature is 300 K and that K s = 11.9. 

SOLUTION: Converting, p 0 = 0.001 nm" 3 and k P T = 0.025852 eV. Therefore, in units of eY, nm, fs, and q, 

16 ■ 0.001 • n - 0.359991 • 0.844961 N 1 / 2 _ 760Qi6 -l 

0.025852 ) ~ ' nm 


21.7.3 Many cores 


What are the core values that we, as Americans, hold in common? That's not how we usually frame the issue, of course; our political 
culture fixates on where our values clash. 

- Barack Obama 16 


The presence of multiple shielded cores in a lattice can modify the apparent shape of the shielded Coulomb potential. 
A suggestion of how is afforded by a screened Coulomb potential (called the Yukawa potential) for which 


q = f^) exp {-kRs) 


(21.89) 


where s is a dimensionless measure of the radial distance from the core, and R is a measure of the distance of the atom 
in a bulk material to its neighbor. For simplicity in the present example, assume a cubic lattice and let R be twice the 
covalent radius. Then, apart from the overall energy scale 4 Q/R, the screened Coulomb potential has a similar form 
governed only by the factor kR. Use the previous two example calculations, and take R Cu = 0.276 nm and R sj = 0.222 
nm, so that ( kR) Cu = 4.98434 and ( tcR) si = 0.170728. A comparison of the dimensionless potential 0/(4 Q 0 /R) is shown in 
Figure 21.11. Contrast that with ion potentials that are close packed on a cubic lattice, and let s be measured along the 
x axis. Then the cumulative potential is approximated by summing over an array of screened potentials on a cubic lattice 
by 


m | 



(21.90) 


where s 2 kl = (j — s) 2 +k 2 + l 2 and the summation is over each index spanning the range from -N to N. The potential is 
specified up to an additive energy constant, and so subtracting off a value so that the midpoint between the middle atom 
and its nearest neighbor is approximately zero gives the behavior shown in Figure 21.12. The potentials were generated 
using the algorithm in Section A3.12.5. 


15 As observed by Kubo [61], Mott first derived the relation, and so the appellation "Mott's screening constant” may be a better designation. 
16 Barack Obama, The Audacity of Hope: Thoughts on Reclaiming the American Dream. New York: Crown Publishers, 2006, p. 52. 
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x/R 


Figure 21.11 Screened, or Yukawa, potential for copper-like and silicon-like parameters using the algorithm of Section A3.12.5. 



x/R 

Figure 21.12 Screened, or Yukawa, potential for copper-like and silicon-like parameters taking into account contributions of neighbors as in Eq. (21.90) 
using the algorithm of Section A3.12.5. 


The ability of electrons to migrate in response to the presence of a charge anticipates that they will also migrate so as 
to shield the inside bulk of a material from an externally applied field. Therefore, electron emission from semiconductors 
will be complicated by accounting for how the conduction bands bend in response to the magnitude of those external 
fields. 














CHAPTER 22 

Semiconductors 


22.1 Resistivity 

The effect of heat in increasing the conducting power of many substances.. .is well known. I have lately met with an extraordinary 
case of this kind...which is in direct contrast with the influence of heat upon metallic bodies... The substance presenting this effect is 
sulphuret of silver ...On applying a lamp under the sulphuret between the poles, the conducting power rose rapidly with the heat.. .and 
the susulpheret was found conducting in a manner of a metal. On removing the lamp and allowing the heat to fall, the effects were 
reversed.... 

- Michael Faraday 1 

Faraday is credited with having first measured (c. 1833) a characteristic feature of semiconductors [294]: their con¬ 
ductivity increases with temperature. More accurately Faraday had measured an intrinsic semiconductor: doping the 
semiconductor to make it extrinsic complicates matters (Section 22.4). Metal conductivity generally decreases: resistivity 
(the inverse of conductivity) for metals commonly behaves as 

p(T)\ metal a p(T a ) + a(T — T 0 ) (22.1) 

whereas for intrinsic semiconductors (like silicon), the behavior is closer to 

p(T)\ semi a pCTpexp {-<x(T- T 0 )} (22.2) 

For example, intrinsic (high-purity) silicon data [295] is compared to data [296] for the metals palladium (Pd), gold (Au), 
and silver (Ag) in Figure 22.1, with the equivalent parameters shown in Table 22.1 for T 0 = 273.15 K. 

Why the difference? Consider a simpler question first: what distinguishes an insulator and a metal (because a semi¬ 
conductor is betwixt them)? A distinguishing feature of metals is that the flow of electrons is comparatively easy; that 
of insulators is where the flow is restricted. But as Figure 22.1 shows, that distinction is not helpful when pure silicon 
obstructs current compared to a metal at room temperature yet has comparable resistivity when the temperature is only 
100 K higher. A more precise definition [297] bases the distinction on identifying the behavior of the conductivity: 

• The conductivity of a metal is finite as T -* 0 K. 

• The conductivity of an insulator goes to 0 as T -*■ 0 K. 

At T = 0, the valence band of an insulator is therefore completely full, and the conduction band is empty. Although band 
theory [53] provides a much more nuanced account, an adequate simple-minded treatment is to take a semiconductor 
to be an insulator with an appreciable number of electrons thermally excited into the conduction band: the difference 
between a great insulator like diamond and one that conducts at room temperature like silicon is related to the size of 
the band gap. 

As introduced in Section 7.4.1, current flow is characterized by a drift velocity v d ~ Z/t where / is comparable to the 
mean free path and t to the average time between collisions of the electron with its fellow electrons and background 
atoms: a common measure of this is electron mobility [52], obtained by drawing a relationship between current and the 
electric field as in J = a£ = o(F/q), and current as given by Eq. (3.5), or J = qv d p so that 


a = qp,i e (22.3) 

where the conductivity p e , with units of cm 2 /V s, is the ratio between the drift velocity and the electric field. Therein lies 
a reasonable way to look at it: a mean time between collisions and a scattering rate presupposes scattering events made 


1 Michael Faraday, Experimental Researches in Electricity, ref. [293], Sections 432-434. "Sulphuret of silver" as known to Faraday is known now 
as "silver sulfide", Ag 2 S, a semiconductor with a band gap of 0.9 eV, more commonly experienced as the irritating black tarnish on silverware 
and jewelry. 


Introduction to the Physics of Electron Emission, First Edition. Kevin L. Jensen. 
© 2018 John Wiley & Sons Ltd. Published 2018 by John Wiley & Sons Ltd. 


277 


www.ebook3000.com 



Chapter 22 


Table 22.1 Resistivity parameters for metals compared to intrinsic silicon in terms of 
Eq. (22.1) for metals and Eq. (22.2) for semiconductors. Data based on refs [296] and 
[295], and shown in Figure 22.1. 


Material 

Eq- 

p D (JJ-cm) 

a (1/K) 

Palladium (Pd) 

22.1 

955.74 

3.9563 

Gold (Au) 

22.1 

206.61 

0.82405 

Silver (Ag) 

22.1 

14.21 

0.61115 

Silicon (Si) 

22.2 

1.1369x10 s 

0.057158 



T [K] 

Figure 22.1 Behavior of the resistivity of intrinsic silicon (data digitally extracted from line 6 of Figure 2 of ref. [295]) to the resistivity of some metals 
(specifically, palladium (Pd) gold (Au), and silver (Ag), from data in the tables of ref. [296]). The lines based on the equations show the linear fit to the 
metals (appearing curved because of the log plot) and the exponential fit to the semiconductor. 


possible by an electron picking up energy from fields existing in the bulk material. The only reason that they would not 
scatter is if they had no place to go (bringing to mind the analogy of the phalanx in Section 7.1) because the final states 
they would like to scatter to are occupied. Accelerating electrons increase in energy. Insulators have a filled band, as in 
Figure 22.2, with no empty states available in the gap that an accelerated electron can access, so current is prevented from 
flowing. Gaps are generally of such a magnitude that electrons do not often acquire the energy (from thermal agitation 
or long wavelength photons) to cross it. Tellingly, diamond field emitters, for example, show good transport: the problem 
is getting electrons into the conduction band [298], As shown in Table 22.2, good insulators, like diamond, have a large 
band gap whereas the more commonly used semiconductors, like silicon, germanium, and gallium arsenide, have smaller 


' Semi / Insulator I 

' I 

I 



Figure 22.2 Left, a metal in which the bottom of the conduction band E c is below the top of the valence band E v . Right, a semiconductor/insulator: an 
electron e that is promoted (via thermal excitation, photo-excitation, and so on) from the valence band into the conduction band leaves behind a hole h. 
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Table 22.2 Parameters for common semiconductors based on Figure 22.4 and Eq. (22.14). K s is the static dielectric constant. / is the 
electron affinity (difference between the vacuum level and bottom of the conduction band) for semiconductors. The number density is 
p(J) (#/cm 3 ). E notation denotes that nEp is understood as n x 10 P , for example with n = 4 and p = 5, then 4E5 is understood as 4 x 10 5 . 
X for diamond (C) is for a clean diamond surface; hydrogen-terminated surface of diamond has negative electron affinity (NEA) properties 
where the vacuum level is below £ c . x is measured from £ c . The factors a and b are from Figure 22.4. 


Parameter Units C Si GaAs GaP 


E g(!°) 

b 

S 

(hco) 
E g { 300) 


K s 

P( 300) 


eV 5.4068 

eV 0.5565 

eV 1290.3 

eV 2.5025 

eV 0.1112 

1290.3 

eV 5.3992 

eV 0.35 

0.57 


5.7 

#/cm 3 6.2079E-27 


1.1693 1.5179 2.3236 

0.0850 0.1708 0.3616 

320.70 329.74 614.89 

1.5379 3.0055 3.4122 

0.0276 0.0284 0.0530 

320.7 329.7 614.9 

1.1249 1.4326 2.2701 

4.05 4.07 4.3 

I. 1 0.068 0.35 

0.56 0.5 0.5 

II. 8 13.2 11.1 

6.21430E09 1.84140E06 5.80013E-01 


ones. Metals, on the other hand, are characterized by electrons always having a level to scatter to: the conduction band 
is partially filled (the valence and conduction bands overlap) even as T -*■ 0. 

Metals and semiconductors differ in how full and how close the bands are. If a unit cell is the smallest part of a crystal 
that reflects the arrangement of atoms in the bulk (in the proper language, the number of unit cells corresponds to the 
number of Brillouin zones), then there are two electron states (one for each spin direction) per unit cell per band. Then, 
the number of electrons in the unit cell can be related to whether the solid is a metal or not. Paraphrasing Ziman [53], 

• Alkali metals (column 1 in the Periodic Table, e.g. Cs, K, and Na) are monovalent (single electron in the unit cell). 
Metals in column 11 (Cu, Ag, Au) likewise have a half-filled band. They are metals. An odd number of electrons in a 
unit cell is a metal, as in column 13 (Al, Ga, etc.). 

• Materials with half-filled bands in column 15 (As, Sb, Bi) have five electrons per atom but arrange themselves so that 
there are two atoms per unit cell. Thus, their outermost band is almost full, but because of partial overlap with the 
higher band, there is some current. These are semi-metals. 

• Materials with an even number of electrons in the unit cell would seem to be insulators, unless there is band overlap. 
This can continue with increasingly more complicated examples of the divalent and transition elements (where the 
d shells make their influence known). The tetravalent elements are particularly interesting. Ziman writes: 2 

There is an interesting progression in Group TV of the Periodic Table, from C, which, as diamond, forms a semiconductor with such a 
large energy gap that in practice it is an insulator, to Si and Ge which are semiconductors, to Sn, which is both metal and semiconductor, 
to Pb which is a metal. The energy band structure of all these elements can be represented by N.F.E? models... 

The tour has come full circle: from relating the origin of the band gap to both the behavior of the wave functions over a 
corrugated potential barrier and to the overlap integrals of the atomic orbitals basis states, a nearly free electron model is 
encountered again. 4 From the NFE vantage point, what semiconductors bring to the emission equations can be considered 
once the number of electrons in the conduction band is found. 

22.2 Electrons and holes 

O, cursed be the hand that made these holes! 

- William Shakespeare 5 


Experiencing Section 4.1 of ref. [53] is worth the effort. 

3 NFE stands for "nearly free electron". 

4 There is much more to be said on the relation between the "band" and "bond" approaches. Sample Ziman [53], Ashcroft and Mermin [292], 
Sutton [104], and Quere [78]. 

5 Ref. [37]: Richard III, I.ii.14. 
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When an electron does acquire enough energy to skip across the band gap E g through thermal agitation 
(or photo-absorption or collisions with a high-energy primary electron, but "thermal" shall be a proxy for all), its 
absence is noticed. An electron in the conduction band can roam the bulk, but its vacancy is a missing electron on the 
atom it leaves behind, and that vacancy can be filled by robbing an adjacent atom of one of its electrons. In that way, 
the vacancies move, and can do so with velocities similar to those of the electrons. It is very, very convenient to let those 
vacancies be represented as positively charged particles and endow them with mass and velocity; when that is done, the 
particles are called holes, the flow of which constitutes a current in the opposite direction of electrons. More generally, 
both holes and electrons flow, although one overshadows the other in doped materials (Section 22.4) and so the phrase 
"dominant charge carrier" need not refer to electrons exclusively. 

The hole distribution can be made to follow the appearance of the electron distribution in Eq. (6.17). If all the states 
in the valence band up to E v are filled, then the total number of electrons N in a volume V or, better, p = N/V, is simply 
(compare Eq. (7.11)) 



(22.4) 


which is analogous to Eq. (7.12), but as the temperature increases some of those electrons are thermally excited into the 
conduction band, and so 



(22.5) 


The total number of electrons does not change as the temperature increases, and if the volume does not either 6 then 
Eq. (22.4) = Eq. (22.5) or, after collecting terms. 



(22.6) 


Actually, a bit more is required: the holes do not have to have the same mass as the electrons in the conduction band, 
often they will not. It is common to append subscripts to D{E), as in T) e (E) and E) h (E), and to speak of electron and hole 
masses m e and m p explicitly. 

The function f(s) = l/(e s + 1) is very convenient. It is straightforward to show 1 -f(s) =f(-s). The form of the 
Fermi-Dirac distribution is analogous to f(s), and therefore the side of Eq. (22.6) that pertains to holes is easily 
manipulated and recasts Eq. (6.17) as 



(22.7) 


(22.8) 


For intrinsic semiconductors, not many electrons are in the conduction band and fewer still at lower temperatures. 
Whereas metals have about 10 23 electrons/cm 3 , intrinsic semiconductors have many orders of magnitude less, for 
example room temperature intrinsic silicon has a number density on the order of 10 10 cm -3 , or about three freely 
roaming electrons in 100 nm 3 . Given that N c /P i/2 « 2.8 x 10 19 cm -3 (Eq. (6.19)), p is going to be quite negative, 
meaning that it resides below the conduction band minimum E c and in the band gap. If so, then Eq. (6.30) can simplify 
matters considerably because the expansion can be truncated after the first term. Letting m e = r e m and m h = r h m, where 
r e and r h are the ratios of the effective mass to the electron rest mass for electrons and holes, respectively (Section 23), 
then p e (T) = Ph( T ) results in 


r]' 2 exp [p{E v - n)\ = r 3/2 exp [/?(/< - E c )\ 


(22.9) 


6 It does. Penance for deceptively ignoring it will be atoned for in Section 22.3. 
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Isolating \i then shows 

»(T)=\{E c + E v )-^k B T\n(^\ (22.10) 

At room temperature, k B T weighs in at a relatively pathetic 0.025852 eV, and since the electron and hole masses are 
expected to be comparable ( r e ~ r h ), the temperature-dependent term is going to be small indeed. Therefore, although 
the Fermi level does depend on T for intrinsic semiconductors, for the most part it simply sits midway between the 
conduction and valence band edges to a very good approximation. Additionally, the leading order terms in an expansion 
of the product of the electron density p e with the hole density p h are starightforward to obtain: keeping only the first 
terms in the expansion of F 1/2 (x) and recalling that the band gap is defined by 

E g = E c -E v (22.11) 


then 



(22.12) 


where the thermal wavelength k T is from Eq. (5.24). When the electron and hole masses are equal, then p e = p h = p t , 
where p t is the intrinsic carrier density. 


22.3 Band gap and temperature 

A man who doesn't know the truth is just an idiot, hut a man who knows the truth and calls it a lie is a crook. 

- Bertolt Brecht 7 

In developing Eq. (22.6), the volume of a crystal was implied to not change (or not change much) with temperature, 
an assumption that contradicts the tendency of solids to expand when heated. If the volume changes with temperature, 
then the eigenstates change, and so the band gap must change as well. It is well known that the variation of band gap 
with pressure on a semiconductor can be understood from the effects of pressure on the atomic orbitals [299], in that 
increasing the pressure moves the cores closer together. Similarly, an increase in temperature changes the magnitude of 
the atomic vibrations, pushing the cores apart and so it, too, must lead to a change in the band gap. Claiming the change 
is small does not mitigate the offense, being more wishful thinking than accurate. Penance is due. 

A commonly used empirical relation, known as Varshni's equation, is 

E g (T) « E g (0) - (22.13) 

but the following form [300] gives a closer correspondence to data 8 

E g (T) » E 0 (O) - s E g { 0) - 2 Sk B Tg(0/T) (22.14) 

where k B 0 = (hco) is an average phonon energy and S is a dimensionless "coupling constant". The behavior of g(0/T) is 
shown in Figure 22.3. Eq. (22.14) gives a satisfactory fit to experimental relations for a number of semiconductors, as 
shown in Figure 22.4 and Table 22.2. Comparing the asymptotes of Eqs (22.13) and (22.14) for T -* 0 and T -* oo shows 
that B ++ 0/ 2 and A ++ 2Sk B . 

22.4 Doping of semiconductors 

I cannot praise a fugitive and cloistered virtue, unexercised and unbreathed.. .Assuredly we bring not innocence into the world, we bring 
impurity much rather; that which purifies us is trial, and trial is by what is contrary. 

- John Milton 9 


7 Bertolt Brecht (trans. C. Laughton), Galileo. New York: Grove Press, 1966. The quote is from Scene 9 in the German original, but Laughton's 
translation regrettably omitted it, as observed in the Introduction by E. Bentley (p. 26). 

8 Ref. [300] prefers E g (T) ss B^(0) - S{hoo)(cosh({hm)/2k g T) - 1) but they are equivalent. 

9 John Milton, Areopagitica, Great Books of the Western World, ed. Robert Maynard Hutchins, Vol. 32. Chicago: Encyclopaedia Britannica, 1952, 
p. 390-391. 
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e/T 


Figure 22.3 The function gtp/T) defined by Eq. (22.14). Observe that large temperature corresponds to small values of 0/T. 



Figure 22.4 Temperature dependence of the band gap for several common semiconductors. Data for silicon from ref. [301]. Data for GaAs and GaP from 
ref. [302]. Diamond (carbon) data digitally extracted from Figure 8 of rf. [303]. Fits are based on Eq. (22.14). 


In an impassioned polemic against censorship written in 1644 during the first English Civil War, John Milton (author 
of Paradise Lost) argued that Good is not a consequence of averting consideration of Bad. Rather, exposure to "impurities" 
and the ensuing struggle to determine what is right or best or worthy is essential to vet such value judgments: to know 
from experience and forego indulgence is praiseworthy and of greater moral character than not indulging because of a 
lack of comprehension. Impurities, in a sense, make possible that which is valued. 

In a way, so too with semiconductors. Much of their technological advantage arises because the intentional inclusion 
of different atoms in an otherwise pure semiconductor can give rise to mobile electrons (such atoms are donors) or take up 
electrons and thereby give rise to mobile holes (such atoms are acceptors). For example, in an impressively good insulator 
like diamond, boron is an acceptor as it lies to the left of carbon in the Periodic Table; in contrast, nitrogen is a donor 
as it lies to the right of carbon. Silicon, lying one row below carbon, is affected by those so-called dopants in much the 
same way and a variety of donor (e.g., P, As, and Sb) and acceptor (e.g., B, Ga, In, Al) atoms are used. Semiconductors 
doped by donor impurities are called n- type, and those doped by acceptor impurities are called p-type. Semiconductors 
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that include dopants are called extrinsic semiconductors, in contrast to the pure and undoped intrinsic semiconductors 
discussed in Section 22.1. 

Impurity atoms ionize, and it is the number of ionized donors (or acceptors) that dictate the electron (or hole) density. 
Because it takes a bit of energy to ionize a donor atom and release its electron into the conduction band, the electron 
concentration due to donors is a temperature-dependent quantity. A similar argument applies to holes: there, it takes a 
bit of energy for an electron to jump to the acceptor site, leaving a hole behind. The energy level of a donor with respect 
to the bottom of the conduction band is referred to as the binding energy of the donor, and similarly with acceptors. 
As the binding energies of the more useful "shallow" donors and acceptors tend to be tens of millielectronvolts, at 
room temperature conditions, most are ionized for common semiconductor materials and conditions. In equilibrium, 
a semiconductor with no net charge entails 


P h (T) - p e {D + iV+ - iv; = 0 


(22.15) 


About those irritating ± superscripts: they are needed because not all the donors or acceptors are ionized, but rather a 
fraction of them. At T = 0, none of the impurities are charged (electrons in the conduction band fall into the donor level 
within the band gap, and the electrons at the acceptor levels rejoin with holes in the valence band). Conversely, at high 
enough temperature, electrons from the donor sites can easily jump to the conduction band, and electrons in the valence 
band can easily jump to the acceptor sites, leaving behind a hole. This temperature dependence can be accounted for. The 
distribution of charged donors and acceptors is evaluated in much the same way that the Fermi-Dirac distribution was 
calculated in Section 6.2 (or see ref. [304]), but the degeneracy of the number of charges that donor or accepter states 
allow must be factored in. If the energy level of the donor is E D and the acceptor E A , then accounting for the degeneracy 
amounts to (remember that p = l/k B T) 


1 + g D e^~ E ^ 


1 +g A eO^ 


(22.16) 

(22.17) 


where g D is the donor degeneracy factor (often 2), and g A is the acceptor degeneracy factor (often 4). If the modified 
energy levels 


pE' D = pE D -ln(g D ) (22.18) 

pE' A =pE A + ln(g A ) (22.19) 

are used then the distributions are more easily expressed in terms of the Fermi-Dirac distribution function f FD encountered 
in Eq. (6.8) by 

N+m = N x f FD (E' x ) (22.20) 

where X denotes either D or A. Thus, Eq. (22.15) is easily written as 

^1/2(4) - ^Fi/ 2<rO + N D f(z D ) - NJ(z a ) = 0 (22.21) 

where z D = p{E' D - g), z A = P(E' a - g), z v = P(E 0 - g) and z c = P{E C - g), and where/(e) = 1/(1 + e z ). This equation is impor¬ 
tant: all of the z factors depend on the chemical potential g, and so this equation is a means of determining what g is as 
a function of the valence and conduction band edges E v and E g , the doping of acceptors N A and donors N D , and the 
temperature T. 

The evaluation of g is not as difficult as it may appear. Doping levels generally vary between 10 13 and 10 18 atoms/cm 3 , 
or a bit smaller than N c F l/2 {pg)/p i/2 = 1.92 x 10 19 nm' 3 at room temperature for g = 0. Thus, for typical doping levels, 
g is negative, such that below room temperature the leading order term of Eq. (6.30), or F l i 2 {z) « (yGr/ 2)e z for z < 0), 
is often sufficient. Such an approximation is often conveniently assumed from the start [52, 73, 213, 304] given the 
pedagogical utility of doing so, but if the doping level is higher than 5 x 10 18 cm -3 then more terms of Eq. (6.30) should 
be retained or the integral definition of F x / 2 (z) used. Both are compared here to show that the approximation is good, 
but less so at higher temperatures. 
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N d [10 21 /cm 3 ] 

Figure 22.5 Relation between the doping concentration N D and the chemical potential pi for various temperatures. Remember that the number of 
ionized dopants is governed by Wj = NjJ^E'p) as per Eq. (22.16). 


To keep the discussion specific, consider phosphorous-doped silicon {E g = 1.17 eV, E D = -0.045 eV below the conduc¬ 
tion band). To keep the discussion simple (and reflecting a bias for electrons), let N A = 0. Lastly let the electron and hole 
masses be the same as the rest mass of the electron (m e that approximation is not in general true (electron 

and hole masses differ, and then there are also heavy and light holes) but the assumption lets N v ~ N c , does not alter the 
qualitative features of the figures below, and has the virtue of allowing terms to be collected in an aesthetically pleasing 
way. Using the leading order term of Eq. (6.30) and rearranging Eq. (22.21) results in 

N d » ^ (A - e~*W) (1 + 2e^-^) (22.22) 

as shown in Figure 22.5. As a consequence, N D vanishes when pi = -2? /2, corresponding to an intrinsic semiconductor 
when m e = m h . The general trend is clear: for small doping levels («: 10 15 cm -3 ) the semiconductor Fermi level is near 
the middle of the band gap and the semiconductor is intrinsic, as the doping level increases (» 10 15 cm -3 ) the doping 
dictates the carrier concentration and the semiconductor is extrinsic, as shown in Figure 22.6. For very high doping, pi 
approaches the conduction band minimum, for which another terminology is introduced: when the energy separation 
between E c and pi is larger than k B T (commonly, when pi-E c < 1k B T) the semiconductor is said to be non-degenerate, and 
when pi — E c > 3k B T, then it is said to be degenerate. Alternately, and colloquially, when the semiconductor starts looking 
more like a metal (lots of electrons in the conduction band), then it is said to be degenerate (although such a way of 



Temperature [K] 

Figure 22.6 Chemical potential pi(T) as a function of temperature. The exact relations (closed circles) are given by solutions to Eq. (22.21). The 
approximate relations (lines) are given by solutions to Eq. (22.22). The algorithm for the evaluation is in Section A3.13. 
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speaking blurs the significant differences between metals and semiconductors). This is readily visible by looking at the 
distribution of holes versus electrons as p climbs from the center of the gap (designated E t ) to the conduction band min¬ 
imum E c : the closer it gets, the more the holes (dashed line) diminish and the electrons (solid line) increase, as shown in 
Figure 22.7. 

Lastly, consider the very curious Figure 22.8, which compares the electron carrier concentration for the silicon-like 
example to the doping level and which is calculated using the algorithm of Section A3.13. At low temperature, 
p e « {\fn/2)N c P~ 3/2 e fi,i . The T 3/2 and negative pp dependence drives p e to zero as T -* 0: the carrier concentration is 
said to "freeze out". Conversely, in the high temperature limit p is increasingly negative, but asymptotically becomes 
-EJ2 as in the high-temperature behavior of Figure 22.6: the chemical potential is then characteristic of an intrinsic 
semiconductor. Lastly, at typical temperatures all of the dopants are ionized, as was assumed in Section 16.1 in 
the calculation of the depletion layer w of Eq. (16.4). All the curves of Figure 22.8 bear the same family resem¬ 
blance of fall off at low T and exponential growth at high enough T, a behavior reasonably captured in the range 
shown. 



Figure 22.7 The hole p h and electron p e distributions (labeled h and e, respectively) as a function of energy for different values of the chemical potential 
p measured in millielectronvolts from the center of the band gap. Compare Figure 7.1. 



Figure 22.8 Ratio of electron concentrate 

occurs when the temperature is so high that p - E a /2. The exact relations (closed circles) are given 

relations (lines) are given by solutions to Eq. (22.22). The algorithm for the evaluation is in Section . 
although it degrades at higher doping densities. 


occurs when p ~ E v . Intrinsic 
by solutions to Eq. (22.21). The approximate 
\3.13. The approximation is reasonably good, 
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22.5 Semiconductor image charge potential 


Prometheus: Do you think I will crouch before your Gods-so new-and tremble? I am far from that... 

Hermes: Just such the obstinancy that brought you here, to this self-willed calamitous anchorage. 

Prometheus: Be sure of this: when I set my misfortune against your slavery, I would not change. 

- Aeschylus 10 

The electrons in a metal seem free, but they must move in response to an electric field: they therefore pile up at the 
surface until the bulk of the metal is shielded, that is, held at an equipotential. As a consequence, electric fields terminate 
at the surface and are normal to it: any transverse surface field would start a flow of electrons in response. Such is the 
fate of those "nearly free" electrons (just as Hermes must travel the world at the whim of the god of lightning). In an 
insulator, however, the electrons are bound to their atoms like Prometheus to his rock, the effect of the electric field 
only causing the charge distribution of the atoms to distort. Dielectrics can be treated as an arrangement of little dipoles 
orienting in response to an electric field, but not moving charge over a distance because of it. 

The little dipoles in a dielectric material placed in an electric field therefore incompletely shield the internal or material 
region (call it Region 2) from the field in the external or vacuum region (call it Region 1). F 2 in the material is related to 
fq in the vacuum by the dielectric constant e of the material according to [99] 


(p 1 ! - eF 2 ) -h = 0 


where h is the normal to the surface. The tangential component of the fields is continuous across the boundary; alternately 
the potential is continuous across the surface. 11 A charge q s embedded in the dielectric will likewise cause the dipoles to 
reorient themselves in response, so that the potential at a distance r from q s is then 


cp{r) = - 


(22.24) 


where the notation of the dimensionless K s is preferred over e = K s e 0 . Such a seemingly simple observation has conse¬ 
quences for the nature of the image charge contribution to the potential energy barrier. Outside a metal, a departing 
electron experiences a full image of itself across the plane defined by the metal surface, but if the metal is replaced by a 
semiconductor, what the electron experiences is somewhat lacking by comparison. 

To anticipate the dependence of the image charge potential on K s , think back to the metallic image charge potential 
(Section 11.3). In response to an external charge, the distribution of charge at the metal surface rearranges such that 
it produces a potential in vacuum equivalent to a mirror charge at the same distance from the surface but inside the 
metal, equal in magnitude and opposite in sign, to the external charge. That is the vacuum region perspective. Inside 
the metal, though, no field exists, so what perspective accounts for that? It is as though an image charge overlaps the real 
external charge, again opposite in sign, but of the same magnitude and distance from the surface, making the sum of 
the external charge and the image identically vanish as seen from the inside of the metal. The two regions therefore differ in 
their outlook. When the metal is replaced by a semiconductor with a dielectric constant of K s , however, the magnitude 
of the overlapping image charge changes from q s to q { < q s . 

To find the image charge q t , consider a more general problem. Let a charge q s sit a distance a outside the surface of 
a semiconductor of dielectric constant e = K s e 0 . In cylindrical coordinates, the distance from q s to an observation point 
at the location (p,z) is r_, where r%. = p 2 + (z± a) 2 , so that r + = r_ when 2=0. Outside the semiconductor, the potential 
energy is 

qq, ddi 

V(p, Z > 0) = (22.25) 

4^e 0 r A**- * 


Ane 0 r 


where q t is the image charge magnitude, whereas inside the semiconductor 
V(p,z< 0)=-™---™- 


10 Aeschylus (trans. David Grene), Aeschylus II: Prometheus Bound, The Complete Greek Tragedies, ed. David Grene, Richmond Lattimore. Chicago: 
The University of Chicago Press, 1956, lines 960-967. 

"Remember the convention here: fields £ and potentials cp are multiplied by unit charge q to convert them to forces F and potential energies 
V, but in units of q = 1, the discussion can use either terminology, and so the more familiar one is used. 
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where r_ is 
written as 


the denominator of the second term because is in the vacuum. Now impose Eq. (22.23), which can be 
e 0 d z V{p, 0_) = K s e 0 d z V(p, 0 + ) (22.27) 


Using r + = r_ for z = 0 and eiiminating all the common factors, which turns out to be most of them, all that is left is 

q s - q t = K s (q s + qf) q, = (22.28) 


Applying this to the image charge potential for an 
replaces the metal is [305] 


emitted electron, the modification to Eq. (11.2 5) when a semiconductor 




(22.29) 


It is seen that a metal corresponds to the limit K s -* oo and the replacement of the electron affinity / by the sum of 
chemical potential p and work function ®. The specification of the barrier height by % for semiconductors is because, 
as seen in Sections 22.3 and 22.4, doping levels and fields can affect the relationship of p to ®, whereas % remains a 
fixed value above the conduction band minimum. As shown in Figure 22.9, the inclusion of K s makes the image charge 
potential barrier somewhat higher than for metals. 

Although Eq. (22.29) describes the image charge modification for field emission from bulk semiconductors, it is nev¬ 
ertheless the case that often semiconductors are coatings on metals that can be rather thin, even so thin as to (in the 
case of adsorbates) constitute a monolayer. In such cases there are image charges of image charges to enforce boundary 
conditions. A single layer of atoms resting on a metal surface can be treated as a thin film of thickness / and dielectric 
constant K s , so that the method of images applied to that configuration results in [111] 




(22.30) 


when a layer is present. For more than a few atomic layers, however, the last term becomes small as / -*■ oo, and its neglect 
is common. Field emission from adsorbate-covered surfaces (an in-depth treatment found in Chapter 6 of ref. [Ill]) bears 
a resemblance to work function reduction due to adsorbates behind the Gyftopoulos-Levine theory in Section 31.8, and 
so will be revisited there. 


22.6 Dielectric constant and screening 

...self-deception is the handmaiden of deceit: in hiding the truth from ourselves, we are able to hide it more fully from others. 

- David Livingston Smith 12 


12 David Livingston Smith, Why We Lie: The Evolutionary Roots of Deception and the Unconscious Mind. New York: St. Martin's Press, 
2004, p. 3. 
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Much like hiding a lie, a charged inclusion can be hidden inside a material if the material participates in shielding it. 
A charged inclusion inside a dielectric pulls charges of the opposite sign towards it, and the more effectively the mate¬ 
rial allows the inclusion to do so, the more effectively the effects of that inclusion's charge are hidden. The material's 
dielectric constant e = K s e 0 is a measure for how well a material can do so. A K s of unity signals that the material is no 
better than vacuum for hiding or shielding the charge from the outside, that is, the charged inclusion is evident to all. 
A K s that is very large (as occurs for metals) means that charge separation is increasingly effortless, so that the inclusion's 
existence can be hidden very well. Expressing the dielectric constant as e = K s e 0 makes revisiting the Yukawa potential of 
Eq. (21.89) conducive to understanding semiconductors, particularly with respect to screening [86, 292] and band bend¬ 
ing at interfaces [53], 

If a positive charged particle, represented by p ; , is in a material 13 then Poisson's equation is in the form 


-V 2 0(r) = — pj(r) = 16xQp i (r) 


Then, just as with the free and bound charges treatment of the image charge in Section 22.5, let the potential r/>(r) be 
related to the charge distribution, including both the bare charge and the charges which screen it (denoted by the s 
subscript), or 

-V 2 0(r) = 16 nQtptf) - p s (f )) (22.32) 


Now draw an analogy with bulk dielectrics. Recall that a parallel plate capacitor with a vacuum in the anode-cathode 
(AK) gap has a field between the two plates of F = V a /D = o/e 0 , where a is the charge density a = Q/A and A is the area 
of the capacitor plate (in this way, one derives that the capacitance is C = Q/V a = e 0 A/D). If, instead of vacuum, the AK 
gap is filled with an insulator of dielectric constant K s , then the field in the gap becomes F -*■ F/K s , which means that 
o/e 0 g/K s e 0 = g/e. Because of the position-dependent nature of the ion charge potential, the more general equation 
is now „ _ 

4>0) = / e(j - r')<j)(r')dr' (22.33) 

Jn 

Equations of this form can use the convolution theorem [60]: if tildes denote the Fourier transform of a function 
(e.g., g(k) is the Fourier transform of g(r)) , then 


j £(r - r’)4>(r')d? = (2^)“ 3 JJJ d? dk dk' e(k)e ik ' Cr - ?) m)e lk '' ? 

= J J dkdk! i(k) 4>(k')e fkrr 8(k - k!) 

= J dk i(k) 4>(k)e l ' k r 


(22.34) 


This is convenient because the Fourier transform of Eq. (22.33) can be written as 


4>m = mm 


Ins 

m = 

m 


(22.35) 


For the Coulomb potential of a point charge, $ ; (&) is interestingly tricky because although all of the integrals involved 
are elementary, the limits of integration are not. The integrals are 


$j(k) = Q J = 2icQ J rdr J e lkr co 


(22.36) 


A common technique is to append a factor e Sr to the integrand, and at the end of the calculation take the limit <5 0 

(or 1/8 oo). 14 The intermediate integral step is then 


$,(k) = 


2niQ 

k 




e~ Sr dr = 


AnQ 
k 2 + 8 2 


(22.37) 


13 A positive charge puts the sign back into Poisson’s equation as in standard accounts, and recall that p is a number density. The sign will be 
important to remember in Eq. (22.40). 

14 Memories of the troubles one can get into by failing to appreciate which infinite limit to take first may arise, that of 1 /5 or the upper integration 
limit for r, as in Section 6.3, but here it is not so critical. 
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Now the limit 8 -*■ 0 is well-behaved, and so 

4>m = ^ (22.38) 

Replacing $ ( (r) by its Fourier transform in Eq. (22.31), the action of the Laplacian operator amounts to replacing V 2 —<■ —k 2 
because 

-V 2 f dke rh7 {■ ■ } = / dke' l7 k 2 {- ■ ■ } (22.39) 

as {■ ■ ■ } contains no r-dependent terms. Now assemble the pieces. From Eq. (22.35) and Eq. (21.83) 


e(k) = 


Hk) 

4>Ck) 


i (p(k) - pj(k) 

pCk ) 

16 nQD^)8cP _ k 2 p 

k 2 8cp k 2 


To get cp(f) requires taking the Fourier transform, and so (where s = cos 0) 


^='^[{i£ii) dk L* k " d, C dv 


(22.40) 


(22.41) 


which is, of course, the ansatz offered right after Eq. (21.84). 
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Effective mass 


...the dynamics of an electron wave packet (or, as we shall now say, of an “electron in a state k ") may be very different from those of an 
elementary free classical particle. However attractive may be the analogy between hit and momentum, we must not be led into believing 
that the velocity of an electron in a solid is anything like this quantity divided by the mass. 


- John M. Ziman [82], p. 92 


“You keep using that word!" the Spaniard snapped. “I don't think it means what you think it does ." 


- William Goldman 1 


Treating electrons as particles creates an expectation that the the mass of the electron is related to the energy E by 
E = p 2 12m, where p is the momentum. The relation can be written as 


A difficulty with transposing that understanding to a quantum mechanical treatment arises because electrons are not 
point particles, but rather wave packets composed of a distribution of | k) with different velocities such that p = hk for 
each |&). The same relation will be shown to hold between m and E, but the meaning of the mass term changes. 

23.1 Dispersion relations 

Glory is like a circle in the water, 

Which never ceaseth to enlarge itself, 

Till by broad spreading it disperse to naught. 


- William Shakespeare 2 


The concept of dispersion relates frequency co to wave number k. It is the origin of distinctions between phase velocity 
and group velocity, as will be important in treating lattice waves (e.g., phonons), but it arises for wave phenomena 
beyond the examples considered here. Simply, dispersion arises because the different waves that make up a wave packet 
have different wave numbers and therefore different velocities, causing the wave packets to spread out over time (as 
in the positive time regions of Figures 9.9 and 9.10 for the no barrier case) and "disperse to naught", as it were. A few 
examples give the flavor of how cofk) varies with k. 

Photons are a gentle case, given that co/k = c is the phase velocity of the photon and hco is its energy, so that m = ck 
denotes a linear dispersion relation. The group velocity v, = dco/dk is then the same as the phase velocity c, a consequence 
of all photons traveling at the same speed. When they do not, as when light passes through a high index of refraction 
material such as a prism, the various wavelengths disperse and, for white light, a familiar rainbow pattern emerges. 

The dispersion relations of matter waves are revealed by Schrodinger's time-dependent equation: inserting y/ cx e l{kx ~ 0,i) 
into -ihd t yr = ~(h 2 /2ni)d 2 y/ indicates that co = Ttk 2 /2m and dco/dk = Tik/m. This is commensurate with the non-relativistic 
E = Ti 2 k 2 /2m, so that co = hk 2 /2m and dco/dk = hk/m. 

The group velocity v g will be the concept from which a better understanding of effective mass shall emerge. It is 
common to introduce v g in the context of the evolution of wave packet by [41] 



(23.2) 


'William Goldman, The Princess Bride. New York: Ballantine Books, 1998, p. 102. 
! Ref. [37]: Henry the Sixth, Part 1, 1.ii.134-138. 


Introduction to the Physics of Electron Emission, First Edition. Kevin L. Jensen. 
© 2018 John Wiley & Sons Ltd. Published 2018 by John Wiley & Sons Ltd. 
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where y/ k (x, t) = e t(kx ~ wr) and Aik) is a distribution in k. It is then shown that v P(x, t) can be put in the form of plane waves 
encased in an envelope function traveling at the group velocity v g = dm/dk, and that a correspondence with a classical 
particle then demands that dco/dk = p/m. The derivation is elegant, but not quite simple. 

A more appealing approach uses the Wigner function of Section 9.3.2 to arrive at the same result, 3 which is possible 
because/(x, k, t ) contains all the information in it that y/ k (x, t) does, but hews to phase space intuitions. The group velocity 
of the wave packet is M 

^(k) = J J ~f^x, k, t)dxdk (23.3) 

All that shall be required is that f(x, k, t) is initially centered at (x 0 , k 0 ), is symmetrical about k 0 , declines sufficiently fast 
(say, at least exponentially fast) as x -*■ ±oo and same with k, and let the potential that the wave function propagates in be 
at worst linear in position (e.g., V(x) = V 0 - Fx). Because the Wigner trajectories follow the classical particle trajectories 
asinEq. (9.115), and using Eq. (9.111), 

h ... r r , M p \ 

m {k) = L ** 

■ (23.4) 


Now use the symmetry of/(x, k, 0): symmetric (or even) functions of (k - k g ) (such as constants or (k - k 0 ) 2 ) integrated 
over all space with it will be non-zero, but anti-symmetric (or odd) functions, such as (k - k 0 ) itself, vanish because the 
integral of an even and odd function is identically zero. Therefore, by writing Id + (Ft/m) as (k' - k 0 ) + (k 0 + (Ft/m)), the 
term in the first parentheses is odd, and the second even. Therefore, 


hk/t) 


(23.5) 


where kXt) = k 0 + (Ft/m) is the velocity of the center of the wave packet if/(x, k, 0) is a Gaussian in both x-x 0 and k - k 0 . 
Now Eq. (23.5) gives the velocity of the "particle" as desired, and shows that it is the group velocity. 

The other term, namely mass m —<■ m n , requires comparable scrutiny, in recognition that the "effective mass" of the 
electron in an energy band in the first Brillouin zone of a material can be something different than its vacuum value. 4 
The mass term r e is a consequence of electron transport in materials - it does not a priori have to be constant. If it is 
not, though, consequences accrue, the first being the relationship between the force on the electron and its change in 
momentum, or 


F 


dp 

dt 


d, 3 

= dt {m ” V) = 


drA dv 
V dv ) dt 


(23.6) 


Because r e = hk/mv and the chain rule, then 

r + v ^Ij. - r - v ^ + f^dk _ T^dk 
e dv e mv 2 mdv mdv 


(23.7) 


because first two terms are equal and opposite. The final step is recalling that v = dco/dk = h l dE/dk so that dv/dk = 
h~ l d 2 E/dk 2 and so replacing dv/dt in Eq. (23.6) gives 


42 r e m d 2 E dk 
h dk 2 dt 


(23.8) 


Therefore, matters have been engineered so that the form of the old familiar equations like Eq. (23.1) can be retained to 
an extent: 


1 _ 1 

r e m m n 


(23.9) 

(23.10) 


3 This is not to say one should deprive oneself of the treatment (see Ziman, Section 2.9 of ref.[82]), which dives right into the complications 
that attend describing packet dynamics with classical-like equations and effective masses: such a treatment is edifying, but beyond needs here. 
4 An asterisk (*) on m* = r e m is another common way to denote effective mass, as is letting m be the effective mass and m 0 be the electron rest 
mass (confusing), or (as in Eq. (22.9) and earlier) introducing a coefficient r e so that m n = r e m. For consistency, the coefficient r e method is 
retained here, so that m continues to refer to the electron rest mass and is constant. 
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as long as it is recognized that the definitions hinge on what is meant by E and its relation to k. That will take some work, 
but at least now it is understood what the words mean: m n is not the mass of a particle, but rather the curvature of the 
E{k) relation, and therefore an effective mass. 


23.2 The k p method 


Travaillons sans raisonner ...c'est le seul moyen de rendre la vie supportable. 


- Voltaire 5 


The Bloch states u(k) of Eq. (21.23) that arise in the Kronig-Penny model (Section 21.3) signal the first hint of where 
complications are to come in the revised Schrodiger-like equations of Eq. (21.24) (albeit for one dimension): a new term 
2 ikjd x appears and k 2 -*■ k 2 - kj . An argument can be made about how to solve such an equation, but making use of the 
work already developed in solving Schrodinger's equation suggests a perturbation approach would make the experience 
easier. For three dimensions, let kj go to a continuum and p = -ihV, resulting in 2 ikjd x -*■ (2 /K)k ■ p, or 

+ V(r) - ij^-k • V j u(k, r) = E , (k)u(k,f) (23.11) 


where 


(23.12) 


which looks very much like the familiar Schrodinger's equation with the addition of a perturbation defined by 




(23.13) 


What kind of difference does H kp make? To answer, start by considering a seemingly asinine choice for the 
Kronig-Penny model: let the height of the barrier vanish, or V 0 -> 0, but otherwise keep the periodicity of the lattice as 
R (i.e., the wave function satisfies y/(k,r+ R) = i p(k,r)). Thus, the wave function can be written 

y/(K,r) oc exp(z& • r) exp (23.14) 


where k is the reduced wave vector (in the first Brillouin zone), and K - k = G is a reciprocal lattice vector such that 
G-R = 2nn for n an integer. Thus, for the special case V 0 = 0, then u(k, r) oc exp (id ■ r). When K is in another Brillouin 
zone, it can be brought back into the first Brillouin zone by a translation of G. This is commonly demonstrated by diagrams 
of the form shown in Figure 23.1. By such a construction, k is restricted to the first zone (-n < ka < n in the figure, where 
a is the one-dimensional lattice constant in the shameful units of (a = 1)). 

When V g is then finite, the k ■ p term acts as a perturbation. Bands open up where the crossings occurred, and near the 
bottom of the band the usual parabolic relations emerge, but as k increases, the effects of the k ■ p makes itself known. 
Here is where the problem becomes rapidly complicated and difficult, requiring as it does an understanding of the electron 
orbitals introduced in Section 21.4. The Kane model [294, 306] introduces appropriate concepts, which are moreover 
useful for the alpha semiconductor model of Section 23.4, to treat the non-parabolic nature of E(k) in semiconductors. 
A new wrinkle is an inclusion of the electron spin of ±1/2 for which the spin operator is a. Spin coupling introduces 
another term H n into the Hamiltonian already shown in Eq. (23.11) of the form 

*•=4 ,23 ' ,5 > 
The actual form of H a , however, is less important than what it does by introducing a small splitting in the energy depend¬ 
ing on whether the spin is "up" or "down". The impact of both H a and H kp will be determined using the unperturbed 
basis states \u t ) that are the solution of 

{Li + m ) W =E A u i> (23 ' 16) 


5 "Let us work without disputing ...it is the only way to render life tolerable." Voltaire (trans. John Butt), Candide. Middlesex: Penguin Books, 
1947, p. 34. 
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Win 

Figure 23.1 Band structure in the first Brillouin zone (yellow region) for V 0 = 0 in a one-dimensional model with a lattice constant of a = 1. When 
V 0 > 0 band gaps are introduced at the cross-over points where E(k ± G ) is degenerate with E(k). 


where p is the momentum operator -ihV as before. The basis states in the conduction band are 5-like in the language of 
Section 21.4, but now they are additionally cognizant of spin, in recognition of which let \u c ) = |s±). The valence band 
states are made up of the p-like states, but so as not to confuse the momentum operator and the state, designate them by 
| u v ) = \X±), | Y±), and |Z±). Introduce two energies P and A: assuming the spin is along the z axis, they are defined by 

V = -^(S|P z |Z> (23.17) 

A = ^j(X|. PyV x V) - P x VyV)\Y) (23.18) 

where A is the spin-orbit energy splitting of the valence band from the H c term, and V arises from (S\H kp \P). Using the s- 
and p-orbital wave functions and the methods of ref. [306], these quantities can be found, but there is no imperative to do 
so: what matters here is that they exist and with cleverness approximations to them or limits for them will prove useful 
(e.g., P <x he/ 246 = 2.228Q generally holds for III-V compound semiconductors as observed by ref. [306], although V is 
given in Eq. (23.24) below). For the four chosen basis states, the roots obtained by diagonalizing the Hamiltonian result 
in 

E s 

0 

kP 

0 




where E s indicates the conduction band and E p the valence band. Explicit expansion then collection of like terms results 
in the determinant equation being given by 


(3 E p + A) {E s OE p + A)(3E p - 2A) - 3k 2 P 2 (3E p - A)} = 0 


(23.20) 


The fourth row/column corresponds to the heavy hole band, and it sets the choice E p = -A/3. E s is at the bottom of the 
conduction band, so the band gap appears to be E s - E p = E g + A/3 before the perturbation, but a diagonalization after 
the perturbation restores E s — E p = E g as the band gap energy for k = 0, the details of which are less useful here than 
the result. The ratio A /E g is an important parameter and identifies how much the band gap decreases as a result of the 
spin-orbit interaction. In the limit that it is large, E k in the conduction band 6 takes on the form 


E k 


h 2 k 2 

2m 


li/.+Jzp !«”) 


(23.21) 


6 E k can also be found in the valence band, as in ref. [306], Eqs (58-60), but conduction band electron emission is the casus belli for the present 


www.ebook3000.com 










Effective mass 


For small values of k 2 , using the expansion \J\ +x « (x/2) shows that E k is indeed parabolic in k, or 



This is pretty heady stuff: it says that the effective mass of the electron in the conduction band is dependent 
electron mass m, the band gap E g , the k ■ p factor P, and the spin-orbit energy factor A, or 


(23.22) 
the free 


m 




(23.23) 


It is worth pausing for a moment and considering the implications. That the dynamics of an electron transporting through 
a material can be fairly well accounted for by changing the value of the electron's mass in the equations of motion is rather 
remarkable. Solving Eq. (23.23) for V then gives 


P 2 


2 m\r e J\E, E g + \) 


(23.24) 


which approaches Ti 2 E g /2m n when r e is small and A./E g -*■ 0. 


EXAMPLE: Using the approximation of P -*■ hc/246 and a band-gap of E g = 1.424 eV for GaAs, estimate r e = mjm 
in the two limits A /E -* oo and A /E -*■ 0. Compare to the commonly accepted value for GaAs of r e = 0.067. 
SOLUTION: From Eq. (23.23), in the limit A /E g -> oo, then 


In the limit A /E g -* 0, then 



4 me 2 \ 

3 X 246 2 E g J 


0.1123 


2 me 2 
256 2 E a 


= 0.07776 


Having found what E s and E p must be, and then choosing the energy reference to be the top of the valence band, 
attention returns to Schrodinger's equation in Eq. (23.11) for the general relationship between E and k. It can be found 
to be 7 

E'{E' - E g ){E' + A) - k 2 P 2 (rf + |a ) = 0 (23.25) 

but now Eq. (23.24) can be used to eliminate P 2 in favor of more familiar terms 


n 2 k 2 \_ E '( E ' - E 0 ) \ ( E ' + A]>(3JJ, + 2A) 1 
"2^T “ 1 ) ~ E~ g [(^ + A)(3E' + 2A)J 


(23.26) 


which hardly seems an improvement until some interesting features are noticed. First, for a number of semiconductors, 
l/r e is big (for GaAs, it is about 15), so that the left-hand side, to a reasonable approximation, is Ti 2 k 2 /2m n , where the 
effective mass m„ takes over the role previously held by m. Now look at the limits of small (A /E g -*■ 0) vs large (A /E g -*■ oo) 
spin-orbit coupling. One finds the same limit in both cases, because of the term in the square brackets, to be 


lim (m\, lim 

a/b^o y 2m n ) a/b ( ->oo \ 2 m n ) E g 


(23.27) 


This is a most agreeable simplification, and it works fairly well in practice. It is generally called a hyperbolic dispersion 
relationship, which sounds unusual but is actually more familiar than one might expect: such relations are found in 
treatments of relativity. 


7 See Eq. (53) of ref. [306] (alternately, Eq. (2.66) of ref. [294]) for a bit of discussion regarding it. 
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23.3 Hyperbolic relations 


Confusion never stops 
Closing walls and ticking clocks 

- Coldplay 8 


All motion is relative. Perhaps it is you who have moved away - by standing still. 


- Jerome Lawrence and Robert E. Lee 9 


If a ball is thrown against a wall, bounces off, and returns to its launch point, as in Figure 23.2, the time of flight 2A t 
can be related to the distance traveled 2 L and the velocity of the ball v b by 2Af = 2 L/v b . If, on the other hand, the same 
process is repeated from a platform moving at a velocity v p in a direction parallel to the surface of the wall but with 
velocity of the ball was unchanged, a person in the moving frame would say the transit time At' is longer because the 
velocity in his frame is smaller, or 

2 At* = 2L (23.28) 


For an observer in the rest frame, however, the velocity is still v b but the path length has increased, so 


sJl 2 + (v p At) 2 


(23.29) 


Solving for At and comparing to Eq. (23.28) shows At = At'. Time passes the same for both the rest frame and the moving 
frame. 

Put that understanding in a useful mathematical representation. Designate the moving platform with primed coor¬ 
dinates and the stationary platform with unprimed coordinates (as is usual). Relating the physics behind the differing 
views on the platforms is then accomplished by a Galilean transformation: if the unprimed coordinates in space and time 
are (x,y,z, t), then the primed coordinates satisfy [307] 


V' 


t 


1 

0 

0 

o' 

7 

X? 


x-vt 


-vt 

1 

0 

0 

X 

/ 


y 


0 

0 

1 

0 

y 

z! 


z 


0 

0 

0 

1 

z 


(23.30) 


where it is convenient to drop the p subscript on the platform velocity v p and simply call it v. Because force P involves 
the second derivative with respect to time, then Newton's equations under a Galilean transformation stay the same, 
or F = m(d 2 x/dt 2 ) = m(d 2 x’/dt 2 ), where dt and dt' are the same. But when such Galilean transforms are applied to the 



8 C. Martin, J. Buckland, G. Berryman, and W. Champion, lyrics to Clocks, 2002. 

9 J. Lawrence and R.E. Lee, Inherit the Wind. New York: Bantam Books, 1960, Act 2, Scene 2, p. 60. 
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equations of electrodynamics, the form of the wave equations for electric and magnetic field associated with light alter, 
showing that the equations are not "invariant" under a Galilean transformation. 

Enter Einstein, who asserted that if the ball were replaced by a beam of light, then both an observer in the rest frame 
and one in the moving frame would measure the speed of light as the same. But if each observer experiences light as 
traveling at a velocity c, then the equivalence of the transit times no longer holds. This means that something is peculiar 
with the passage of time: an observer in the rest frame would interpret the moving frame results as being due to light 
traveling a path length of 2tJl 2 + ( fict) 2 /c and therefore argue that more time passed than the observer in the moving 
frame, who would say the transit time is 2A t. The passage of time in the moving frame has slowed: writing c as the path 
length over the transit time, first in the rest frame and then in the moving frame, and using the very common notation 
that p = v/c, then 

= 2 (cAQ 2 + (pc At') 2 

2Af C 2 At' ( ' ' 


where the left-hand side is for the rest frame and the right-hand side is for the moving frame with L 
rightmost equation for At' gives 


At' = 



= yAt 


cAt. Solving the 
(23.32) 


where the second of the common symbols of relativity, namely y = 1/\/l - p 2 , is introduced. If the passage of time has 
slowed, then the length L of a rod with its axis in the direction of motion shortens: determining the length of a rod by 
measuring the transit time of a light pulse from one end of it to another requires L'/At' = c, so that if At' = yAt, then so 
too must L' = yL. 

The invariant in going from one coordinate system to another is the speed of light: a point on the spherical wave front 
of an expanding pulse of light from a point source, measured in both the unprimed (rest) and primed (moving) frames 
is the same, or 


(23.33) 


To keep the discussion simple, assume that the coordinate axes in both frames are parallel such that / = y and z' = z. 
Introduce the differential length dr = \Jdx 2 + dy 2 + dz 2 , so that Eq. (23.33) can alternately be written 


c 2 dt 2 - dx 2 - dy 2 - dz 2 = c^dt 12 - dx' 2 - dy 12 - dz' 2 


(23.34) 


which can be expressed as ds 2 = ds' 2 where ds is the invariant quantity in inertial frames. 10 How, then, must the Galilean 
transformation matrix of Eq. (23.30) be altered to respect the constancy of the speed of light in any inertial frame plus 
retain the equal validity of all inertial frames in describing the physics? Ignoring the rows and columns associated with 
the y and z axes (they are unaffected by uniform motion in x), then the simpler 2x2 matrix for (x,t) need only be 
considered. The quantity ds 2 = ds' 2 may then be expressed as 

Sdt 2 - dx 2 = (cdt dx) _°^ = <C|L|C> (23.35) 

where L is the Lorentz metric (it should be a 4 x 4 matrix with only diagonal elements non-zero, such that L lt = 1 and 
Ljj = -1 for j from 2 to 4, but as the y and z axes are the same in the rest and moving frames, they can be ignored). 
The unusual usage of bra-ket notation is only because it captures the matrix operations and vector properties that are 
relevant 11 and so is convenient. The invariance of ds 2 therefore entails that in a moving frame |f') = U|f) for which ds 2 
is unchanged can be represented by introducing a coordinate transformation matrix U where 


U = 


(23.36) 


10 If ds 2 = 0, it is said to be "light-like", if positive, then "time-like", and if negative, "space-like". 

u No imputation that |f) is a wave function or basis state is implied, because it is not: |f) is nothing more than a two-component real vector 
here, and L and U are just matrices, as emphasized by the symbol representing them. 
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The quantities {a, b,f,g ) are real quantities to be determined such that 


UtLU = L^(* 2 --f 2 ° 

\ab-fg b 2 -g 2 


(23.37) 


where t denotes "transpose" as the matrix elements of LI are real. These represent three equations for four unknowns 
because the off-diagonal entries are the same. One more is needed, but it has already been encountered: Eq. (23.32) 
shows that a = y, and therefore, g = y and b = f = py. Thus, 


' cf' 


y{ct + px) 


'ct 


y{ct' - px' 

X 1 


y{pct + x) 


X 


y{x' - pet') 

y 


y 


y 


y 

z' 


z 


z 


z' 


(23.38) 


where the other coordinates have been returned to their rightful place. 

Now that the coordinates have been considered, turn to energy and momentum. The equivalence of any reference 
frame for the description of dynamics means that saying F = dp/dt, where F is force and p momentum, has consequences 
as well. In a reference frame moving just shy of the speed of light, the application of force is not changing the particle 
velocity (which is just shy of c) that much, so something else must be affected, and that something is mass. The way dt! 
is related to dt suggests that F = d{mv)/dt -*■ d{ypm 0 c)/dt, where m 0 indicates the mass of the particle in its rest frame and 
m = ym 0 is the mass in a moving frame. 12 A more rigorous exploration confirms this expectation [99], Analogous to the 
invariance of ds 2 , the invariant quantity in the energy-momentum relation is the rest mass m 0 , and so 

m 2 0 =m 2 - ( p/c) 2 (23.39) 

A more agreeable representation is to define me 2 = E. Therefore, in terms of the energy-momentum terms (E, cp x , cp y , cp z ), 
the invariance relationship becomes 

E 2 = (nigC 2 ) 2 + {cp) 2 = {ym 0 c 2 ) 2 (23.40) 

In the non-relativistic limit (/?« 1), y can be Taylor expanded to show that E « m 0 c 2 + (\ /2)m 0 (fic) 2 = m 0 c 2 + (l/2)m 0 v 2 , 
the second term easily being seen to correspond to kinetic energy. As tempting as it is to head off in that direction, in 
actuality Eq. (23.39) is the equation sought. If E = ym 0 c 2 is the total energy, and m 0 c 2 is the rest energy, then because 
y 2 + p 2 y 2 = 1, 

{pc) 2 = {y 2 - 1 ){m 0 c 2 ) 2 = (pym 0 c 2 ) 2 = {ym 0 c 2 ) 2 {pc) 2 (23.41) 

which recovers p = mv but with m = ym 0 and v = Pc. Putting momentum back into the form of p = hk then results in 

{hek) 2 = E 2 — (m 0 c 2 ) 2 (23.42) 

which is in the form of a hyperbolic dispersion relationship, where the designation indicates that k and E can be expressed in 
terms of hyperbolic trigonometric functions cosh and sinh. That is, by dividing throughout by the rest mass term {m 0 c 2 ) 2 
and juggling. 


(23.43) 

(23.44) 

where the dispersion relationship is now seen to be expressible in hyperbolic functions. 


1 = cosh 2 © — sinh 2 © 
tanh 0 = — 


12 Alas, relativity has forced a distinction in the mass of an electron in the rest frame m 0 compared to the moving frame m, breaking with the 
earlier convention of calling m the mass of the electron in vacuum. Context will have to determine content, again. 
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23.4 The alpha semiconductor model 


The electron dispersion relation thus has the form of a relativistic particle of rest mass m n and a limiting upper velocity a 0 c. The quantity 
a 0 c is roughly constant for the III-V compounds, u 

- Barbara L. Jensen [306] 


Invoking relativity to provide an example of a hyperbolic dispersion relation seems more ostentatious than serviceable, 
but there is a reason for it. The limitation represented by the speed of light for inertial frames has direct consequences. 
An analogous limitation on the electron velocity in a crystal gives rise to a method of estimating the effective electron 
mass m n . Actually, it does more than that: it provides a means of estimating optical quantities, acts as a bridge between 
the (quasiclassical) Drude theory and a quantum mechanical formulation, and provides some of the scattering terms 
needed for photoemission predictions from multialkali antimonides [308] and a number of compound semiconductors. 
So as not to get distracted, consider just the dispersion relation and its implications for effective mass and a hypothetical 
limiting velocity though. What is particularly interesting is that for a certain class of semiconductors, namely, the II-VI 
and m-V semiconductors, the variation of band gap E with effective mass r e = m n /m is very intriguing, as shown in 
Figure 23.3. 

Return to the dispersion relation of Eq. (23.27), but now, instead of measuring energy from the top of the valence 
band, measure it from the center of the band gap, or E' -*■ W + E g / 2, and use the A /E g -> oo form of the equation to write 

h 2 k 2 (W + E/2)(W — E/2) W 2 -E 2 /4 

— = -----= - - — (23.45) 

2m n E g E S 


or 


E 2 

W 2 = -f- + 

4 


EJW_ 
2m n 


(23.46) 


This is suggestive of the relativistic dispersion relation, but for it to he like those dispersion relations, then W would have 
to act like E, therefore investigate the consequences of W = ym n (a 0 c) 2 , where y is to be determined 14 and where a 0 c takes 
over the role of a limiting velocity. With this substitution, then 


\ 2m n (a 0 c) 2 J 


(23.47) 


The first (or constant) term on the right-hand side is unity if 


B. 


— 2 m n (a 0 c) 2 


(23.48) 



Effective Mass r e 

Figure 23.3 Band gap E f vs effective mass r t = mjm for selected II-VI and III-V semiconductors. Values used are based in Table 23.1. 


13 In ref. [306], a 0 = *+ a/2, that is, it is half the value of the fine structure constant a. Because a is reserved here for the fine 

structure constant, this version of the quote modifies a 0 by appending a o subscript, as done in ref. [308]. 

14 No one is fooled: the intention is to bend the meanings of the terms until y functions in the same way that the relativistic y does. 
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Table 23.1 Semiconductor band gaps and effective electron masses. Select ll-VI and lll-V 
semiconductor parameters of E g and m n = r e m. "Ratio" refers to the quantity R = ( E g /R y )(m/m n ), 
where R y = m(ac) 2 /2 = 13.6057 eV is the Rydberg energy: the alpha semiconductor model 
assumes R = 1. The data are graphically represented in Figure 23.3. 

Semiconductor Type r e = m n /m £ [eV] Ratio Ref. 


CdO ll-VI 0.1000 

CdS ll-VI 0.2050 

CdSe ll-VI 0.1300 

CdTe ll-VI 0.0960 

CdTe ll-VI 0.1100 

HgSe ll-VI 0.0300 

HgTe ll-VI 0.0170 

ZnO ll-VI 0.2700 

ZnS(W) ll-VI 0.2870 

ZnSe ll-VI 0.1700 

ZnTe ll-VI 0.1500 

AIP lll-V 0.2120 

AlSb lll-V 0.1200 

GaAs lll-V 0.0680 

GaP lll-V 0.1300 

GaSb lll-V 0.0450 

InAs lll-V 0.0219 

InP lll-V 0.0700 

InSb lll-V 0.0150 


2.5000 1.8375 [309] 

2.4100 0.8641 [310] 

1.7000 0.9611 [311] 

1.5100 1.1561 [310] 

1.5000 1.0023 [311] 

0.3000 0.7350 [309] 

0.1500 0.6485 [309] 

3.3500 0.9119 [311] 

3.7800 0.9680 [311] 

2.5800 1.1155 [311] 

2.2800 1.1172 [311] 

3.0000 1.0401 [310] 

1.5800 0.9677 [311] 

1.3500 1.4592 [309] 

2.2500 1.2721 [311] 

0.7500 1.2250 [310] 

0.3540 1.1881 [310] 

1.3500 1.4175 [311] 

0.1700 0.8330 [311] 


and if the second term is represented as (py) 2 , then 


This is perfect: Pa 0 c acts like the velocity, ym n acts like the relativistic mass, and E g acts like the rest energy. But there is 
more: the ratio of E 0 with the Rydberg energy Ry = (1/2 )m(ac) 2 shows that if 


™ (23 - 5i) 

just as shown in Figure 23.3. 

In short, for semiconductors that are describable by the alpha semiconductor approximation, then if the band gap 
is known, the effective mass is known, and a convenient simple model is available to calculate parameters and optical 
constants associated with electron transport and emission. That is, if one is interested in how some physical feature (such 
as photoemission, for example [308]) scales with band gap, then a means of taking into account the changes wrought on 
the effective mass as a consequence are easily incorporated. The basis behind this approximation is that the dispersion 
relation introduces a factor acting like the limiting velocity in an alpha semiconductor of a 0 c (a form reminiscent of the electron 
velocity in the Bohr atom in Section 2.2.1 being ac), and the velocity of the electron being defined by pa 0 c. so that 
Eq. (23.46) becomes 

•/ 2 = 1 + p 2 y 2 <-> y = - 1 (23.52) 


Finally, some of the scattering terms in electron transport yet to be encountered are dependent on the energy of the 
electron measured from the bottom of the conduction band, but W is thereby an odd choice, as it is measured with 
respect to the center of the band gap (the roots of W 2 , being ±W, therefore act like (positive energy) electrons (W > 0) 
and (negative energy) anti-electrons). In such calculations, finding a term that goes as the actual energy in the conduction 
band has utility. A candidate is the dimensionless parameter 

W-{E/2) 

u= --i— (23.53) 
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Figure 23.4 A band gap diagram for the hyperbolic dispersion relationship of W in Eq. (23.46) using the alpha semiconductor approximation. Observe 
that Pr = hk/m n a 0 c. 



Figure 23.5 Behavior of P(u) defined by Eq. (23.54). 


which is the ratio of the excess energy above the conduction band minimum with the band gap energy 

y = 1 + 2u 


(23.54) 


In terms of u, the limiting behavior of W is readily visible in Figure 23.4. Further, the behavior of p(u) as u increases 
mimics the relativistic relationship between kinetic energy and velocity v = Pc of Section 23.3, and shown in Figure 23.5, 
where 

V4h(1 + u ) 


p(u) = - 


1 + 2 u 


(23.55) 


23.5 Current and effective mass 


"Keep ancient lands, your storied pomp!" cries she 
With silent lips. "Give me your tired, your poor, 
Your huddled masses yearning to breathe free. 

The wretched refuse of your teeming shore. 

Send these, the homeless, tempest-tost to me, 

I lift my lamp beside the golden door!" 


- Emma Lazarus 1 
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It is a perennial and contentious question: if a barrier to freedom exists, who gets through, or, put another way, do 
the weighty ones have an advantage? For metals, the effective mass of the electrons in the bulk metal was close enough 
to the free electron mass that such questions did not arise, but with semiconductors, the effective masses are sufficiently 
different that some effect must arise that will affect emission. 

The transverse momenta hk ± are conserved across the interface, so attention can be gratifyingly restricted to the 
one-dimensional problem. Although the effect of a changing electron mass from bulk to barrier can be ascertained using 
a formalism like that leading Eq. (17.4) (examples being the treatments of Stratton [305, 312]), there is a way to intuit 
the behavior based on reconsidering the rectangular barrier of Section 18.1. The matrix equations leading to Eq. (18.10) 
can be reformulated to take into account that the incident wave function is characterized by an effective mass m„ = r e m. 
Of the two matricies that are involved, Eqs (18.4) and (18.5), only 5, will change as a consequence of m -* m n to become 


VO) 


2k' 




(23.56) 


where k! = yj2m n E/h = kyJT e and k = ^fk 2 - k 2 is the same as before. The effect on the transmission coefficient t(k), 
originally considered in Eq. (18.9) for transmission over the barrier, is then 16 


and so for large kL, 


D(k')= 16 1 — 


2(k 2 - kk!) sinh(Lie) + 2 iic(k + k!) cosh(Lx) 
4 k'kK 2 


’ 4 K 2 (k + k') 2 + 4(k 2 + k' 2 )(jc 2 + k 2 )sinh 2 (Lx) 

If k -*■ k!, Eq. (18.11) is recovered. If D(k) represents k' -> k in D(k’), then in the limit exp(Lx) » 1, 
D(k!) " m n k'(K 2 + k 2 ) ~ 3/2 k 2 0 


D(k) mk(jc 2 + k’ 2 ) 


k? 0 -( 1 - r e )k 2 


(23.57) 

(23.58) 


(23.59) 


similar to the findings of Stratton [305] but by different means. The dependence of the ratio of the effective mass trans¬ 
mission probability D{k!) with the free electron mass transmission probability D(k) on r] /2 = (m n lm?l 2 may have been 
anticipated in Eq. (11.4) by observing the changes in going from an integration over dk to E 1 ^ 2 dE dQ. when changing 
from momentum to energy, where a yjrn n is associated with each momentum component. 17 In any event, Eq. (23.58) 
confirms that, for field emission from semiconductors and in a manner of speaking, the weighty ones do indeed have an 
advantage. 


15 E. Lazarus, The New Colossus, http://www.libertystatepark.com/emma.htm. A poem written in 1883 and mounted on a plaque at the Statue 
of Liberty in 1903. 

16 The notation leaves something to be desired: it would appear that t(k) is a function of V, but keep in mind that k' = k^F e . 

17 Effective masses do not have to be the same along each axis (e.g., silicon), in which case mj 2 -* (m x m y m z ) l l 2 . For silicon, m y = m x , so the 
effective mass is commonly written as (m^m 2 ) 1 / 2 , where the subscripts refer to longitudinal and transverse [73]. 
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CHAPTER 24 

Interfaces 


All lies and jest 

Still, a man hears what he wants to hear 
And disregards the rest. 

- Paul Simon and Art Garfunkel 1 

Differing materials abutting one another with or without a potential drop across or between them exhibit complex 
behavior dependent upon the nature of the metals and semiconductors involved, the doping, and the manner in which 
they are in contact [52, 53, 213, 304, 312]. The description of emission from semiconductors (field emission in particular) 
is significantly complicated by the numerous differences between semiconductors and metals, beyond the effective mass 
changes encountered in Section 23.5, to include field-dependent electrochemical potentials and temperature-dependent 
band gaps, the orders of magnitude smaller electron density values near the surface in the conduction band or the 
nature of field emission from the valence band itself, changes to the image charge potential affecting the Schottky barrier 
reduction factor, the non-linear potentials that attend doping profiles, and on and on. Most vexing is the differences 
that arise depending on whether the doping is p-type or rz-type [111, 172, 305, 312, 313]. Not all of that complexity 
relates to what is needed to understand emission. Why listen to a laborious disquisition on it all? 2 Focus instead on 
the processes which change the emission barrier or the supply of electrons abutting it. The shape of the band bending 
affecting the supply of electrons, like that of the barrier impeding their transport, can be understood by assessing how 
charge is distributed near the interface. 

24.1 Metal-insulator-metal current density 

When two metallic electrodes are separated by an insulating film, the equilibrium conditions require that the top of the energy gap of the 
insulator be positioned above the Fermi level of the electrodes. Thus, the action of the insulating film is to introduce a potential barrier 
between the electrodes which impedes the flow of electrons between the electrodes. The electronic current can flow through the insulating 
region between the two electrodes if: (a) The electrons in the electrodes have enough thermal energy to surmount the potential barrier and 
flow in the conduction band, (b) The barrier is thin enough to permit its penetration by the electric tunnel effect. 

- John G. Simmons [314] 

Electron emission is into vacuum, but analogous equations to transport through interface barriers arise when the vac¬ 
uum is replaced by an insulator. When the insulator is sandwiched between two metal layers, as in Figure 24.1, by the 
addition of a right contact to replace the vacuum, then both contacts can provide electrons into the insulator region, 
and the net current will go as their difference. The bottom of the conduction band of the insulator takes over the role of 
the vacuum level in the emission equations, and modifications to account for a differing dielectric constant and effective 
mass of the electron in the insulator introduce changes. When a bias is applied (the right metal contact is held at a lower 
potential energy V b = qcp h than the left contact), then a difference in the left and right current through the structure 
occurs, so that total current flow, being the difference between them, depends on whether the rightmost edge of the 
insulator conduction band is above or below the Fermi level of the left metal contact. If the right barrier is lower, then 
the total current should follow an equation like (compare Eq. (11.2)) 

J= ^hl -/nghtCE))^ (24.1) 


^aul Simon and Art Garfunkel, lyrics to The Boxer, 1968. 

2 Apart from the obvious reason that interfaces are technologically quite important to electronics, that is. 


Introduction to the Physics of Electron Emission, First Edition. Kevin L. Jensen. 
© 2018 John Wiley & Sons Ltd. Published 2018 by John Wiley & Sons Ltd. 
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Unbiased 



Figure 24.1 Left, a model of an unbiased metal-insulator-metal interface (compare the middle figure of Figure 24.12); observe that the Fermi levels of 
the metal contacts are aligned. Right, the same interface under bias, in which the right contact is held at a lower potential V b = q<p h than the left 
contact. Image charge effects round the rectangular barrier [314] but are not shown. 


where the x subscript on E x is suppressed and f{E) is the supply function of Section 11.1 for the left and right metal 
contacts. If the metal layers were replaced by semiconductors, the description would be more complex, requiring 
the introduction of doping levels and the specification of whether the semiconductors were p- or n-type but decid¬ 
edly non-trivial [213, 315], How D(E) behaves and how the equations for J that result compare to the canonical 
Richardson-Laue-Dushman (RLD) and Fowler-Nordheim (FN) equations of Part II is more to the point here. 

Before doing this, a small historical aside is useful. The treatment of transport across a metal-insulator-metal (MIM) 
region 3 evolved differently than Eq. (24.1) would suggest, focusing as it did on electron transport from doped semiconduc¬ 
tors into a metal contact (metal-semiconductor transport). The earlier approaches therefore used drift-diffusion models 
oft used in semiconductor analyses. In contrast, Bethe hypothesized that the current limiting mechanism was the barrier 
between the metal and semiconductor: when the bias was such that flow was from metal into semiconductor, the model 
is akin to the description of thermionic emission in Chapter 12, so that Bethe's theory is perhaps not surprisingly known 
as the thermionic emission model. The Bethe approach has more in common with the present treatment, and although 
both the drift-diffusion and thermionic emission models give comparable equations, the Bethe approach is less prob¬ 
lematic in its interpretation [316], Actual transport from semiconductor to metal, however, contains elements of both. 
A final simplification is that being concerned with electron transport means the usual complications of hole transport 
and recombination effects do not trouble the discussion. 

For the MIM configuration, three cases of interest dominate. The first is that the insulator region is so wide, and the 
bias so small, that only electrons in the thermal tail of the distribution functions have adequate energy to surmount the 
barrier: the resulting current density should be analogous to the RLD equation of Section 12. The second is that (as shown 
in the rightmost figure of Figure 24.1) if the bias is large enough, and perhaps the conduction band of the insulator low 
enough, tunneling from the leftmost metal into the conduction band of the insulator is possible: the resulting current 
density should be analogous to the FN equation of Section 13.1. Lastly, the insulator region may be sufficiently thick 
that tunneling through the insulator is possible, so that the transmission probability D(E) has features of the rectangular 
barrier of Section 18.1. Consider each in turn. 

24.1.1 Thermal model 

A significant advance in our understanding of metal-semiconductor contacts came during the Second World War as a result of the use of 
silicon and germanium point-contact rectifiers in microwave radar. This advance was considerably helped by developments in semicon¬ 
ductor physics. Perhaps the most important contribution during this period was Bethe's thermionic-emission theory (1942), according to 
which the current is determined by the process of emission of electrons into the metal, rather than by drift and diffusion in the semiconductor 
as was supposed by Mott and Schottky. 

-E. H. Rhoderick [316], p. 2 


3 The designation "metal-oxide-metal" diode is also encountered. 
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When the bias is small, then only the electrons in the population with a forward energy E that exceeds the interface 
barrier height are able to cross the insulator from metal contact to metal contact. Depending on what is being emphasized, 
the process is called Schottky emission, thermionic emission (into a semiconductor rather than vacuum), or the Bethe 
model, but all include over-the-barrier transport. The transmission probability is then a step function D(E) « Q(E - n~ 
where <S> h plays the role of the work function in thermal emission theory, but is now the energy difference between 
the conduction band in the insulator and the Fermi level in the metal. If the bias across the insulator is V b , corresponding to 
the potential energy difference between the left and right metal contacts as labeled in the right schematic of Figure 24.1, 
then clearly electrons from the right contact have to surmount a barrier of height <f> b + V h to get to the left contact 
electrode because their "zero energy", as it were, is V b lower than their brethren in the left contact. Thus, Eq. (24.1) 
becomes ^ 

J» / (f(E)-f(E+V b ))dE (24.2) 

2;r ” Jn+<s> b 


If the barrier height Oj, > 4 k B T, which is often the case, then f(E) can be replaced by 
iting case because the integration region is restricted to those energies for which that 
resulting in 




its Maxwell-Boltzmann lim- 
approximation is reasonable, 

(24.3) 


and so Eq. (24.2) becomes the simple integration over exponentials, and is 


-/MW, _ 


(24.4) 


Making use of as defined in Eq. (12.3), 




(24.5) 


for which all the factors outside the parentheses can be scooped up into an RLD-like equation J Rf D but with <t> -> <t> h . 
If pv b is large, then the second term in the parentheses is negligible, making what remains much like the RLD thermal 
emission equation amended by a factor r e denoting the effective mass of the electron inside the insulator. 

But that is not all - just as the application of a field lowered the barrier to thermionic emission by the Schottky factor 
V4 QF, so too does the field existing across the insulator lower <t> h , albeit with a few modifications. First, the Q in the 
Schottky factor has to be amended by Q -* Q/K s where K s is the dielectric constant of the insulator 4 . Thus, 

^ <fi b = O b - V 4-QF/ K s (24.6) 

is the Schottky factor lowered barrier height above the left metal Fermi level, and the field across the insulator is approx¬ 
imated as F = V b /L as would be the case if the insulator is narrow and free from doping. Being free from all doping is 
rather optimistic, as Section 22.4 makes apparent, and so F is instead the field that exists at the metal-semiconductor 
interface on the semiconductor side. 5 

An equation like Eq. (24.4) is important because it leads to current rectification, which is a means of transforming alternat¬ 
ing current into direct current. However, the more common device type associated with this is the metal-semiconductor 
diode (also called rectifiers and Schottky-barrier diodes), for which current from the semiconductor to the metal is 
described by an equation analogous to Eq. (24.4), given that the electron distribution in the semiconductor is really 
a Maxwell-Boltzmann distribution. The barrier shape has more in common with the depletion layers encountered in 
Section 24.5, but for the thermal emission-type behavior the shape is not as important as the height of the barrier, and 
so a discussion of the metal-semiconductor barrier is delayed until then. The process of rectification is straightforward: if 
the bias potential on the semiconductor side sinusoidally oscillates so that the barrier oscillates between small and large, 
the current is positive (read: from semiconductor to metal) and vanishes for the part of the cycle where the barrier is 


4 Those who recall Section 22.5 may feel disoriented, thinking that Eq. (22.29) describes the image charge when semiconductors are involved, 
but now the insulator has replaced the vacuum and the charge in the insulator is outside a metal surface, not a semiconductor one, so Q/K s is 

5 The barrier shape would also become quadratic (sloping-like) instead of linear (triangular-like). Patience - that discussion will come in 
Section 24.5. 
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large; current does not flow in the reverse direction as the field changes sign. This is just an inkling of what such devices 
do [152] and the physics they share with p-n junctions [316], 

24.1.2 Tunneling model 

By evaporating an aluminum strip on a glass plate, oxidizing the aluminum surface, and then evaporating another strip over it at about 
right angles, one obtains a metal-oxide-metal diode consisting of a thin oxide layer (20-200 angstroms thick) sandwiched between two 
metal electrodes. If a potential difference is maintained between the electrodes, current will flow by field emission (tunneling through the 
barrier). The theory of (field emission) is applicable to this problem with some slight modification... 

- Aldert van der Zeil [152], p. 148 

The same three considerations that affected a thermal emission-like equation in Section 24.1.1 also apply to developing 
a field emission-like equation here for the treatment of tunneling through a MIM structure: the effective mass, the barrier 
height, and a Schottky lowering factor. When the field across the insulator is sufficiently strong, then the tunneling 
electrons from the left-hand contact experience a barrier analogous to the image charge affected triangular barrier of 
Fowler and Nordheim, as suggested in the right side of Figure 24.1, but now the incentive to consider the effects of 
the right-hand contact are severely attenuated, given the great sensitivity of tunneling current to the Gamow factor 
0(E). Given Eq. (13.25), the MIM version of the tunneling equation not surprisingly involves the same replacements of 
(z) m -*■ m n = r e m, ( ii) <t> -*■ <T> b , and (Mi) Q -* Q/K s as before. But actually, that is not quite true. The tunneling process 
is very rapid, taking approximately L/v p < 1 fs for copper-like parameters and typical field emission fields and work 
functions. Consequently, the passage of an electron through a dielectric medium should make reference to the optical 
(or high frequency) dielectric constant e m = K m e 0 , rather than the static K s e 0 , when tunneling is through a material like 
an insulator [152], The point is, though, that the image charge term Q is modified by a dielectric constant term, reflecting 
the fact that the tunneling is occurring in a material, not in the vacuum. 

Such is the qualitative understanding. The quantitative understanding begins with observing that values of the dielec¬ 
tric constant are such that simply focusing on the triangular barrier model and ignoring the image charge lowering 
makes for a good and simple first approximation. Then, use the shape factor approach of Section 11.2: recall that the 
factor 0(p) most affects the form of the tunneling current J(F) tx exp (—0(p)) (see Eq. (13.10)), so focus on it. Electrons 
traveling from the left contact to the right experience the usual triangular barrier Gamow factor 0,(p) (Eq. (11.17)), 
but those from the right experience 0 r (p) (red and blue shaded regions of Figure 24.2, respectively). In terms of a 
shape factor (4/3), k factor of -y/2 m<5> b /Ti, and a length factor of LQ> b /V b (the field across the dielectric is F = V b /L) 
then 




Figure 24.2 Close-up of the insulator region of Figure 24.1 showing the regions used in the 
calculation of the left and right 0(d) factors used to define the left and right transmission probabilities 
(red and blue, respectively). The width of the blue region is always L (the thickness of the insulator); 
the width of the red region L' = L(O b /V b ) depends on the bias and changes. 
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The blue region of 0 r (g) is not so simple. First, V b > <t> b or the electrons entering from the left would not be experiencing 
a triangular barrier. Second, electrons from the right experience a barrier that increases rather than decreases, and so 




*=[( 1+ £f- I ] w 


(24.8) 


Therefore, for the minimum condition V b = <t> fc , the right 0 r (g) is larger by a factor of 2\p2 - 1 = 1.83 than 9,(g), which 
substantially reduces the current from right to left as the bias increases: by the time V b = 2<X> b , then that factor increases 
to 4.2. Ignoring the right current, then, is very reasonable, and so the tunneling current density through an MIM 
structure is 


g + <Dj ) \ L ) 3 hv b V » 


(24.9) 


recognizable as the modified form of the triangular barrier Fowler-Nordheim equation of Eq. (13.10). Incorporating an 
image charge effect amounts to introducing the elliptical integral functions v(y s ) and t(y s ), where the functions are defined 
by Eqs (13.23) and (13.24), and where y s = \jAQV h /LK w accounts for Schottky barrier lowering in a dielectric material. 
An equation much like it will be encountered again in Section 25.1, but for which the field is a bit different. 


24.1.3 Numerical model 

For truth is rightly named the daughter of time, not of authority. It is not wonderful, therefore, if the bonds of antiquity, authority, and 
unanimity, have so enchained the power of man, that he is unable (as if bewitched) to become familiar with things themselves. 

- Francis Bacon 6 

The musings of Bethe and Fowler and Nordheim (not to mention Rhoderick or van der Ziel) about tunneling junctions 
and barriers command deference for good reason, and if they maintain that the canonical thermal and field emission 
equations can be adapted to treating an MIM diode, it must be so. And yet, that second interface on the right-hand 
side of Figure 24.2 surely does something to D(k), especially if the wave nature of the electron manifests itself in other 
tunneling and transmission barriers of greater complexity (Section 19.2.3). If the barrier <t> fc and width L are "small", 
so that D(k) « exp(-0(k)) becomes lacking, then that, too, must matter. One should perhaps honor the elders, but under¬ 
take the calculation anyway: respect but verify. 

Fortunately, the problem represents a minor modification to an existing and well-understood problem: the triangular 
barrier of Fowler and Nordheim as treated using the techniques of Sections 18.2 and 19 makes the transfer matrix solution 
to D(k) the proper way to judge the adequacy of the thermal and tunneling approximations to an MIM diode, leading 
only to minor changes to Eq. (19.15) in Section 19.2.2. For the sheer perversity of it, use the Airy functions Ai(ie) and 
Bifie) directly, with the caution that the argument w(x) can be both positive and negative (in contrast to the usage of 
z in the Zi(c, z) functions, where it is positive only). In terms of k 0 = \j2m n Q> b /h and/ = 2 m n F/h 2 , then 

1s2 _ V2 _ fy 

W (X) = ° /2/3 ; (24.10) 

The values of wQc) at the interface are what matters, so let w 0 = w(0) and w b = w(L). Because the right-hand side of the 
diode is characterized by a potential drop V b rather than a field, then F = V b /L and introduce k b = sj2m n V b /h. The wave 
number in the right-hand side metal is then 

k' = sjk 2 + k 2 b (24.11) 


6 Francis Bacon, Novum Organum, Great Books of the Western World, ed. Robert Maynard Hutchins, Vol. 30, Francis Bacon. Chicago: Encyclopaedia 
Britannica, 1952, p. 121, Aphorism 94. The paraphrase "Truth is the daughter of Time, not of Authority” was uttered by Galileo in the Bertolt 
Brecht play of that name when he was unsuccessfully entreating the Prince and his courtiers to look through a telescope at the moons of 
Jupiter, which would have upended the authority of Aristotle. 
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All that modifies the Fowler-Nordheim triangular barrier analysis is the insertion of the matrices for the interface at 
x = L, or 



/1 1 \ * / Bi(M> 0 ) Ai(tn 0 ) \ 

\ik -ik) \-/ , / 3 Bi'(in ( ,) ) 

2k \k i ) y-/ 1/3 Bi'(«; 0 ) -f l ^Ai\w 0 )) 

( Bifwj) A\(w b ) \ ' / i 1 \ 
V-/ l / 3 Bi'(^) —f l / 3 Ai'(w b )J V*' -ik 1 ) 

(-f l,3 Ai\w b ) —AiK)\ / 1 1 \ 

/*/ 3 ^ /i/3 B f( Wb ) Bi (w b ) ) \ik! -ik') 


(24.12) 


(24.13) 


(24.14) 


where it is evident that if S L were absent, the formulation would give the usual Fowler-Nordheim relation for a trian¬ 
gular barrier (Eqs (18.41) and (18.51)). The rendering of Eq. (24.12) into a numerical algorithm for the evaluation of 
D(k) is given in Section A3.16 and the output of that calculation is shown in Figure 24.3 for generic parameters. The grey 
vertical line in these figures denotes the location of k 0 , which corresponds to the barrier height by = h 2 k 2 : /2m n . 
The thermionic emission model would simply assert that D therm (k) is unity for k> k 0 but vanishes otherwise. More inter¬ 
esting is the leading order behavior of the tunneling term in Eq. (18.59) and shown as the dashed lines, calculated 
according to 

D FN {k) = ^ exp ( - ^( k] - k 2 ) 3/2 ) (24.15) 


Clearly, the second interface makes a non-trivial difference, although it is equally clear that the thermionic prescription 
of Bethe and the triangular barrier model of Fowler and Nordheim do capture the general behavior. In practical barriers, 
where a rounding of the sharp edges is present in the simple MIM model, a softening of the oscillations that are evident 
occurs, as is generally true for rounded or smeared out potential regions (remember the parabolic barrier of Section 
19.1.1). This makes the qualitative prescriptions better, but does not completely erase the impact of the wave nature of 
the electron and the interference effects which arise because of it. The moral, that the wave nature matters, is going to 
become more evident as other modifications to the simple potential barriers are considered. 



Figure 24.3 Transmiss 

parameters are used: the height of the insulator conduction band is V 0 = 1 eV above the left metal Fermi level /r ; , the effective mass is m n = 0.5 m, and 
the width of the insulator region is L = 5 nm. The lines are labeled by the value of the bias voltage V b [eV]. The grey line labeled k 0 = \f2m n ® b /fi shows 
the location of the barrier maximum (as in Figure 24.2). The dashed lines correspond to D pN (k) of Eq. (24.15) (see also Eq. (18.59)). 


www.ebook3000.com 









Interfaces 309 


24.1.4 Interface roughness 

The ultimate measure of a man is not where he stands in moments of comfort and convenience, hut where he stands at times of challenge 
and controversy. 

- Martin Luther King, Jr. 7 

The more challenging the physical circumstances, the more illuminating the insight. The idealized barriers and smooth 
interfaces that form the one-dimensional models do not give a full account (although they are certainly suggestive) 
because they are silent about what happens when circumstances are rough. Roughness, in this case, refers to the undu¬ 
lations and irregularities of actual interfaces, for which the idealized models do not fully account for the complexity of 
the physics involved. Specifically, the electron momentum hk L (which is parallel to the plane of the interface) is taken 
to be conserved in passing from one side of the interface to the other (specular transmission), an approximation that 
ignores how structure complicates wave function matching in crossing the region over which roughness is expressed 
(diffuse transmission) [317]. 

In a planar, or one-dimensional, model, a one-to-one matching of the wave function on one side of an interface with 
a wave function on the other side exists, and so conservation of transverse momentum components holds. But when the 
roughness characterizing the interface is confined to a region between x a and x b , then the wave function to the left of x a 
with an energy E a is 

\a(Z)) = Z A i(ZA)\k a ) 

= J^A r (k' a ,k a )\k) 

whereas to the right of x h with an energy E b it is 

\K%)) = ^W%X)\%) 

= !>(%,*>) \h) 


(x < x a ) 

(X > X a ) 


(x < x b ) 
(X > X b ) 


(24.16) 

(24.17) 


(24.18) 

(24.19) 


In specular transmission the left-right coefficients have to be equal so that k is conserved, but in diffusive transmission a 
mash-up of the wave functions occurs and the coefficients A r and A t (as well as the Bs) no longer have to be the same. 
Current can then be evaluated by treating the passage through the interface region as a transition from the a state to the 
b state (a method credited to Bardeen), with all the mixing that entails. The diffuse case can be shown to then yield the 
specular case in the appropriate limits. 

Dry mathematical discourse hides ominous consequences, however, which take time to appreciate. As Stratton [317] 
puts it. 

The basic assumption underlying our technique for treating tunneling through a harrier with diffuse boundaries is that the tangential 
momentum of an electron passing through the boundary is not conserved, (emphasis added) 

That is going to make matters difficult. What applies to interfaces also applies to surfaces, and such calculations 
as thermal and photoemission emittance (emittance being a metric of great concern for electron beams) is going to 
be altered in ways profoundly difficult to anticipate (possibly even reflected in the observation that experimental emit¬ 
tance often is in excess of theoretical estimates based on one-dimensional theories [181]). The models are therefore 
indicative of what is happening, even if they underestimate a complete measure of it. With this caveat in mind, there¬ 
fore, consider how material considerations affect the transport equations, resulting in forms which often bear similarity 
to those of Part II, but which subtly change what the terms signify. 


7 Martin Luther King Jr., Strength to Love, Gift ed. Fortress Press, 2010, p. 26. 
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24.2 Band bending 


If it takes my whole life I won't break I won't bend 
It'll all be worth it worth it in the end 

- Sarah McLachlan 8 

A reed got into an argument with an oak tree. The oak tree marvelled at her own strength, boasting that she could stand her own in a 
battle against the winds. Meanwhile, she condemned the reed for being weak, since he was naturally inclined to yield to every breeze. The 
wind then began to blow very fiercely. The oak tree was torn up by her roots and toppled over, while the reed was left bent but unharmed. 

- Aesop 9 

The conduction band minimum for metals will not bend. In contrast, the much lower carrier density in semiconductors 
results in an inability to resist the penetration of an external field, and so their conduction bands do. In dealing with 
screening and the dielectric constant, a subtle concept behind Eq. (21.81) may have been missed, but for identifying 
what happens to the emission barrier at the surface of a semiconductor (or at the interface between a semiconductor 
and a metal), it is rather important. That concept concerns how the chemical potential p depends on the potential <p, 
conjoined because of their relation to the density. Because charge piles up at the surface to screen out an applied field, 
speaking of the electrochemical potential pix) is useful: it is measured with respect to the conduction band, but graphically 
it is common to represent the electrochemical potential as a straight line in a band diagram, and let the conduction band 
deform in response to changes in density occurring because of an applied electric field. In other words, the semiconductor 
bands (like the reed) bend in response to a force. 

In the absence of a force and at equilibrium, the chemical potential p is not going to change from place to place because 
if it did, there would be regions where the average electron energy would be higher (or lower), and flow would occur 
until conditions balanced. But when a force is present that caused the density to change, then if the variation in the 
potential characterizing charge buildup is not too rapid, the local density might appear somewhat flat, and therefore 
speaking of a local chemical potential does not seem too audacious. But that local chemical potential will be the chemical 
potential raised or lowered by the local <p(x), the gradient of which gives Fix) inside the material, and so in one dimension 

p{x) = p + (pix) (24.20) 


When p(x) appears with an argument henceforth, then it is the electrochemical potential, but when p is alone, it is the 
chemical potential of the bulk. 10 The electrochemical potential pix) allows the density p(x) to be evaluated from Eq. (6.8) 
with p -*■ pix). Mostly, attention shall be devoted to material in the negative x region (i.e., x < 0), with that material being 
metal or semiconductor, so that the region for x > 0 is vacuum (or a differing semiconductor). 

When treating the image charge for semiconductors, the continuity of the field across the semiconductor-vacuum 
interface, or Eq. (22.23), had to be respected, but the fields are related to the variation in (pix). Therefore, band bending 
can be understood by relating the FD distribution, the density, and the potential through Poisson's equation expressed as 

^(Pix) = fipix) - p 0 ) (24.21) 

where p 0 is the (constant) background charge in a jellium model (the positive cores are represented by a uniform distri¬ 
bution of charge). By introducing F = d x <p then the left-hand side becomes 


d 2 , d 

= ~r F = 
dx 2 dx 

The right-hand side can be rendered, using Eq. (6.8) 

1 d 


(P)±F= 1 -±F 2 
) d(P 2 d(P 

the form of Eq. (6.17), as 11 


= ^ (jfp) {Fy 2 (P(F + <P))-F l/2 m} 


(24.22) 


(24.23) 


8 Sarah McLachlan, lyrics to Answer, 2004. 

9 Aesop, "The Oak Tree and the Reed" (trans. Laura Gibbs), Aesop’s Fables. Oxford, New York: Oxford University Press, 2008, Fable 202, p. 102. 
The Gibbs translation provides the endomythium fables (the moral is inside the story) rather than the epimythium fables (the moral is an italicized 
sentence after the story) as in the perhaps better known (i894) Jacobs translation (which, truth be told, often seems to miss the point). 

10 It would be nice to refer to the bulk chemical potential as p 0 , but then that would run foul of Section 6.4, where p 0 referred to the T -* 0 
limit; the presence-of-an-argument notation will have to do. 
u ICeep in mind that F(x) is afieldbut F„iu) is a dimensionless function. 


www.ebook3000.com 




Interfaces 


Up till now, no mention was made of the sign of $(x) but now there is cause to. When <j>> 0, then g{x) > g and the 
electron density increases because F l/2 fpg{x)) increases: such a region is known as an accumulation region. Conversely, 
when tp < 0 then F } j 2 ipg(x)) and so the electron density decreases, connoting an absence of electrons: such a region is 
known as a depletion region. Each region affects how Poisson's equation is solved, theoretically or numerically. 


24.3 Accumulation layers 

It becomes indispensable to accumulate, to acquire property, in order to retain one's good name. 

- Thorstein Veblen 12 

A semiconductor such as silicon, when exposed to a vacuum field, has its conduction band pulled down, as in 
Figure 24.4. Letting the surface be at x = 0, and taking the vacuum field to be F uac , then at the surface Fix = 0) = F uac /K s 
and 0(0) = tp s whereas deep into the bulk the field and potential both vanish. An integration of both sides of Eq. (24.23) 
from cp = 0 to <p then gives a relationship between F and (p that can be used to find the dependence of each on position 
x. To do so, however, requires knowing the integral of F p (u). Using Eq. (6.18) it is 

G p (u) = F p (s)ds = s p In j } ds (24.24) 

and so the integration of Eq. (24.23) gives 

, 8x0 / N r \ 

F(4>) 2 = -p {G 1/2 {U) - G l/2 {u 0 ) - P4>F 1/2 (u 0 )} (24.25) 

where u = pig + tp) and u 0 = pg. 


EXAMPLE: Express the coefficient sj8nQN c /K s p 5 l 2 as a field F p and determine its magnitude if the temperature is 
T= 300 K and r e = 1. 

SOLUTION: From Eq. (6.19), N c = 6.8122 eV“ 3 / 2 nmT 3 . Then, using Q = 0.35999 eV nm and p = 38.682 eV" 1 , it 
is found that F fi = 0.023691 eV/nm ->■ £ = 23.691 MV/m. 


If conditions are such that pg{x) < -4.6 then «: 1. Then, 


G i/ 2 (m)«F 1/2 (m)« [ s l ! 2 e~ u ds = 
Jo 




(24.26) 



Figure 24.4 A semiconductor surface subject to high field, for which band bending results in an accumulation region near the surface. The 
semiconductor occupies the region for which* < 0. For the configuration shown, in the bulk (x -> -oo), the electron density is non-degenerate, but 
near the surface, it becomes degenerate. 


12 Thorstein Veblen, The Theory of the Leisure Class: An Economic Study of Institutions, Introduction by C. Wright Mills, New American Library, 1953, 
p. 37. 
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Figure 24.5 The ratio of F(<t>) 2 with (8«-QAf c -\/®e"» / 2K s fi 5 / 2 ) (black dots) compared to the approximation of Eq. (24.27) (red line) and its asymptotic 
forms (dashed green for s large, solid blue for s small) for T = 300 K, n = -0.1 eV, and S>n = -3.8682. 


and so 13 


3 

K s \p/2) 2 p 


{e u ~ u ° - 1 - (« - u„)} 


(24.27) 


The ratio of F(r/>) 2 with the coefficient of the bracketed term {■ ■ ■} in Eq. (24.25), compared to the approximation in 
Eq. (24.27), is shown in Figure 24.5 for silicon-like parameters. The approximation is seen to be rather good. When 
conditions are such u — u 0 = p(p(x) - p) = Pep is small, then the exponential can be Taylor expanded to second order, and 
the relation simplifies to 




(24.28) 


where p « N c yfne^/p i/2 from Eq. (6.37), and the Debye screening constant k D from Eq. (21.88) has been used. 
Eq. (24.28) is very familiar: it is of the form 

F =|-<£= T ^ <Kx) = 4> s exp(x/2) (24.29) 

ax X 


and so p(x) - p decays exponentially (remember that x < 0) into the bulk with a length constant of X = 2^J~Kjk D . 


EXAMPLE: For silicon-like parameters (p = —0.1 eV, K s = 11.8 and T = 300 K, but let r e = 1 for simplicity) estimate 
what vacuum field is required to give (j)(x = 0) = <£ s = 40 meV. 

SOLUTION: Use Fp = 0.023691 eV/nm from the previous example. To evaluate the remaining terms, first find 
pep = 38.682 X 0.04 = 1.5473 and then = 0.020897. Lastly 

-(e H -p<P- 1)= 1.9066 
4 

The product of these factors with their appropriate powers, using Eq. (24.27) as a guide, results in 
0.023691(1.9066 X 0.020897) 1/2 = 0.0047288 

which is the field F s (in [eV/nm]) just inside the semiconductor. Outside the semiconductor, the vacuum field is 
F vac = K S F S = 0.05580 eV/nm, corresponding to an electric field of about 56 MV/m. 


13 Compare Eq. (2.28) in Monch [213], Also, keep in mind F(x) = F(4>(x)) « F(4>). 
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Conversely, when the electron density is large, then either u g = pg can become large and positive (as for metals) or 
<p can become sufficiently large that Pg(x) = P(g + <J>) is large and positive (as for field emission conditions), that is, the 
electron density right near the barrier is degenerate [312], as in the discussion following Eq. (22.22). When u « u 0 » 1, 




(24.30) 


The dominant term in the {■ ■ ■} brackets, equivalent to Eqs (24) and (25) of ref. [312], are familiar in the treatment of 
field emission from semiconductors. If <p/g is small, a Taylor expansion of (u/u g ) 5 / 2 = (1 + (4>/g)) 5 ^ 2 allows Eq. (24.30) 
to become 

ShrD / M \ f \/u .. _ / ,/> \ 1 fcL 

(24.31) 
the bulk, but now 




where k TP is defined by Eqs (21.85) and (21.12). Again, 4>(x) = cp s ex p(x/2) decays exponentially 
with A = lyjKjkjj,. 

Because E($) in Eq. (24.27) is only a function of <p, a rapid method of numerically finding <p(x) is available. Using 
Ffi = \/&jiQN c /K s P 5 / 2 shows that the equation is the same as 


F(4>) « F,( |) 1/4 ^ /2 {e^ - 1 - /?</.>) 1/2 


(24.32) 

which is sufficiently accurate for finding <p(x) compared to Eq. (24.25) for semiconductors, but far easier to numerically 
evaluate. 

Let fy = <p(Xj) be evenly distributed from </> = 0 to <p = (p s , where 4> s is the value at the surface (x = 0). Similarly, let 
Fj = (d<p(x)/dx)\ x=x . Finite difference methods, discussed in Section Al.3.2, show that a second-order accurate approxi¬ 


mation to the derivative results in 


Fm+F,- 


(24.33) 


which can be reworked to find x +1 in terms of the other factors, or 


(fpj+x ~4>j\ 

= x, + 2 — - - ) (24.34) 

2+1 2 \PfH+Fj) 

Starting with x 1 = 0 and fy = <p s (N —j+ 1 )/N, the values of Xj are built up sequentially as j progresses from 2 to N. 
Because fy +1 < as j increases, Xj becomes more negative. Observe that 4> N+] = 0, which should occur at x N+1 = oo 
(i.e., far into the bulk), but the algorithm fails at that boundary because a finite difference is a poor approximation for an 
infinite step. But then, the calculation need not be run to j = N: it can just as readily truncate when x } passes a threshold 
value x max , a flexibility that can be put to good computational advantage. Two examples are shown. Figure 24.6 (left) 
shows typical silicon-like parameters with a moderate doping level, for which Eq. (24.29) is seen to be a reasonable 
approximation. Open circles ($(x)) refer to a numerical solution using Eq. (24.34) and implemented in Section A3.14; 
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the red line is the approximate solution of Eq. (24.29). The vacuum conditions correspond to K s = 11.9, T = 300 K 
(corresponding to p = 38.68 1/eV), F uac = K S F( 0) = 36.2 MV/m, X D = 78.78 nm, cp s = 0.02 eV, and p = 2.19 X 10 16 cm” 3 . 
Figure 24.6 (right) shows the same parameters but for F vac = K s F(fi) = 3.47 MV/m and <p s = 0.1 eV, for which Eq. (24.29) 
is no longer adequate given that exp(/?$ s ) is no longer sufficiently small. 


24.3.1 Factors affecting emission equations 

Strange, is it not? that of the myriads who 
Before us pass 'd the door of Darkness through, 

Not one returns to tell us of the Road, 

Which to discover we must travel too 

- Omar Khayyam 14 

Accumulation layers drag down the conduction band minimum and thereby changes the barrier height, affecting how 
electrons strive to get over or through the barriers and into the darkness beyond. Even though the doping may be such 
that the electron statistics are non-degenerate at zero field (Maxwell-Boltzmann), as the field increases, electrons can 
accumulate until the statistics become degenerate. Therefore, field and thermal emission regimes will show markedly 
different behavior [312, 318], Nevertheless, restricting attention to the field emission regime forgoes examining some 
of the more profound differences that field emission from semiconductors exhibit [305, 313]) and the assumption of 
degenerate statistics sidesteps, so as to use p F , s, and J F evaluated by Eqs (17.12), (17.33) and (17.59). The parameters 
they use, though, are modified as follows: 

1. The semiconductor image charge potential of Eq. (22.29) shows that Q -► rjQ with 15 rj = (K s - \ )Q/(K S + 1) for every 
term dependent upon the Schottky factor \JdQF, although Q in the coefficient of Eq. (24.27) does not thereby acquire 
the factor r) because it is not associated with a Schottky factor (the K s attending e 0 in Poisson's equation has already 
been accounted for). 

2. The height of the barrier above the Fermi level p now changes with field, so that the work function ® is replaced by 
X — cp, with cp defined in Eq. (24.27). 

3. The factor cp(F) must be determined for each field. 

Observe that item 3 creates a particular problem because the inversion of Eq. (24.27) is not simple. An easier approach 
is to specify cp instead, and calculate the associated F(cp). The algorithm to do so behind the findings here is described in 
Section A3.17, which first finds the maximum and minimum <f> to be considered based on the vacuum field range to be 
considered, and then implements the three prescriptions above in terms of the electron affinity of the semiconductor /, 
the vacuum field F vac (<p), the dielectric constant K s , and the effective mass m n = r e m. Although Eq. (13.25) is convenient 
when the Forbes-Deane approximations to v(y) and t(y) (Eqs (13.23) and (13.24)) are used, the Murphy and Good form 
of Eq. (13.12) shows where the modifications take root: 


K s - 1 
" K s + 1’ 


\6n 2 ti<bt(y) 2 ’ 
fiFicp) 


-vly) 


2k B T y/2m%t(y) 

where = / - cp, then if the J T contribution is negligible compared to J p , 

•WW), T) Am? exp (-Jt.) e(A) 


(24.35) 

(24.36) 

(24.37) 


(24.38) 


14 Omar Khayyam, The Rubaiyat of Omar Khayyam (trans. Edward FitzGerald). Garden City, NY: Garden City Books, 1952, p. 167, Quatrain LXTV. 
15 Observe that Stratton [305] uses the notation v in place of n as used here, the connection being v = y/rj, but this would cause confusion with 
v as used in Eq. (13.25), and so Stratton's convention is not used. 
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where acts as a work function (height of barrier above Fermi level) but means the difference between the electron 
affinity (x) and the changing band bending at the surface For rather generic semiconductor parameters of x = 
4eV, r e = 0.2, K s = 8, and n 0 = —0.1 eV, then the results of the algorithms in Section A3.17 are shown in Figure 24.7 
for and Figure 24.8 for J(F, T) for two temperatures of 300 IC and 600 K. Although the J curves are rather close 
(and possess a slightly visible non-linearity), they are distinct, whereas the traditional copper-like curves shown alongside 
for the same temperatures overlap. As expected, temperature changes have a greater consequence for semiconductors. 

24.3.2 Small barriers 

... like so many unfortunate events in life, just because you don't understand doesn’t mean it isn't so. 

- Lemony Snicket 16 

Something becoming "smaller" is often synonymous with "easier",- but there are times where the surprise must be 
subdued and the consequences dealt with. In the development of Eq. (13.3), the size of the upper limit of the current 


n evaluation of the surface 




Figure 24.8 Evaluation of the field ei 

m n = 0.2 m, a dielectric constant K s = 8, a zero field pi = -0.1 eV, and an electron affinity of x = 4 eV, using the algorithms of Section A3.17. For plotting 
the maximum F was set to 10 eV/nm and the minimum to 1 eV/nm. 


16 Lemony Snicket and Brett Helquist, A Series of Unfortunate Events: The Bad Beginning. New York: HarperCollins Publishers, 1999, p. 162. 
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density integral, designated Cp, was assumed large, allowing it to be taken to infinity without much error, and resulting 
in a simpler equation: the current density became independent of p. Semiconductors are a different case, for example 
a typical metal-like barrier has a height of p + ® = 11.5 eV (copper), whereas a typical semiconductor-like barrier has 
X = 4.05 eV (silicon). No longer taking the limit Cp -*■ oo in Eq. (13.3) will introduce a ^-dependent term to the current 
density. That term can be found by 


where X -* 1 at the end. Performing the differentiation then setting X 


J _£ AC// \ 


1 gives 

J sFN (T) = AF 2 exp (-|) [z (i) - (Cp + 1)<T C "] 
‘2r e m<t> b t(y) 


C= - 


(24.39) 


(24.40) 

(24.41) 


' hF(4>) v 

a form comparable to Eq. (24) of Stratton [305], hence the small s subscript, to treat electrons emitted from the conduc¬ 
tion band of semiconductors, but there are a few differences. First, and like Murphy and Good (Eq. (55) of ref. [110]) 
before him, Stratton uses an analytical integration of a limiting form of the current integral to obtain the temperature 
dependence, giving rise to 

(24.42) 


f s P_1 ln(l + s)ds = — -r ——- 
Jo psm(jrp) 


and resulting in a factor nCk B T/ sin(?r Ck B T) in place of the X(l/«)-based terms. For small argument, both thermal modifi¬ 
cations are about the same, but the singularity in E(1 /ri) as n -*■ 1 is balanced by that in L(w), as discussed in the treatment 
of Eq. (17.65))), a behavior in contrast to nCk B T/ sin (nCk B T) when Ck B T -*■ 1. 


EXAMPLE: Find the temperature at which the factor nCk B T/ sin (nCk B T) faces difficulties. Assume a field of F = 1 
eV/nm, a work function term of = 3 eV, and an effective mass ratio of r e = 0.5. 

SOLUTION: "Difficulties" can be taken to correspond to Ck B T m 1/2, for after that point, the denominator ceases 
to increase and begins to decline. Because t(y) is already close to (but larger than) unity, approximate it by unity. 
Then, expressing all factors in units of [eV], [nm], [fs], and [q], 

„ h F 11605-0.65821-1 

1 = — -= - 1 462.56a 

Ak B \j2r e m<b 4\/2 • 0.5 • 5.6856 • 3 


EXAMPLE: Compare the size of the correction term in Eq. (24.40) for metal-like, then semiconductor-like, values 
of relevant parameters at room temperature, assuming the surface field is approximately 7 eV/nm. 

SOLUTION: Take metal-like to mean p = 7 eV and O = 4.5 eV, for which Cp = 3.3367 and 

(Cp + l)e~ Cl * = 1.7486893 X 10 -9 

For semiconductor-like, from Figure 24.7, <fi « 0.342 eV when F =7 eV/nm, and let / = 4.05 eV, r e = 0.2, p = —0.1 
eV, and K s = 8, for which C(p + </>) = 0.3298 (the electrochemical potential pUjj) = p + tj) from Eq. (24.20) must be 
used) and 

(Cp(4>) + l)e“ c " w = 0.9562 
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24.3.3 Pre-factors of transmission probability D(£) 

Teaching about FN tunnelling does not use FN'sfull quantum-mechanical approach, because the simple-JWKB (Jeffreys-Wentzel-Kramers- 
Brillouin) approximate method is easier to understand. When applied to FN's exact triangular barrier, the simple-JWKB method 
generates an approximate formula without the tunnelling pre-factor. 

-Richard G. Forbes [319] 

A wary reader may be unsettled that the electrochemical potential can be negative as well as positive. A negative p{<t>) 
means the barrier is substantially larger at lower fields, and therefore a possibly stark change in the current-field plots 
should be expected. Stratton and others, such as Baskin etal. [313], consider such a circumstance and find behavior very 
much different than that of metals: the path to demonstrate this is arduous, but, although the results are aesthetically 
interesting, the operational regime for field emission from semiconductors is for circumstances different from where 
the atypical behavior occurs. Here, instead, consider the more relevant modifications to the transmission probability 
coefficient. 

Where the smallness of the electrochemical potential makes a difference is in the oft-neglected coefficient P(E) 
of the transmission probability D{E), called the "tunneling pre-factor" by Forbes [319]: its effects can be substantial, 
especially for drawing inferences from experimental data. Its neglect is perhaps due to the expectations created by metal 
parameters. That is, for the triangular barrier, P{E ta p) was given by Eq. (13.9): for copper parameters, P(p) = 1.9522. But 
metals are not all there is: the pre-factor matters for more than just the magnitude of the overall current density, rather it 
affects the behavior of the current density integrand. As a consequence, the distribution in energy of the emitted electrons 
(the normal distribution, anyway, recall the distinction between normal and total energy distributions in Figure 17.6) is 
also affected. For metals, the integrand was concentrated about p, so P{E) was reasonably evaluated at E = p and P{p) 
could be pulled out of the integral. Such an approximation is reasonably good, where the presence or absence of P(p) 
changes the height but not so much the general shape. For semiconductors, the electrochemical potential p(c/)) = p + d> can 
be small, and so the rapid decline of the integrand evinced in metals due to the behavior of the transmission probability 
D(E) is not so strong. Both cases are shown in Figure 24.9. As a result, with semiconductors the approximation of P(E) 
by its evaluation at p and its extraction from the integrand (and often, the subsequent overlooking it endures) is poor 
because the low-energy behavior of P{E) now affects the integrand. 

The rising importance of P(E) threatens the utility of the otherwise simple Fowler-Nordheim equation, but, curiously, 
there is recourse. Recall from Eq. (18.59) that P(E) behaves, to leading order, as 

P(E(k)) « 4 k/k 0 (24.43) 



Figure 24.9 A comparison of the current density integrands dJ(E) for metal parameters (m n = m, p = 7 eV) (left) and semiconductor-like parameters 
(m n = 0.2m and p + 0 = 0.2 eV) (right). Parameters common to both: T = 300 (blue) and 600 K (red), F = 5 eV/nm. For convenience, 4> = ® b = 4.5 eV 
and K t = 8 for each. 
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for E = h 2 k 2 /2m n , a result for the triangular (no image charge) barrier. But observe that, first, V(x) for semiconductors 
is not as blunted as for metals as in Figure 22.9 because of K s and, second, that what remains is fairly triangular itself. 
This has led to the suggestion that k 0 be replaced in P(E) by a term that goes as the actual barrier height but otherwise 
retaining the image charge form of the Gamow factor 0(E), or 




(24.44) 


where V 0 is the height of the barrier above the conduction band, or % - where p(<j>) = p + <p is the electrochemical 
potential. Such an approximation works quite well [318], giving the current integrand behavior up to a scale factor close 
to unity, as compared to an Airy function transfer matrix solution of D(k), as shown in Figure 24.10 for semiconductor-like 
parameters (based on Figures 1 and 5 of ref. [318].). 

Alas the calculation of J(F) in Figure 24.10 is by the numerical evaluation of an integral rather than the relative 
simplicity of a formula no more complex than the J m in Eq. (13.4): the cause is P(E). The modifications to the current 
density formulation are most transparently revealed by comparing the current density J(F) evaluated with P(E) compared 
to that evaluated with P(E) -> P(p). The latter shall be called J^F) and leads to Eq. (13.12) if P(p) -*■ 1. The ratio of the 
two current densities is therefore 


J(F) 

J„(F) 


-L 


Cn 


(24.45) 


where C= \j2m n <P b t(y i )/TiF, the b subscripts denoting that these quantities are different than their metal analogs by 
Oj, = x ~ and y b = 4riQF/® b . The integral is a special case of the Kummer function (alternately known as the 
confluent hypergeometric function) for all the good that does: its evaluation will, for most who need it, be accomplished 
numerically anyway. Knowing the limits is helpful, particularly when Cp is large (as for metals) or small (as can be the 
case for semiconductors). For small Cp = p, Taylor expand e~ s to find 


m 

J,(F) 


Conversely, for large Cp = p, Taylor expand (1 - (s/p)) 1 / 2 to find 


m „ V_zl _ p 2 ~ 2 c -v 

J^P) p 2 p 


(24.46) 


(24.47) 


where both expansions are only to first order. A comparison of these expansions to a numerical evaluation is shown in 
Figure 24.11. Whether using the exact evaluation or the approximations for the Kummer function, as Cp decreases the 
impact on J(F) can be substantial, thereby boding poorly for the neglect of P(E) in the J(E) relations. 



Figure 24.10 Pote 

parameters jf s 300 K, p = 10 19 cm' 3 , fi = 0.28 eV, K s = 9, m n = 0.06 m, / = 3.6 eV). V(<2) refers to V 0 - y/4riQF, so that V(0) = V„. On the left, VQc) 
shows band bending in bulk, so that 4> is reasonably large at the suface. On the right, "Numerical" refers to the J evaluated using an Airy transfer 
matrix approach to finding J(F). Based on Figures 1 and 5 of ref. [318], 
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Figure 24.11 The ratio of Eq. (24.45) compared to its small and large x approximations (Eqs (24.46) and (24.47), respectively). Although the large x 
expansion appears sufficient throughout, it is a factor of « 5/4 larger than the exact case for small argument. 


24.4 Depletion layers 


Don't it always seem to go 

That you don't know what you've got 

Till it's gone... 

- Joni Mitchell 17 


The opposite of electrons piling up (accumulating) at an interface and thereby causing the bands to bend downward is 
the pushing away (depletion) of electrons and the consequent bending of the bands upward, such that tf> in g{4>) = p + 4> 
becomes negative. Although a common situation in semiconductor devices, where the doping is so low that F 1/2 (/?/t(x)) 
may be approximated by a Maxwell-Boltzmann distribution and Poisson's equation readily solved, consider instead 
an interestingly different situation in which the electrons in the conduction band (and holes in the valence band) are 
negligible compared to the presence of ionized dopants : they, too, result in band bending. Several cases are typical and of 
interest to electron emission: 

• For field emission, the ionization of acceptor atoms causes the shielding of the bulk material from an externally applied 
field, as occurs with boron doping in diamond. 

• At the interface between diamond and a metal, intentional nitrogen dopants become charged and lead to an internal 
field emission barrier [298, 320, 321], 

• A nitrogen layer can keep generated secondary electrons from being absorbed by a metal back contact [322]. 

The creation of a depletion layer that attends the nitrogen doping cases (introduced in Section 16.1) is suggested by 
the sequence shown in Figure 24.12. The scheme presented is a simplified representation, sometimes characterized as 
an abrupt junction with no trapped charge states between a metal and an n-type semiconductor (a classical Schottky 
barrier): the existence of trapped charge and the presence of dielectric phases due to the intermixing of the metal and 
semiconductor makes the behavior of the current flow between metal and semiconductor more complex [213, 323], and 
such complications change the barrier parameter (and introduce new ones), but such complexity can be built upon the 
simple model of the classical Schottky barrier. 

The greater understanding of semiconductor features, and the intention to demonstrate a numerical method of finding 
the band bending profiles associated with more exotic doping profiles, is dealt with by, first, exploring more fully the 
depletion width associated with boron charged impurities, followed by considering nitrogen and boron effects next. 
Such a model will be useful in Chapter 32.1 when considering the transport of generated secondary electrons in a thin 
diamond film. 

Were diamond a perfect insulator, then the potential across a thin layer of it deposited upon a metal surface and subject 
to an electric field would be <p(x) = (F vac /K s )x for x measured from the metal-diamond interface. But because charge in 


7 Joni Mitchell, lyrics 


tig Yellow Taxi, 1970. 
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Figure 24.12 Top, a metal (left) and an w-type semiconductor (right) before contact; Middle, after contact but before charge migration (a discontinuity 
in B is unphysical); Bottom, after contact and charge migration, which creates a depletion layer. 

the form of thermally excited electrons, holes, and charged donors and acceptors exists, then the potential must instead 
be determined by the solution to Poisson's equation in the form [213, 324] 

d 2 x m = ^ («(*) - p(4>) + <(</>) - N d (4>)) (24.48) 

where n and p are the number densities of the electrons in the conduction and valence bands, respectively, and 
and N~ are the number densities of the ionized acceptor and donor atoms, respectively. The rationale for making them 
functions of <j) are because it determines the density of electrons via p(x) for electrons (and analogously for holes) and 
because it is the bending of the bands that determines whether charge can tunnel away from donors or to acceptors 
[298, 320]: a nitrogen atom with an energy level 1.7 eV below the conduction band sees something looking much like 
a tunneling barrier when the diamond layer is under a field, and so the tunneling of its electron (or alternately the 
injection of its hole) will leave the nitrogen atom charged. Moreover, trace amounts of boron are always present even 
in high-quality diamond, so that doping levels of even 10 14 cm -3 boron are considered quite pure (about 1 boron atom 
every 215 nm). The different processes that can be involved are schematically illustrated in Figure 24.13 for diamond on 
a semiconductor. 18 If conditions are favorable to the ionization of a boron impurity (in contrast to the nitrogen ionization 


ls For diamond on a metal, see Figure 6.8 of Novel Cold Cathode Materials [166]. 
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Figure 24.13 Processes in band-to-band transport/internal field emission for a semiconductor/insulator interface (an accumulation region ( a ) occurs if 
the substrate is a semiconductor, although the substrate could be a metal). The processes are: 1, electron injection from the accumulation region; 2, 
tunneling from donor or impurity; 3, hole injection; 4, Zener tunneling. The diamond surface shown is a positive electron affinity (PEA) surface; a 
hydrogen-terminated surface would be a negative electron affinity (NEA) surface. Levels are not to scale. Based on Figures 2 and 3 of Cutler et al. [298]. 


of process (2) in Figure 24.13, boron atoms close to the surface can accept stray electrons and also become charged), then 
a layer of negative charge can build up and screen the applied field from the bulk. The charging of the boron creates a 
different kind of depletion layer. 

The nature of the depletion layer for two kinds of dopants in an insulator provides a rather good model to introduce a 
second numerical method to compliment the one introduced in Section 24.3. Take the thickness of the diamond layer on 
the substrate to be L, and the vacuum field applied to its surface to be F vac , so that in the absence of charged impurities 19 
the field across the layer would be F 0 = F vac /K s , where K s = 5.7 in Table 24.1. If L is sufficiently large, then it will exceed 
the depletion width w = £ 0 F vac lN a q 2 = F vac /\6nQN a (Eq. (16.4)). Even for low dopings, this width is small, for example 
for N a = 10 15 atoms/cm 3 , then w = 55.26 nm for a field of F vac = 1 eV/pm (equivalent to an electric field of 1 MV/m). 
Now consider Poisson's equation applied to d> when two dopants are present: 


d 2 .. . 16 7i Q / (/ 


(24.49) 


where the subscripts refer to boron (b) and nitrogen (n) for the doping densities N, and the charge of the doping is 
associated with the sign of the term. In contrast to the boron, however, let the nitrogen doping levels be rather high, and 


Table 24.1 Parameters used in the numerical solution of Poisson's equation in diamond. 


Symbol 

Definition 

Value 

Unit 

m 

Mass of electron (vacuum) 

0.511 

eV/c 2 

m* 

Mass of electron (diamond) 

0.57m 

eV/c 2 

e g 

Diamond band gap 

5.5 

eV 

p 0 

Reference density 

10 16 

#/cm 3 

n b 

Boron factor (p b /p a ) 

0 or 0.023 

Po 

n n 

Nitrogen factor ( pjp 0 ) 

0 or 10 

Po 

K s 

Static dielectric constant 

5.7 

[-] 

L 

Diamond flake thickness 

8 

|im 

x„ 

Nitrogen layer thickness 

0.3 

|im 


Bias potential </KL) 

H 

eV 


J By virtue of its large band gap, diamond is already relatively free of roaming electrons and holes. 
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the diamond grown so that only the immediate region 0 < x < x n near the back contact has significant doping, where x n 
is the boundary past which no nitrogen is present. Now what does the potential look like? 

Re-expressing the equation in dimensionless terms is a good precursor to preparing the problem to be solved numeri¬ 
cally. Set the back contact at x = 0 and the surface at x = L. Scaling all lengths then by L, let s = x/L and s n = xjL. A good 
energy scale emerges naturally from Poisson's equation. Let cj> = y<P 0 ’ where 4> 0 is to be determined. Further, let N a = n b p 0 
and N d = n n p 0 so that n b corresponds to boron doping and n„ to nitrogen doping. Also, p 0 is a reference density which can 
be rather arbitrarily set: p 0 = 10 16 atoms/cm 3 is an agreeable choice, being a mid-range doping density. Then Poisson's 
equation is, in terms of the dimensionless quantities s and y for s < s n 


(24.50) 


with an analogous equation for s>s n . This serves to identify 4> 0 as 


(24.51) 


A dimensionless replacement for the field is also sought: from d x $(x) = —F uac /K s , then d s y = —R demands R = F mc L/K s (/) 0 . 

Solving Eq. (24.49) is as close to trivial as ordinary differential equations get: as the right-hand side is a constant, 
y(s) must be a quadratic function in s, and so solutions are of the form 


y(s < s n ) = A 2 s 2 + AyS + A 0 (24.52) 

y{s > s„) = B 2 s 2 + ByS + B 0 (24.53) 


where the constants Aj and Bj must be determined. As there are six unknown coefficients, six conditions or equations 
are required to specify them, found by demanding 

1. from Poisson's equation, d\y = ~(n b - n n ) for s <s n 

2. from Poisson's equation, d\y = -n b for s > s n 

3. the potential vanishes at the origin, or y(0) = 0 

4. the field at the surface demands d s y = —R 

5. continuity of d x cp(x) at x n demands d s y(s ~) = d s y(s^) 

6. continuity of 4>(x) at x n demands y(s~) = y(s+) 

where the ± on the sf t indicates infinitesimally larger or smaller than s n . These equations may be solved by tediously 
substituting one equation into the next, or rapidly and elegantly by translating the six equations into one large matrix 
equation, like this 


o 

o 

o 

o 

o 

^0 


n n - n b 

0 0 0 0 0 2 

Ay 


-n b 

1 0 0 0 0 0 

a 2 


0 

0 0 0 0 1 2 

B 0 


-R 

0 1 2s„ 0 -1 -2 s n 

By 


0 

1 S n si -1 -s n -si ; 

B 2 


0 


the inversion and solution of which (perhaps by Gauss-Jordan elimination if one must, perhaps by symbolic mathematical 
packages if one can) results in finding (i) A 0 = 0, (a) A 1 = -R + n b - n n s n , (iii) A 2 = ~(n n - n b )/2, ( iv) B 0 = -n„s%/2, (v) 
B 1 = -R + n b , and lastly, (vi) B 2 = -n b /2, and so 

y(s < s n ) = ^(n n - n b )s 2 + (n b - n„s„ - R)s 

y(.s > s„) = -\n b s 2 + 0 n b - R)s - ±n n s 2 „ (24.55) 

The depletion width w is recovered by setting n n = 0 and evaluating d s y(s = 0) = 0 to find R = n b , which shows 
w = Fvacl^Po n Q n b' but which is equivalent to the definition of w in Eq. (16.4). 
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EXAMPLE: Find the characteristic energy (p g if the diamond layer L is the same as the depletion width w for a 
boron doping density of 10 15 cm -3 = n b p 0 (i.e., n b = 0.1), and when the surface field is F vac = 10 eV/pm (equivalent 
to an electric field of 10 MV/m). Assume all the boron atoms are ionized, and the dielectric constant of diamond to 
be K s = 5.7. Then find the ratio R = F vac L/<p g , again assuming L = w. Why does R not change when the surface field 
changes for these conditions? 

SOLUTION: From the definition of w in Eq. (16.4) inserted into Eq. (24.51), and using N a = n b p 0 = 10 16 cm -3 = 
lON a , then, in [nfeq] units. 


xKflPo \n b ) 


1 / 0.01 
0.0010314V 0.1 


= 9.6954 eV 


Observe that for these parameters, w = 0.5526 pm. Substituting, 


R(L = w)= =n b = 0.1 


The independence here of R on F vac is a consequence of the special case L = w, as it is not in general true. 


Eq. (24.55) represents the rather special case of regions of constant doping and the comparative absence of carriers 
(electrons or holes), so that a polynomial expansion of 4>(x) provides an easy solution. This is not the general circumstance: 
were electrons to be present in the conduction band in any appreciable number, then the difficulties encountered in the 
treatment of space charge, as in Section 10.5, come back in force, and would result in a varying charge density, making 
numerical solutions perhaps a preferred means of finding <p(x). To show correspondence, however, the examples below 
shall apply to the piecewise constant doping density case so as to compare the numerical accuracy against the analytical 
model. This will be done even though the doping may be graded and additional charge present, as in the right-hand side 
of Eq. (24.48). 

The numerical methods of Section A 1.4 are quick and quite adaptable, and therefore possibly preferable to more 
sophisticated methods that require a higher investment of coding, memory, or both [236]. When both the left and 
right boundary conditions 0(0) and <p(L) are known, then the very fast and low-memory algorithm of Section Al.4.2 is 
attractively simple. 20 



x/L 

Figure 24.14 A comparison of the analytic solution of Eq. (24.55) to its numerical evaluation using DopePhi of Section A3.15.2 using the default 
parameters of that program (N = 24, n h = 0.01, n n = 2,R = 0.03, and s n = 0.26). The yellow region is where nitrogen doping occurs; boron doping 
occurs throughout. 


20 Were the field at the rightmost boundary F mc /K s known instead of 4>{V), then one could use the Section Al.4.2 algorithm in conduction 
with a bisection search algorithm. There are occasions where this is a reasonable approach, instead of the more conventional finite difference 
solutions to differential equations [242] that are always available. 
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The algorithm of DopePhi of Section A3.15.2 gives an example of how such a calculation might proceed, the results 
of which are shown in Figure 24.14. By using a dimensionless formulation, the values of L, 4> 0 , K s and F vac need not 
be explicitly given, as the solution only relies on the factors s„, R, n b and n n . The factor s n is somewhat unusual: it is in 
between two of the grid points at s nc and s nc+1 (where nc ** nc is an index term in the code). This is commonly the case: a 
discrete grid results in ambiguity as to where an exact point for an abrupt transition should be situated. Were the doping 
profile smooth and continuous, deciding where s n is placed and how the abrupt transition is numerically represented 
would not be an issue. For all that, even a rather coarse discretization as the one shown provides good correspondence 
to the analytical solution of Eq. (24.55). Consequently, the algorithm can be profitably used in analyzing the impact of 
electron density variation in tunneling structures [325] (it will also be useful when d*u = v appears elsewhere, such as 
heat diffusion in the coupled thermal equations associated with the laser impinging on a photocathode [85]). 


24.5 Modifications due to non-linear potential barriers 


.. .the fundamental goal of any emancipatory movement must be to demystify and democratize the production of scientific knowledge, to 
break down the artificial barriers that separate "scientists” from “the public”. 

- Alan D. Sokal 21 


The misuse of scientific jargon and sloppy pseudo-theories common to postmodern analyses was sharply satirized 
through the publication of a fraudulent article successfully pushed through a peer-review process to publication by A.D. 
Sokal in 1996. Lest pranking scientific journals seem an odd way of driving this point home, consider that Hans Bethe 
and friends G. Beck and W. Riezler did the very same when they were post-docs in getting a fake derivation of the fine 
structure constant published in Naturwissenschaften meant to satirize questionable numerological efforts fashionable at 
the time. 22 Although taking aim at the idea that "truth" is a social convention, Sokal's hoax (as it is called) brushed 
rather closely in places with a concern here: that in the efforts to describe and understand barriers, inordinate care with 
the mathematics and precision with the ideas is required lest the results border on rubbish. 

Finding the two roots of V(x) - E for an image charge barrier plus linear field is straightforward for the evaluation of 
the Gamow factor 0(E) needed to develop a field emission equation or find the Schottky reduced barrier for the other 
emission mechanisms. Inserting a non-linear component makes for more of a challenge: although the root equations 
may even be possible to write down, actually finding the roots takes more care than amateurs generally bring to the 
problem, largely because of the image charge term. Material interfaces provide a good non-linear potential candidate to 
practice upon. When two dissimilar materials are joined, charge migration occurs so that their electrochemical potentials 
are aligned, as in Figure 24.12. The band diagram that results depends on the doping levels, whether the semiconductors 
are n- or p-type, and how big the energy band gaps and electron affinities of each are. The intensity of the treatments 
[111, 213, 312, 315, 323, 326, 327] is indicative of the complexity of the problem, and while many of these works are 
concerned with emission, hole transport [323, 328] and transport between material layers can be treated with the same 
methodology as the linear potential barrier becomes non-linear in a way that will be useful for field emission barriers 
associated with field enhancement in Chapter 30. 

What is meant by "quadratic" is somewhat ambiguous. The periodic Schottky deviations of Section 19.2.3 encountered 
a potential barrier treated by Cutler and Gibbons [263] and subsequently applied to field emission by Cutler and Nagy 
[233]. Those barriers are "quadratic" in the sense that 


(24.56) 


or the potential is quadratic in 1 /x. It is flexible in modeling departures from the standard image charge potential while 
retaining the linear field asymptotically, but complicates analysis near the origin. In contrast, depletion barriers are better 
described by 

V(x) = g + O - Fx - - + yx 2 (24.57) 


21 Alan D. Sokal, Transgressing the Boundaries: Toward a Transformative Hermeneutics of Quantum Gravity, Social Text, 46/47, spring/summer 
1996, p. 217-252. 

22 The episode is recounted in S.S. Schweber, In the Shadow of the Bomb: Bethe, Oppenheimer, and the Moral Responsibility of the Scientist, Princeton 
Series in Physics. Princeton, NJ: Princeton University Press, 2000, p. 87. 
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Apart from the Q/x term, the remaining terms of Eq. (24.57) are quadratic in x, with y having units of [eV/nm 2 ]. 

The unpleasant lack of a closed form solution of the quadratic potential Gamow factor is bracing after the success of 
Fowler and Nordheim's solution for the image charge potential. With an almost Dylan Thomas-esque defiance, 23 there 
have been efforts to smuggle the Fowler-Nordheim equation back into the quadratic barrier [329-331], treating the 
non-linear potential barrier at interfaces by introducing a sheet charge that causes a discontinuous change in the electric 
field and putting a kink in the triangular barrier by causing an abrupt change in slope, as a consequence of charged 
traps. Doing so allows the current density to be recast as a product of Fowler-Nordheim-like terms so that much of the 
triangular barrier intuition does not have to be jettisoned, at the price that the position of the sheet charge and its charge 
per unit area require specification. The model is often used to treat metal-insulator-semiconductor layers for its ease. 

By contrast, a disinclination to rely on parametric values gleaned from data requires coming to terms with the actual 
barrier, and for the Gamow factor that means coming to terms with finding the three roots of V(x) — E= 0 [235]. It is one 
thing to speak of the roots of Eq. (24.57), and another matter to actually calculate them: iterative numerical methods 
can perform poorly if the initial guesses are poor; bisection methods are straightforward and always work once the roots 
are bracketed, but bracketing those roots (which, depending on the size of y, can be closely spaced) takes artistry. When 
many J{F, y) evaluations are needed, fancy artistry does not compete with a fast method using at most a few iterations. 

Some observations simplify the development of an appropriate method. First, in the limit that y -*• 0, two of the roots 
have to coalesce into the standard image charge roots of Eq. (11.26) with the last root approaching oo. Second, a rela¬ 
tionship between the roots and the usual factors V 0 , F, ®, and now y is established by demanding that V(x) - E from 
Eq. (24.57) be matched to 

_ y(x- Xl Xx-x 2 Xx-x 3 ) 


V(x) — E = - 


(24.58) 


where Xj are the roots with jm (1,2,3) such that x x and x 2 converge to x ± when y —> 0 , and x 3 is the largest of the 
roots which lies in a region where V(x) crosses back over the x axis beyond x 2 > x + , and which is therefore a parameter 
specifying V(x)-E rather than a physical root, as in Figure 24.15. Equating these forms, and then equating the coefficients 
of powers of x gives [235] 


F = y(x 1 + x 2 + x 3 ) 

V 0 = y(x l x 2 +x l x 3 +x 2 x 3 ) 
Q s = y(x 1 x 2 x 3 ) 


where V g = n + ® - E. If K s > 1, then Q -*■ Q s = Q/K s . 


(24.59) 

(24.60) 

(24.61) 


Figure 24.15 A comparison of the 
corresponds to y = 0. In practice, in the evaluati 
bias across the semiconductor were set equal to 



a of 9(E), the potential would be truncated at 


a constant thereafter if the 


23 Dylan Thomas's popular poem beginning, "Do not go gentle into that good night, /Old age should bum and rave at close of day; /Rage, rage 
against the dying of the light." (The Poems of Dylan Thomas, New Directions Publishing Corp., New York, 1971, p. 239) was a futile but valiant 
protest of that which no man can overcome. 
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The third and last observation makes possible an iterative approach: for any of the roots x ; - for which V(x } ) - E = 0, 
a grouping of terms shows that it can be related to itself via 


Q s + Fxf 


(24.62) 


Such a form is ripe for using the iteration methods of Section A3.7.2: in fact, those iteration methods converge fast for 
a good first guess. In the case of the smallest root x x , a great first guess is x_ from the standard image charge result of 
Eq. (11.26). Skip over x 2 for a moment, as a trick will be useful for it. That means consider x 3 next: when y is small, x 3 is 
large, and so the x 2 terms on the right hand side will dominate the neighboring terms in the numerator and denominator 
of Eq. (24.62): this is because yx\ = x 3 (Q s /x 1 x 2 ) as * 3 (Q s /x + x_) = Fx 3 , which is large. Therefore, x 3 as F/y is the asymptotic 
approximation. Let that approximation give the starting approximation to x 3 in Eq. (24.62). Now return to x 2 : rather 
than using x + , it is better to use the relation for Q in Eq. (24.61). The first approximations to the three roots are then 
given by 


Q s 

Vo+y /vZ-4Q s F 
F 3 + Q s y 2 
y(F 2 + yV 0 ) 

Q s 

n 1*3 


(24.63) 


(24.64) 

(24.65) 


where the approximation to x 2 is evaluated last. How well does it work? Using the algorithm of Section A3.18 with the 
choices E = fi, ® = 4.0 eV, F = 2 eV/nm, y = 0.22 eV/nm 2 , and K s = 8 (in the algorithm, Q s = Q/K s ), the first six iterations 
are in Table 24.2, and the convergence is rapid indeed, being adequate by n = 4 even though y is rather large, as suggested 
in Figure 24.15. Notice also in Table 24.2 that rapid convergence of x 2 is a consequence of the rapid convergence of x, 
and x 3 , and that x } is closely approximated by x_. 

With a method to obtain the roots, the quadratic Gamow factor 0(E) in Eq. (11.14) can be approximated. Remembering 
that V 0 = n + ® — E, 

'J2m n y f*2 f (x - x,)(x 2 - x)(x 3 -x) 3 1/2 

6(E) = 2 V / - 1 3 - dx (24.66) 

The length scale that matters most is that between the first two zeros, so let x = x, + F sin 2 (5) where L = x 2 —x l . 
The substitution results in [235] 


yj2m n y r< 2 1 L o 


0 


(24.67) 


Table 24.2 Quadratic image barrier roots. Iterative solution forx- as given by 
Eq. (24.62) indexed by n using the algorithm of Section A3.18 with its default 
parameters (V a = 4.0 eV, F = 2 eV/nm, y = 0.22 eV/nm 2 , K s = 8). The values for 
n= 1 are from Eq. (24.63). 


0.0113137 2.42554 7.45359 

0.0113136 2.88460 6.43117 

0.0113136 2.94774 6.15451 

0.0113136 2.94899 6.13078 

0.0113136 2.94899 6.13061 

0.0113136 2.94899 6.13061 
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Although Xy/L is small, (x 3 - x 2 )/L is generally large for smallish y, and so the radical of the numerator in the integrand 
may be expanded as a series in cos 2 cp. If the function R„(p) is defined by 


r*i 

R„(p) = 

Jo 


cos ^ n (p sm (p 


yfp + 


(24.68) 


then 


m Ci) + ( t ) } 


(24.69) 


For small p, the R„(p) functions can be replaced by R 2 (0) =1/3 and f? 4 (0) = 1/5, as can be shown using elementary 
trigonometric integrations. As a result. 


s ) 


where the dependence on E is through the dependence of the x ; - on V 0 = p + ® - E. 

A scaling argument can be obtained. In the limit that x 3 -»■ oo, y -* 0, and Q s —<■ 0, the triangular barrier 9 tri of 
Eq. (13.5) is recovered: small y is synonymous with large x 3 « F/y (as can be shown by substituting x ± in for x 2 and x, 
in Eq. (24.65)), and the neglect of x 2 and x, by comparison to x 3 . Thus 


,. 0 1 (fly' 


(24.71) 


where y s = \JAQ S F. If both Q s and y vanish, then L -> L g = V 0 /F and c(0) = 1, showing that 0/6 tri -> 1. For small y, 
therefore, 6(E) increases as ( L/L 0 ) 3 / 2 , where L 0 = V 0 /F is the triangular barrier width. 








CHAPTER 25 

Contacts, conduction, and current 


You see a number of great objects, and when you look at them there seems to you to be one and the same idea (or nature) in them all... 

- Plato 1 

25.1 Zener breakdown 

In 1934, the development of the energy-band theory of solids prompted Zener to propose interband tunneling, or internal field emission, 
as an explanation for dielectric breakdown. ...he showed that an energy gap could be treated in the manner of a potential barrier. 


- Leo Esaki [254] 


If the band bending at the interface is sufficiently strong, and the band gap sufficiently small, then electrons can tunnel 
directly through the band gap from the valence band, leaving a hole behind, a circumstance shown in Figure 24.13. 
The semiconductor is assumed to be n-type - were it p-type, then the charge carrier would be holes. Such a process of 
tunneling current through the interface is more common with p-n junctions rather than metal-semiconductor interfaces. 
The strong similarity of the tunneling barrier to Fowler-Nordheim field emission means that the tunneling current can 
be almost immediately written down after three modifications: (i) the height of the barrier <t> is now the band gap, (it) 
the mass m is now the effective mass m n , and (Hi) the field is the slope of the band bending at the interface. The energy 
difference is just the band gap energy ® -*■ E . The field at the interface is, from Eq. (24.49), F = I6nQw/K s , where from 
Eq. (16.4), w = K s V d /&nQN d , and where V d is the amount V(x) changes over the depletion width. The tunneling current 
equation for Zener breakdown is then most easily given by modifying Eq. (13.12) to get [152] 



(25.1) 


where F 0 = (32nQN d V d jKf) 1 ! 2 , y\ = 4 QF 0 /K s E g and m n instead of m is used. Lastly, when there is no bias then V d is the 
difference between the left side (metal) and right side (semiconductor) where the "work function" is measured with 
respect to the Fermi level (not the bottom of the conduction band). When a positive or negative bias V b is applied to the 
semiconductor, the location of the conduction band minimum changes, but this is not so for the metal. Hence, under bias, 
V rf V rf + V b in F 0 . If V b = V 0 cos(wf) such that V d > V 0 , then the current J zener would act as a rectifier by only appreciably 
passing current in one direction. 

25.2 Poole-Frenkel transport 

The fact that the illumination of an electronic semiconductor results in an additional increase of the conductivity independent of£ shows 
that the increase of electrical conductivity in intense fields is due to the increase of the number of free electrons, and not of their mobility ...In 
an external field £ this energy is further decreased by a mechanism similar to that of the Schottky effect in the thermoelectronic emission 
from metals. 

- Jakov Ilich Frenkel [332] 

Charged inclusions into the crystal structure of semiconductors like silicon [333], Ai^Ga^AsHe, and Ge:Hg [334], or 
undopted chemical vapor deposition (CVD) diamond [335] can mimic the ionic potentials considered in Section 20 on 

1 Plato, Parmenides (trans. Benjamin Jowett). Great Books of the Western World, ed. Robert Maynard Hutchins, Vol. 7, Plato. Chicago: Encyclopaedia 
Britannica, 1952, p. 489. 
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Position 


Figure 25.1 Various mechanisms by which an electron attached to a charged impurity can escape by "hopping" from core to core: thermal-like emission 
over the barrier (Poole-Frenkel), tunneling through the barrier (direct tunneling), and interactions with lattice vibrations until tunneling occurs 
(phonon-assisted tunneling) (based on Figure 1 of ref. [334]). The line is modeled after "32" of Figure 25.2. 


breakdown. When a sufficient number of charged inclusions 2 are present, then electrons can "hop" from one ion core 
potential to another by a variety of means suggested in Figure 25.1: thermally assisted electron hopping (Poole-Frenkel 
emission), direct tunneling, or interactions with lattice vibrations (phonon-assisted tunneling) which temporarily excite 
the electron energy above the rim of the ion potential barrier. 

The type of mechanism responsible for the hopping leaves its dirty little fingerprints on the behavior of the 
current-voltage (TV) relationships that arise. Start by considering emission from the CVD diamond films grown and 
examined by Gohl et al. [335]: if the metal-insulator barrier (perhaps somewhat like that shown in Figure 24.14) 
dictated the behavior of the electron emission from thin (3.2-27.5 pm) films deposited onto a metal, then the current 
as a function of bias voltage should have behaved as In (I/V 2 ) and l/V. That was not observed. Rather, the current was 
better explained by a Poole-Frenkel conduction mechanism. It is common (but not necessarily correct or even a good 
idea) to represent the ion cores as being equi-spaced so that the potential along the z coordinate normal to the interface 
(or surface) is modeled by (in cylindrical coordinates (p,z)) 

Vip, z)=V 0 -Fz- A ^Y J e ~~ (25.2) 

rj ={ P 2 + (j a -z) 2 } 112 (25.3) 

where k is the screening constant familiar from Section 21.7, and a is the average separation between the trap centers 
(e.g., the representative trap density of p t = 7.1 x 10 17 cm -3 = a - 3 suggested in ref. [335] for CVD diamond corresponds 
to a = 5.2 nm). Choosing the z origin in the center of the layer is convenient for some of the numerics below, so that 
there are N traps to either side of the origin. And lastly, about that cylindrical coordinate factor of p: it would be zero if 
the z axis passed directly through the trap core centers, but for graphing purposes it does not hurt to let p be arbitrarily 
small but finite so that singularities are not encountered. 

If the energy levels of the trap centers are such that the barriers that electrons have to cross are high and wide, then 
hopping will have a thermionic-like (over the barrier) characteristic, and current will then be the difference between 
left-hopping and right-hopping electrons. In the absence of a field, these two processes are equal, but when a field is 
present, as in Figure 25.2, then the barriers to the right are slightly lower than those to the left, so that a net current 
is possible. Assuming the occupation of the energy levels conforms to a Maxwell-Boltzmann distribution, and that the 
activation energy (amount of energy the electron must acquire to escape the well when the field F is zero) is E a , then 
the energy difference between the right and left barriers confining the electron to the core can be calculated. Let the 


2 Charged inclusions make the discussion easier by using terminology familiar to those who have endured earlier sections and "ion cores", but 
effects can also occur for neutral inclusions. 
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z [nm] 


Figure 25.2 The ion core potentials for diamond-like parameters suggested by ref. [335] (K s = 5.7, core-to-core separation = 5.2 nm, corresponding to a 
trap density of about 7.1 x 10 18 cm -3 , and screening k = 0.769 nm -1 ) for various representative (albeit low) fields in units of [eV/|im]) calculated 
according to Eq. (25.3). 


location of the barrier maximum be a distance l to the right of the core center (location of the hump in Figure 25.2): the 
presumed simplicity of Eq. (25.3) then indicates that the energy difference is 

A ®=v<j>,-a + t)-V(p,t) = Fa+^h^-^^) (25.4) 

where r 2 = p 2 + (Na + I) 2 and r 2 =p 2 + ((IV + 1 )a -l) 2 . In either the limit is that N becomes large or the field is weak so 
that l —► a/2, then r L m r R , leading to AO -*■ Fa. Although N does not need to be that large (Figure 25.2 does not visually 
change much if N is changed from 600 to 6 for the parameters of the figure) saying it is has other uses: if the number of 
trap centers to the left and right differ, then N L does not need to even equal N R if both are large for the correction term 
to Fa to be small. The point is, for an electron in the center trap, the barrier to the left appears to be E a + AO/2 and the 
barrier to the right appears to be E a - AO/2: the difference between the left and right barrier heights gives rise to a net 
hopping current. Of course, such an aesthetic model presumes too much by pretending the trap centers are uniformly 
spaced, but it allows for a simple understanding and a means of contrasting Poole-Frenkel (or thermal) hopping here 
with tunneling hopping next. 

Borrowing from methods familiar from derivation of the RLD equation in Section 12, the occupation of the trap states 
is taken to be Maxwell-Boltzmann distributed, and the current, such as it is, 3 is most easily approached in cylindrical 
coordinates (k p , k z ), so that 

J vf = N c f 2Trk.dk.e-^ f —dk z D(E z )e~ fiE '’ (25.5) 

(2ny J 0 J 0 m 

where the total energy E = E p + E z = (h 2 /2m)(k 2 + k 2 ), N c is a normalization constant that sweeps up any straggling 
material-dependent parameters (it can, for example, be found by specifying the number density, an integral resembling 
J p f apart from a factor of q and hk z /m), and the integrals in cylindrical coordinates fall into easily separated terms that 
can be integrated parts by virtue of the Maxwell-Boltzmann distribution. The first integral is straightforward: 

f“ 2nk p dk p e-t> E ’ = ^ (25.6) 

When the transmission probability D(E) is a step function, the second integral is almost as easy. But now the current 
traveling to the left must be subtracted from that traveling to the right, and so 

JJ ^ dk z {D R (E z ) - D,(E z )}e-^ = ^ {ffr - e ~^) ( 25 . 7 ) 


3 It is not current in the usual sense: rather, the electrons occupy various energy levels within the traps due to their interactions with the thermal 
agitations of the host lattice. It would therefore be better to speak of a collision frequency and an escape probability for a charge q in a well, but 
if one does so, the concept of flux, and therefore current, is not far behind anyway. 
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where the shorthand <t> ;r = E a ± AO/2 is used. Invoking the definition of the sinh function and combining all terms, 

j = 2 ^E_ N (k K T) 2 e - l,E ° sinh (pFa/2) (25.8) 

" in 3 

Observe that the quantity {J pf /T 2 )e^ E ‘ is a function of (IF and therefore should have the same behavior for different temperatures , 4 
that is, a plot of {J pf /T 2 )e^ E ‘ vs pE a for different temperatures will lie along the same line. 

Finding how such a prediction plays out is pleasing to behold: consider, therefore, the measurements of Gohl et al., 
who examined undoped very thin diamond films (3.2-27.5 pm) grown epitaxially on silicon and measured using a field 
emission scanning microscope (doing so allowed the examination of pm-scale areas). High (354.2 IC) and low (300.1 IC) 
temperature measurements of the current, for which data extracted from Figure 4 of ref. [335] are shown in Figure 25.3 
(intermediate measurements were also presented, but the argument is adequately made with the limiting cases). For 
the x axis, convert the field in the diamond to units of [eV/nm] by multiplication of 0.001 to bring [1/pm] to [1/nm], 
For the y(x) axis, construct y = In (J pf /T 2 ) + pE a , where E a is chosen so as to make the lines overlap as in Figure 25.4, 
(the value found, E a = 0.43 eV, is the same as that found in ref. [335]). The scaled data then overlaps as suggested 



F buik [eV/pm] 


Figure 25.3 Current vs field for a 6.7 pm diamond sample for T = 354.2 K (red drcles) and 300.1 K (blue squares) based on Figure 4 of ref. [335]. 



Figure 25.4 The data of Figure 25.3 recast by subtracting pE a from ln(//T 2 ) and adjusting E a until lines overlap, based on Figure 5 of ref. [335]. A value 
of E a = 0.43 eV is found to give good overlap. Units of the x axis are [1/nm] for F in [eV/nm]. Solid red circles are those points in Figure 25.3 in the 
gray shaded area signaling the onset of tunneling. 


^Compare Eq. (2) of ref. [335] and the discussion surrounding it. 
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by Eq. (25.8). The deviation of the higher points of the 354.2 K data set are attributed to the onset of a tunneling 
contribution, where the potential barrier has become sufficiently thin that electrons can tunnel out of the traps in addition 
to jumping over them. This allowed Gohl et al. to conclude that field emission from thick undoped CVD diamond films 
was mediated by the Poole-Frenkel mechanism of conduction rather than a Fowler-Nordheim-like current through an 
interface barrier (between the substrate and diamond or between the diamond and vacuum), as had been speculated 
prior to their demonstration [335], 


25.3 Tunneling conduction 


After much effort 1 succeeded in “proving " this theorem on the basis of the similarity of triangles... 


- Albert Einstein 5 


The high field data of Figure 25.3 showed evidence of a tunneling contribution when the internal field is sufficiently 
strong, wherein the electrons may start passing through the barrier in addition to hopping over it. Tunneling transport 
has a different signature than over the barrier (or Poole-Frenkel) transport, and therefore monitoring the dependence 
of the emission rate on F shows it. The average distance of 5.2 nm between the trap centers for the diamond studies 
of Gohl et al., was fairly small. Compare that to Ganichev et al. [334], who examined the emission rate associated with 
several deep-defects in semiconductors, for which charged mercury atoms in germanium (or Ge:Elg) at a concentration 
of 10 14 - 10 15 cm -3 is closer to an average separation of 100-200 nm. Under such conditions, the Coulomb potential 
2Qe~ kr /K s r of a trap is similar to the familiar Q/x potential associated with the image charge barrier, in the presence of 
which a field acts to reduce the barrier by a factor y/4QF. Consequently, the ionization probability w(F) of the trap goes 
as the degree to which the barrier is reduced, or for small fields. 


In 


f w(F) \ 

\ 0 ) / 


OcP’l/2 


(25.9) 


as shown in Figure 25.5. Frenkel [332] estimated the coefficient of proportionality using a method similar to that which 
gives the Schottky factor y/4QF for the image charge barrier encountered in Eq. (11.30). Paraphrased, it suggests the 
potential near the ion behaves something like 

V{r,9) = V 0 — Fr cos 6 - exp(-kr) (25.10) 



„ ... -s ro(F)) measured at two 

frequencies of the electric field. The low-field behavior goes as \fF and is indicative of Poole-Frenkel transport. The high-field behavior goes as F 2 and 
re of phonon-assisted tunneling. Based on Figure 4 of ref. [334]. 


5 Einstein was speaking in reference to proving the Pythagorean theorem when he was 12 years old, as quoted in W. Isaacson, Einstein: His Life 
and Universe, Simon & Schuster, New York, 2008, p. 17. The popular canard that Einstein failed math as a young student is hokum: he always 
received top grades in the subject, and before he was 15 he had mastered calculus. 
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where K s is the dielectric constant, k is the screening factor, and V 0 is a constant. The maximum of the barrier radially 
away from the core center is then found by d r V = 0: it is smallest in the forward direction 0 -*■ 0 and, if the screening 
factor is small, then the maximum lies at r 0 = (2Q/K S F) 1 ^ 2 . At that point, therefore, the potential has dropped by an 
amount 



(25.11) 


The number of free electrons due to thermal ionization in the absence of a field goes as exp(-^V 0 /2), and is associated 
with a conductivity of a 0 . If the barrier lowers by Ay, then the conductivity increases to a = a 0 exp(/?Ay/2), and so 
Eq. (25.9) is suggested to really be 



(25.12) 


The relation is about a factor of 2 off, a consequence of the multidimensional nature of the emission of the carriers from 
the traps [334], The power (1/2) that F is raised to, though, is seen in the low field regime of Figure 25.5. 

The high-field behavior of the same figure is another matter: the change of the power from p = 1 /2 to something bigger 
is a signal that over-the-barrier transport is yielding to phonon-assisted tunneling in Figure 25.1. Why that "something 
bigger" power is somewhere between 1 < p < 2 can be understood by an argument similar to Albert Einstein's youthful 
and innovative proof of the Pythagorean theorem when he was entering his teen years. Consider the right triangles 
shown in Figure 25.6. The area of a right triangle is proportional to the square of its hypotenuse (if the hypotenuse 
doubles, then the area increases fourfold, for example), or C = ac 2 , where a is the constant of proportionality. 6 But the 
triangles A and B have the same constant of proportionality, or A = aa 2 and B = ab 2 . Therefore, because A + B = C, it 
follows a 2 + b 2 = c 2 . With a little bit of contemplation, one sees that if each of the legs a and b scale with respect to a 
third factor x, then the area of C scales with x 2 , but a can be called the "height" of C, and b its "length". In short, scaling 
arguments and powers can be related. 

Now return to Figure 25.2 with an eye towards understanding how the Gamow factors associated with them will 
behave: compared to the almost field-free case of the dark blue line (2), the higher field lines look increasingly triangular, 
such as the red line (32). If tunneling is occurring from one well region to the next, then the Gamow factor 6 associated 
with the tunneling is going to mimic that behavior. Scaling, as in Section 11.2, suggests 0 goes as the product of a 
shape factor, a height k 0 (where the integrand tops out), and a length L (the distance between the integration limits of 
the Gamow factor), the latter two of which are field dependent. Therefore, the difference of the Gamow factors 6 0 - 0 
(where 9 0 is the field-free case) will behave approximately as F when the field is weaker and L « a. The question then 
becomes, how does it change when the field is strong enough so that L reduces as F increases? 



Figure 25.6 The triangles A, B, and C are right triangles, with hypotenuses a, b, and c, 
respectively. The area of triangle C is clearly the sum of the areas A + B (being the lower half of a 
rectangle with a diagonal of c). 


6 If 0 is the smallest interior angle of the right triangle, then a = sin(20)/2, as can be shown by elementary means, but the value of a Is not 
important for the argument. The same factor is found for the triangles A and B, because the triangles are similar. 
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Figure 25.7 Behavior of the integrands of the Gamow factor 0(£) and 0„(E). Left, gray (0.00) is U{x)-, color is for V(x ) for increasing fields (in jumps of 40 
eV/pm). Right, the difference \/U(x) - y' V(x) for the color lines of the left. For "40" small increases in F increases the height, but by "80", increases in F 
affect both height and length. 


Using arguments similar to those leading to Eq. (25.4) via Eq. (25.3), let U{r) be the same as V(r) in Eq. (25.3) but with 
F = 0, or 


U(J>,Z) = V 0 -^ £ 

r } = [p 2 + (ja - z) 2 } 1 ' 2 


(25.13) 

(25.14) 


Thus, it will be that 9 0 is evaluated using U(r), and 9 using V(r ): the integrands of 9 0 and 9 are shown on the left of 
Figure 25.7, with the gray line (labeled "0.00") being the line associated with U and the color lines being those associated 
with V at ever higher fields (the labeling being in [eV/pm]). For circumstances in which the transmission probability 
behaves as D(E) a exp {-9(E)), then the increased rate of jumping will go as the ratio of the transmission probabilities 
(recall Eq. (25.9)) 

(25 ' 15) 

where E is understood as the energy of the level in the Coulomb potential region such that the activation energy (distance 
from E to the top of the barrier) is equivalent to E a = V 0 in the present treatment. Assuming the cores line up just so, 
so that p -* 0 makes the potential along z look like Figure 25.2, then the problem has again the look and feel of a 
one-dimensional potential barrier problem, and so the indolent urge to go back to speaking of U(x) and V(x) can be 
indulged. The argument of the exponent becomes 


yfim [*■ 

°°- e = 2 -irL 


dx 


(25.16) 


Rewrite the integral as 


^ + (Vuw - Vn*)) dx = FL J - 


(25.17) 


by virtue of the definitions of U and V, where x = x_+sL and L = x + —x_ is the length of the integration region over 
which the integrand is positive, an approach suggested by the shape factor approach of Section 11.2. If L ~ F v , then 
9 0 - 9 ~ F p+ \ but given the quirky behavior demonstrated in Figure 25.7, a numerical approach teases out the behavior 
more clearly. 

Consider parameters, therefore, that are (nominally) motivated by the experimental data of Figure 25.5 in the case 
of mercury (Hg) in germanium (Ge). Evaluate Eq. (25.17) for the values a = 100 nm and K s = 16.2 (note that such a 
large value of a means that N in Eq. (25.14) can be kept small 7 ). Also let V 0 = 0.3 eV and k = 0.769 1/nm, For low fields, 
something like 0.1 eV/pm, then the difference between 9 0 and 9 (as seen in Figure 25.8) will be slight: La a and the 
height of the integrand will go as V 0 - \j 1 6QF/K S , making A 9 = 9 0 — 9 ~ F 1 / 2 . At higher fields, something like 3 eV/pm, 
both the height and length of the integrand exhibit field dependence, and so a quadratic nature to A 9 becomes evident. 


7 N -» 0 is not bad, although N = 5 is used for the figures. 
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Figure 25.9 The integrand of A0 = 6 0 - 0 as a function of field for the parameters of Figure 25.8 for C = h/2^j2m n to remove the dependence on 
effective mass and ft (the result is simply the integration of \fu - \fv). The right-hand figure shows the low-field behavior vs \fp. The "Hi Fit" is given 
by 9.034 + 29.18F 2 ; the "Low Fit" is given by 3.165F 1 / 2 . 


These expectations are clearly seen in the numerical evaluation shown in Figure 25.9 when the numerical evaluations 
of Eq. (25.17) are compared to 9.034 + 29.1&F 2 for the high-field fit, and 3.165 F l/2 for the low-field fit (the fits being 
found numerically). Observe how rapidly the low fit diverges from the numerical values as the field dependence of L 
comes into play as suggested by the triangle arguments near Figure 25.6. Similarly, the quadratic nature of the high-field 
behavior does not continue unabated: it only holds until F increases to the point that 6 becomes small. 

Naturally, the problem is even more vigorously complicated by the behavior of the energy levels in the wells, and their 
interactions with the lattice vibrations that are always present - ref. [333] gives a flavor of how rapidly the complexity 
escalates when physics is added. But simple models motivated by methods familiar from the general thermal-field models 
of Chapter 17 lead to expectations that are approximately and pleasantly realized. 


25.4 Resonant tunneling in field emission 

...the stupider one is, the closer one is to reality. The stupider one is, the clearer one is. Stupidity is brief and artless, while intelligence 
wriggles and hides itself. 

- Fyodor Dostoyevsky 8 

For metal to semiconductor interfaces, following the thermal hopping of Poole-Frenkel and the tunneling of hopping 
conduction, one last configuration invites examination: resonant tunneling as through a Si0 2 layer atop a metal (or semi¬ 
conductor) or, even more interesting, an Si0 2 -Si-Si0 2 sequence of layers that creates a well [259] as schematically shown 
in Figure 25.10. There are similarities to the resonant-like and hopping-like effects of atoms atop field emitters [258] or 
charged centers in the oxides which cover them [336], Such territory is familiar from Chapter 20 on the Paschen curve, 
or Section 19.2.4 on resonant tunneling diodes (RTDs): insofar as the rightmost configuration of Figure 25.10 resembles 


8 Fyodor Dostoyevsky, The Brothers Karamazov (trans. Constance Garnett), Great Books of the Western World, ed. Robert Maynard Hutchins, Vol. 
52, Dostoyevsky. Chicago: Encyclopaedia Britannica, 1952, p. 121. 








Contacts, conduction, and current 337 




Figure 25.11 Measured current versus field through a multilayer cathode (MLC) structure (see Figure 25.10), based on Figure 14a of ref. [259], for 
which the hump was attributed to resonant tunneling effects. 


an RTD, Litovchenko et al. refer it as a multilayer cathode (MLC), for which the J(F) relation shown in Figure 25.11 shows 
an odd bump in an otherwise exponentially increasing current density as the field increases. But there are differences 
between the resonant field emission problem and what has come before: the right-hand boundary is vacuum and subject 
to a constant field, and there are no incident electrons from it. 

Familiar or not, a proper account of the problem [259] is challenged to tease out the influences of the resonant levels 
(suggested by the dashed red lines in Figure 25.10) on the emission current, complicated by the replacement of the 
metal by silicon because of the development of an accumulation region with levels itself. The resonant levels in both the 
accumulation region and the quantum well are related to zeros of the Airy function, the determination of which builds 
character but not necessarily enlightenment: the location of the energy levels matters less than how those energy levels 
shift as a field is applied until enhanced transport is possible. In fact, dogged calculation using the methods of Chapter 19 
and fidelity to the actual potential obscures what should be important here. 

Consider, therefore, a purposefully "foolish" model 9 . In that sense, "stupidity" as apparent foolishness is instructive 
in the sense of simpleminded that exposes the consequences of resonance on field emission current density through 
layered structures or thin films. The methods also apply to the other emission mechanisms if one wants to be shrewd 
about the supply function, but as experimental data from Figure 10 of ref. [259] will be considered, follow its lead. 
Though simple, the model becomes powerful because it is easily analyzed by exploiting much what has come before, 
enabling the behavior of interest to be teased out easily. 


9 (Germaine Greer, Shakespeare, Oxford University Press. Oxford. 1986. p91). The epithet "foolish" is nowadays treated as a synonym of "stupid" 
but (at least in the time of Shakespeare or Erasmus) it was not. Fools spoke truth, especially to power, without being tortured or executed in 
the many innovative ways of the time that the more intelligent were for speaking impoliticly: King Lear's fool was not particularly humorous, 
but he retained the King's favor after speaking truthfully whereas Lear's daughter Cordelia did not. Rather, a fool is a "natural", simple as we 
say, and by extension, still in a state of nature. Our word silly is descended from an Old English word meaning "blessed", surviving still in our 
idea of blissful idiots. 
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Place, therefore, a barrier before the triangular barrier of Fowler and Nordheim such that it has the same height, and 
let the bottom of the well be at the bottom of the conduction band, that is, at the zero of the energy scale. To keep matters 
really simple, ignore effective mass variations (m n = m). Teasing out the resonances is going to be difficult unless some 
artifice is employed. As the resonances are what is under investigation, extract their behavior from D(k) as calculated 
using the transfer matrix approach of Section 19.2 by first extracting the behavior of the triangular barrier by considering 
the ration D(k)/D^(k) (where the denominator is given by Eq. (18.52)), and then remove the effects of the leading barrier 
by multiplying the ratio by e KXb following the methods of Section 19.2.4 (see also ref. [199]), or 

G(E{k)) = exp[K(k)x b ]^^ (25.18) 

Defined thus, G(E) rather effectively identifies the resonances of the quantum well: for parameters (similar to those of 
Litovchenko et al.) of a barrier width of x h = 0.5 nm and a well width of x w = 1 nm, as shown in to the left of Figure 25.12, 
increasing the field does remarkably little to budge the location of the maxima of D(k), which correspond to the resonant 
levels in the well, as shown to the right of Figure 25.12. Therefore, the bumps and blemishes on the I-V curves that 
Litovchenko calculates and measures have to be due to some other effect, and that effect is the lowering of the well in 
response to field, just as one would (or should) expect from Figure 25.10. 

In the simplest of modifications, then, let the well drop by a small amount in proportion to the field: a reasonable 
approximation is to take that amount to be how much the center of the well drops in response to the field if the field 
F/K s across the surface layer is constant, where F is the vacuum field and the bulk material is metal. This drop is 

W = -f-(*» + y) (25.19) 

if the well depth is at 0 for F = 0. For the ad hoc choice of K s = 10, then the modifications to G{k) are shown in the left 
part of Figure 25.13. The energy levels for a square (flat bottom) well depend only on the depth of the well, and so 
the expectation is naturally that the resonant peaks will move in proportion to the field, as they do in the right part of 
Figure 25.13. 

The resonant peaks follow the same Lorentzian behavior of Eq. (19.26) that RTDs do, naturally, so that the Lorentzian 
is modeled near the /th resonant peak as 


= —5 r \y(E - Ej)\ 


x j; yH E - Ej y + e 2 ~ ey 

Because the resonances slightly differ in height, a better correspondence with the simulation is achieved by allowing the 
y to vary according to y- = nj/iaEj), or 

GOD =£ 


yf(E—Ej) 2 +e 




tr of fixed height before 

a Fowler-Nordheim-like triangular barrier. All of the resonances (right) overlap, where G(k) is defined in Eq. (25.18). The parameters of V b = V 0 = 10 
eV, x w = 2x b = 1 nm, and m n = m are used, similar to but not the same as those in ref. [259]. Lines are labeled by the vacuum field F in [eV/nm]. 
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t. Now, the resonances are shifted by an amount that scales with the field. Lines ai 
:nt the analytical model of Eq. (25.21). 


(still) labeled by the vacuum field F in [eV/nm]. Black dots 


where N = 4, { rij} = [1,1,1,2], {J^}[eV] = [0.6949,1.5749,3.0238,5.0042], a = 3.2, and e = 10 -8 , parameters which all 
depend on the specifics of the potential heights and well widths and where the values were found by appropriate means. 10 

The form of the Lorentzian of the resonant level of Eq. (25.20) gives a surprisingly good match to the numerical 
evaluation of G(E) defined by Eq. (25.18), as suggested by the discussion in Section 19.2.4. Hence, the excellent fitting of 
the peaks of Figure 2 5.13 by Lorentzians is not entirely unexpected. The utility of Lorentzians 8 e (x) is due to their behaving 
much like Dirac delta functions <5(x) when e is small (Section A2.7). This allows the current density contributions from 
the resonant peaks to be almost immediately written down as 





D(Ej)(n - Ej) 


(25.22) 


whereas the non-resonant part of G(E) does not vary rapidly and so 


« G 0 e- KX >J FN (F) (25.23) 

an equation anticipated by Eq. (19.20), with G 0 ~ 10 - 100 as implied in Figure 25.13, and k = (2/h)\j2m n V b . The total 
current is then the sum of the resonant and non-resonant parts. 

The simplicity of the model constrains its predictions to being qualitative, but that is enough. The resonant current 
contributes substantially when a resonant level is accessible by the supply of electrons from the metal (or semiconductor 
accumulation layer) but when as a consequence of increasing field the resonant level moves outside the range of energies 
that overlap with the supply function, the current density consequently drops. The total current is then a combination 
of a steadily increasing non-resonant part with bumps occasioned by the contribution of the resonant parts as they move 
into and out of range, an understanding that matches Figure 25.11. 


10 In the present circumstance, "appropriate means" was visually or by hand. It does not have to be done thus, but since only a few Ej were 
required, doing so was simple. Sometimes eyeballing graphs is close enough. 
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Electron density near barriers 


Wiseman John: As thou removest the Book from its cradle, you must recite these words. Clatoo, Verata, Nieto. 
Ash: Clatto Verata Nieto. Okay. 

Wiseman John: Repeat them... 

Ash: l got it. I got it. I know your damn words. All right? 


- The Army of Darkness 1 


Using words without understanding their significance invites trouble. Accepting that quantum mechanics describes 
electrons, but then ignoring their wave nature in treating the accumulation layer (as in Eq. (24.25)), causes the density at 
the barrier to be overestimated, and misses the effects of electron penetration into the barrier. As a result, errors include 
overpredicting the degree of band bending [326] or not quite accounting for the magnitude of the dipole caused by 
electron penetration into the barrier that modifies the work function [337, 338], Admittedly, calling such consequences 
"trouble" is hyperbole, but sloppy talk misses effects. Fortunately, a few simple models capture the effects of density 
variation and dipole contributions to the barrier height quite well. 

26.1 An infinite barrier 

Compared with the classical case the electrons are repelled from the surface when quantum size-effects occur. A correct description of the 
spatial variations of the potential and the electron density in accumulation and inversion layers thus needs self-consistent solutions of 
Schrodinger's and Poisson's equations. 


- Winfried Monch [213], p. 30. 


For atomically flat surfaces, an important concept is the electrical surface, which is “the plane where the field appears to start". In basic 
models the electrical surface coincides with the classical image-plane, but in principle the concepts are different. 


- Richard Forbes [339] 


That density variations that occur near a barrier are caused by the wave nature of the electron should not be a surprise: 
the variations were already evident in the simple models of Section 9.2.3. What is new is that the accumulation region is 
a well (recall Figure 24.13) with energy levels, and the wave function of those levels determines the behavior of density 
near the interface. The simplest model to compliment the step potential model of Section 9.2.3 is therefore to let the 
well region for 0 < x < L be at zero potential, and the walls to be infinitely high so that y/ k ( 0) = y/ k (L) = 0, so as to first 
appreciate the consequences of the contribution of discrete states. 

The well with infinite barriers is one of the most familiar problems of quantum mechanics [43]. Let the sides of the 
well be such that the left side is at x = -L and the right at x = 0. The usual y/ k (x) = e ,kx + r(k)e~ lkx can only vanish at x = 0 
(i.e„ y/(0) = 0) if r(k) = -1 for all k. Similarly, ip k (-L) = 0 only if k is quantized such that kj =jjr/L for integer;', and so 
the eigenstates are 2 



(26.1) 


'Sam Raimi and Ivan Raimi, screenplay to The Army of Darkness, shooting script February 26, 1991. The phrase Ash speaks is a homage to the 
cautionary science fiction classic The Day the Earth Stood Still (1951) made at the dawn of the Cold War era. Although Ash's version is altered 
from the original "IClaatu barada nikto" and a good deal less serious, in both films getting the words right mattered. 

2 The usage of {/] may seem odd, but the intent is not to create confusion with (kj\. It is seen that {/] = (kj\ - (-kj |. 


Introduction to the Physics of Electron Emission, First Edition. Kevin L. Jensen. 
© 2018 John Wiley & Sons Ltd. Published 2018 by John Wiley & Sons Ltd. 
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where N is the normalization constant from the requirement that 

1 = f' \(j\x)\ 2 dx = N 2 Ji sin 2 ) dx = ±LN 2 (26.2) 

or N = yJl/L. If Cj is the weight for the state y/j(x), then density is 

N 

P=Y,C 2 \{j\x)\ 2 (26.3) 

written so because the C ; can be chosen to be real. 

That this reproduces the results of Section 9.2.3 is easy to show by starting from the continuum formulation introduced 
there and continued in Section 11.1. There, for a metal surface at zero temperature, the density is found by summing 
over all momenta up to the maximum of k F , and, for the moment, the wave nature of the electron is ignored, so that 
(recall Eq. (6.24)) 



which is nothing more than Eq. (7.3) for a Fermi distribution. Next, the effects of inserting an infinite barrier at x = 0 result 
in setting the reflection coefficient to be r = -1 (no current is possible), and so y/ k (x) = 2~ 1 * 2 (e lkx - e~ ,kx ) = i\jl sin(kx), 
which satisfies y/ k (0) = 0. Now respecting the wave nature makes the replacement/(k) -*■ f(k)\y/ k (x)\ 2 . That, plus using 
the identity 2sin 2 (kx) = 1 - cos(2kx), results in the density p(x) being given by 

P e = 2^ f * 2 ) (1 “ cos ( 2kx ))dk = p 0 £(2k p x) (26.5) 

where the function f(s) has been introduced and is [188] 

{Ws {l + 3 £2p-35Ei} (26.6, 

Because s = 2k P x is dimensionless, many different density configurations give rise to the same oscillatory behavior near 
the walls, where p/p 0 « s 2 /10 for s« 1. The oscillations near the origin, seen in Figure 26.1 and all too familiar from 
metal surface studies [340, 341], are known as Friedel oscillations [85, 342, 343], and such oscillations also arise in how 
electron density screens charged inclusions [97] and vacancies [78], 

Net charge, or the difference between the electron p e (x) and the background positive charge p, = p 0 , where the variation 
is only in x, corresponds to sheets of charge, and those sheets give rise to fields, as per Eq. (16.3). A foreboding that the 
undulations are going to complicate the determination of a self-consistent potential or change the surface barrier looms: 
right from the start, the origin for the electrons (where x = 0) cannot be where the background positive charge ends and 
vacuum begins if overall charge neutrality is required. That is, the absence of electrons right next to the barrier itself 
would lead to quite a slab of unbalanced positive charge there. Therefore, the edge of the background positive charge 



Figure 26.1 Friedel oscillations in the electron density p near an infinite barrier for x > 0, where p c = *J/3w 2 and Eq. (26.5) is used where s = 2k r x. 
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ends at x = x,- < 0. Where the electric field starts, and where background positive charge ends, need not be at the same 
spot, a simple statement masking a complicated circumstance [339, 344, 345], and affected by the degree of penetration 
of electron wave functions into the barrier. 

A metal surface not exposed to a field will be uncharged, regardless of whether the electron density wiggles and the 
ion density does not. This can be expressed as 



(26.7) 


where the Heaviside step function 0 term is attached to the background positive charge term. Invoking Eq. (26.6), the 
relation is alternately expressed as 



(26.8) 


As is typical, solving the integral on the left-hand side can be done a hard way or an easy way. The hard way is edifying 
for techniques, useful later, that it used, so consider it first and use series expansion methods [188]. The function £(s) is 
equivalent to 



(26.9) 


That is fairly convenient: it entails, for example, that for Sj = -2 k j, then 3 * 



(26.10) 


Such a relation suggests that the infinite integral be replaced by an infinite sum over integrals over a region where s 
varies by only 2 n, or 



(26.11) 


The j = 0 term is primary, being adjacent to the infinite barrier: call it N P . The remainder will be called N R . A quick 
integration by parts on each, followed by Taylor expanding the trigonometric functions, results in 



(26.12) 


where (the form of the integration makes use of £(s) being an even function) 



(26.13) 


and 



(x 6 - 960) ( n 2 - 6) 



(26.14) 


+ ... 


48 x 5 


where the denominator in N R is Taylor expanded and Eqs (A2.8) and(A1.8) are used to show (mind the lower limit!) 


g (2/ + l) -p = (1 - 2~ p ) f (p) 


(26.15) 


The series are summable numerically to find N P + N R = -0.7854, a wholly uninspiring result until one realizes in a flash 
of insight that that number is the same as —ic/A. Clearly something is going on. 


3 If one balks at the tediousness of carrying the - sign about, one can ignore it by considering positive s and observing that |(-s) = |(s) is an 

even function. 
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Figure 26.2 Electron density v. 

begins at x = x v Thin white lines show Sj = -2 tcj. Also included is a grey line showing the hyperbolic tangent approximation of Eq. (26.52). Based on 
Figure 4 of ref. [188]. 

The easy approach has piquancy by comparison. Approach the integral using the calculus of residues [60]. Observe 
first that the integrand in Eq. (26.8) is even, so the limits can be extended to ±oo by including a factor of (1/2). Next, 
observe 

1 r \ 1 ( /" 00 glS( j ^ 

(26.16) 


s and is of order n = 3 (hence the 3 oi 


The pole s = 0 lies on the real axis and is of order n = 3 (hence the 3 on S n ). Therefore, the sum of residues elegantly 
results in ^ 

(26.17) 

Using either method, the location of the onset of the background positive charge necessary to balance the electron 
charge is found to be 

(26.18) 

for a single infinite barrier at x = 0. The constant background positive charge is overlaid on the electron density variation 
in Figure 26.2. 


26.2 Two infinite barriers 

.. .for there is nothing either good or had hut thinking makes it so...I could he hounded in a nutshell and count myself a king of infinite 
space... 

- William Shakespeare 4 

For what is to come, having a discrete model is useful. The discreteness is imposed by the second boundary condition 
that i p k (—L) = 0 (considering the well to the left of the origin continues the convention of having the metal to the left of the 
metal/vacuum interface) regains Eq. (26.1), with C 2 +4 (n 2 - j 2 ) in Eq. (26.3). The next example explores its connection 
with the usual continuum model. 


EXAMPLE: Find the discrete approximation to p 0 using the composite modified trapezoidal rule of Section Al.2.1. 
Compare it to the usual p 0 = k\/3n 2 . 

SOLUTION: Let k -f kj = jn/L for a length scale L. At zero temperature, the Fermi level k F corresponds to 
the maximum wave number, and so let k p = nn jL, which serves to define n. Thus, dk -*■ (n/L)Aj, but A j = 1. 


4 Ref. [37]: Hamlet, n.2.247-253. 
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Putting the pieces together, 

'■ “ (| ~ f) ~ t) = (f” s “ l") 

where the summation uses Eq. (A1.2) (mind the limits) and where the subtraction of the factor n 2 /2 in the second 
line is a consequence of the/ 0 term in Eq. (A1.10). Observe that kp/(3ju 2 ) = nir"/T>L i after the change of terms, so 
the approximation is good as n becomes large. 


The quantization condition imposed by y/ k (x = -L ) = 0 therefore parallels the equi-spaced kj in transitioning to a series 
approximation of the continuum integral. Without further ado, therefore, the quantized level form of p becomes, 5 

j « ^ X (« 2 -f) sin 2 (AJ) (26.19) 

where 2 n (x) = nnx/NL and n = k p L/n. The expectations borne of the continuum example and the appreciation of the 
quantization conditions lead to two interesting consequences. The first is that some vestige of Eq. (26.5) will persist near 
the wall's edge and, second, that some scaling relation for the discrete case will render the behavior of the densities visibly 
similar even as they are different in magnitude. This will be seen even if n is not large, that is, even if there are only a small 
number of energy levels involved. 

Consider an L so small that there are only a few energy levels that can be squeezed into the well before k p is exceeded. 
The density profile for several of them is shown in Figure 26.3 based on the numerical evaluaton of Eq. (26.19): in 
the center region, the rippled nature is increasingly suppressed, even as the oscillations near the well walls seem to be 
confined to an ever smaller region near the walls; to those familiar with the Gibbs phenomenon and Fourier transforms 
[60], this is expected. Analogous to the dependence the density variations on s in Eq. (26.5), the densities for small n 
are similar when the x axis is expressed in terms of k F x tx nx/L, as in Figure 26.4. Thus, n does not have to be very large 
before conditions near the barrier atx = 0 look suspiciously close to the L -* —oo limit. 

Why does this matter? Simply, because insofar as accumulation regions near a barrier can be (very) crudely modeled 
as a well region, the electrochemical potential model surrounding Eq. (24.20) indicates that band bending is not going 
quite the way that Figure 24.4 might suggest, namely, with an exponential shape. Rather, because the density drops to 
zero rather than increasing right up to the barrier, the quantum mechanical nature of the electron causes the potential 
drop to not be as great as if the density were increasing right up to the barrier's edge. To say more requires a triangular 
well model. 



x/L 


Figure 26.3 The density of Eq. (26.19) for several k p = mt/L characterized by low values of n. Observe that larger n flattens the middle but gives rise to 
behavior at the well walls that seems increasingly compressed into a region that becomes smaller as n becomes larger. 


5 The same result is obtained if the incantations of quantum mechanics are properly expressed, but keep in mind that the quick end-run here 
was a direct consequence of the assumption of infinite barriers: were the barriers finite, the kj levels would not be equi-spaced, and the analogy 
would fail. Getting the "words" right (so to speak) does matter, however convenient the analogy otherwise appears. 
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nx/L 

Figure 26.4 Same as Figure 26.3 but only for the largest and smallest n shown there, and for the x axis scaled by n. Observe how rapidly the oscillations 
adopt the behavior of Figure 26.1. 


26.3 A triangular well 


The simple analysis based on square (flat bottom) wells with infinite walls is suggestive but limited because accumulation 
layers do not resemble flat square wells so much as triangular ones. The energy levels are going to be very different. 
Fortunately, though, the methodology is much the same: find Airy wave functions such that yq.(0) still vanishes. The 
zeros of the Airy function then determine the spacing of the energy levels within the triangular well region. The potential 
is defined by 

no¬ 
where there is no harm in emphasizing that V(x) increases as x becomes more negative, because F = h 2 f/2m is taken as 
a positive number. 

But which Airy function, Ai(w) or Bi(iu) or a combination? Consider the argument w. In Section 18.2, w(pc) is obtained 
by setting the barrier height to 0 at x = 0”, thereby setting k 0 -*■ 0 in Eq. (18.20). The argument of the Airy function is 6 

w(x) = -/“ 2/3 (k 2 +/x) (26.21) 


f -Fx (x < 0) 
I oo (x > 0) 


Thus, near the origin, w(x) is negative and the Airy function oscillatory, but as x becomes more negative, w(x) becomes 
positive and increasingly large, for which Bi(iu) exponentially grows as Ai(ie) decays. Therefore, the boundary condition 
that i/q.(x) -* 0 as x -* —oo dictates that Ai[ie(x)] is the solution, but not every Ai(w), just the ones for which w(fi) = w n 
such that Ai(iu„) = 0. 

There are a few ways to find the w n zeros, the most straightforward being to look them up in ref. [103] if one is content 
to know only the first ten of them. The more stimulating approach is to fiddle with Eqs (18.22) and (18.24) to convince 
oneself that a crude approximation is w n « —[3x(4n - 1)/8| 2 / 3 . With effort, a better approximation is 

K = Y ( 4 « - 1 ) 


I-~2 'j 

48A 2 n ) 


(26.22) 


Finally, if accuracy is an obsession, then start with either the crude approximation o 
Newtonian iteration, by which 


Ai \w n ) 


a few rounds of 
(26.23) 


(where, recall, the prime indicates derivative with respect to argument) is sufficient to produce a very accurate approx¬ 
imation to w n . An algorithm to produce the w n is given in Section A3.19, the results of which are shown in Table 26.1. 


6 The present treatment is not a tunneling problem, and so using w(x) instead of (?z(x) is easier as w(x) can change sign, simplifying the discussion. 
Nevertheless, the notational conventions of Section 18.2 are retained, such as the definition of/. 
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Table 26.1 Zeros of the Airy function Ai (w): simple approximation to w n (Eq. (26.22)) and exact w n 
(application of Eq. (26.23)). As n increases, agreement improves. 


n Simple Exact n Simple Exact 


-2.338107347 
-4.087949276 
-5.520559788 
-6.786707878 
-7.944133759 
-9.022650719 
-10.04017448 
-11.00852394 
-11.93601513 
-12.82877636 


-2.339599848 
-4.088062286 
-5.520586014 
-6.786717415 
-7.944138050 
-9.022653580 
-10.04017544 
-11.00852489 
-11.93601608 
-12.82877731 


16 

17 

18 

19 

20 


-13.69148922 
-14.52783012 
-15.34075546 
-16.13268471 
-16.90563393 
-17.66130066 
-18.40113258 
-19.12638092 
-19.83812904 
-20.53733253 


-13.69148922 

-14.52783012 

-15.34075546 

-16.13268471 

-16.90563393 

-17.66130066 

-18.40113258 

-19.12638092 

-19.83812904 

-20.53733253 


Clearly, then, the well wave functions are ys„(x) = N n Ai[w n (x)\, where w n (x) is given by Eq. (26.21) for k -* k n such 
that w n (0) is now the zero of the Airy function, showing that = —f 2 / 3 w n ( 0) (it is positive because w n (0) is negative). 
Finding the normalization N n is found by demanding 


1=1 N*J Ai [w n (x)] 2 dx = N*f~ l/3 J Ai (wfdw 


(26.24) 


where w n is shorthand for w n ( 0). As bad as the integral looks, a bit of cleverness induces it to reveal itself. Perform an 
integration by parts to obtain 7 


Ai (w) 2 dw = w Ai(ic)’ 


ii.-/: 


Ai (ic)Ai' ( w)dw 


(26.25) 


The first term vanishes by definition of a "zero", and Ai(oo) = 0. The second term benefits from the relation w A\(w) = 
Ai”iw) by virtue of Airy's differential equation, and so 


2 J wA\A\'dw = — J ^-^-(Ai') 2 ^ dw = Ai'(w n ) 2 


(26.26) 


where for compactness the argument 
the normalized eigenfunctions are 


left off Ai' and Ai in the integrals, and where Ai'(oo) = 0 as well. Consequently, 


¥„(x)=f l/6 


Ai (tt>„(x)) 
|AiV„(0))| 


(26.27) 


an elegantly simple result. The denouement, though, is how the density behaves. Let 


p„(x) cx S = X (" 2 


A,(w n -f'/'x) 

Ai'(w n ) 


(26.28) 


The behavior of S is shown in Figure 26.5. Why is S scaled by a factor of n 2 ? In three dimensions, density goes as n 3 , as 
in Eq. (26.19). The density for the triangular barrier is more akin to a two-dimensional gas, and therefore it is presumed 
to go as n 2 . By taking the ratio of S with n 2 , the density heights should be comparable, which, numerically, they are. As 
for the horizontal coordinate, the dimensionless measure of position in w n {x) is f l,3 x, and as with the square barrier case 
the expectation is that the S/n 2 lines will be comparable if plotted as a function oif l/3 x/n, an expectation also realized 
numerically. 

If electrons were point particles, the density would monotonically increase right up to the barrier edge, but because of 
the wave nature of the electrons, the density instead drops. This has definite consequences when the barrier is made finite 
in height (the wave functions can then penetrate the barrier as in Figure 9.11), and Poisson's equation is used to relate 
density to electrochemical potential p(x) = p + 4> because of the variation of p with p(x) (Section 24.3 and Eq. (24.21)). 


7 For brevity, if w n appears without an argument, it is taken to be the Airy function zero w n ( 0). 
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Figure 26.5 Lest, behavior of the scaled density (calculated by Eq. (26.28) but with f l/i x scaled by n. Right, behavior of the scaled density with unsealed 
position / 1/3 x. Observe that close to the barrier at x = 0 the density variation visually resembles the square well case and that the quantity f l/3 x is 
dimensionless. 



Figure 26.6 Comparison of theoretical classical and quantum predictions of electron density 
near an interface for silicon, based on Figure 6 of ref. [326], except that here the x axis has 
been shifted to the negative side to match the orientation of prior figures. 


In fact, such a calculation has been performed by Stern [326], who made a much more detailed analysis of wells at 
interfaces using properties of semiconductors that for simplicity are ignored here. Stern's findings for (100) silicon are 
reproduced in Figure 26.6 (based on Figure 6 of ref. [326]; see also Chapter 2 of Monch [213]). Although the physical 
potential well is only approximately triangular for the depletion layer considered by Stern, the qualitative shape is in 
keeping with behavior evinced in the idealized triangular well model of the present analysis. 


26.4 Density and dipole component 


When the barrier is not infinite, then wave function penetration into the barrier region can occur. To find the effect of 
electron density, consider the rectangular barrier as its transmission behavior is well understood: the reflection coefficient 
is known analytically from Eq. (18.9) with the change k -* Ik for transmission when k < k 0 . Thus, for under the barrier. 


r(k < k 0 ) = 


( k 2 + k 2 ) sinh(ieL) 

2 ikic cosh (kL) + (k 2 - k 2 ) sinh(K-L) 


(26.29) 


where k 2 = 2 m(V 0 — E)/h 2 = k 2 — k 2 and k 2 = 2 mE/h 2 . Intuitively, if the wave function penetrates the barrier, then it 
pushes a bit more into the barrier before it is turned back, and so a phase change between the incident and reflected 
waves is introduced. This can be seen by representing r(k) as 


r(k) = -R(k) exp(2i(p(k)) 


Recalling that 


= k 2 -k 2 for k < k 0 , it is straightforward to show that 


R(k) = 


1 + 


' 2 iKk )V 

, k 2 smh(icL) / J 


p . _ 2xkcosh(K-L) _ 2Kk 
{ (v 2 - k 2 ) sinb(vL) “ k 2 0 tanh(rL) 


(26.30) 


(26.31) 

(26.32) 
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where R(k) = \r(k)\ and cp are real functions. Observe that the usual limits obtain: for k 0 -* oo, then R{k) -*• 1 and cp -*• 0; 
for L -* oo, then R(k) ->■ 1 and r -*■ (x + ik)/(tc - ik) (recall Eq. (9.68) with k! -*■ ix). For finite tunneling barriers with a 
width of about a nanometer and barrier of a few electronvolts, then k 0 L is quite large (explicitly, a 1 eV barrier that is 1 nm 
thick entails k 0 L » 26 for m„ = m), so that to a very good approximation cosh(xL) « sinh(xL). As a result, tan(jci) -*■ 1 
and so cp is independent of the barrier width L to a good approximation. To leading order, therefore, 

r{k) * (26.33) 

*(*) = \r(k)\ a 1 (26.34) 

where the form of r(k) is familiar from Eq. (9.68) (a wide barrier for k < k 0 is indistinguishable from a step potential). 
For tall barriers, the onset of the barrier is what matters, not its extent (or shape), as the same conclusion can be drawn 
for barriers of different shape that are also high [188, 338, 346], For the calculation of density, the wave functions with 
smaller k are weighted more, particularly if the weighting factor for the wave functions behaves like the supply function 
f(k), as in Eq. (26.5). Consequently, for the purposes of finding electron density variation near an interface, cp » —kx g 
where x 0 = x/k 2 « \/k 0 to a good approximation. As a result, 

\{n\x)\ 2 a 1 - cos (2k p (x-x 0 )) (26.36) 

It is as though the infinite barrier density result were rigidly moved over a distance x 0 (not exactly, but pretty close) that is more 
or less constant at 1 /k 0 , that is, Eq. (26.5) is simply modified by 

p^ p^[2k p (x-x 0 )] (26.37) 


So far, temperature and field effects on the density have been ignored by considering the zero-temperature limit of 
the supply function f{k) and the infinite barrier approximation. How good these approximations are may be estimated 
by evaluating 8p{x), where 

p(x) = pJ(2k P x) + 8p(x) (26.38) 

and where p 0 t;(2k F x) = p 0 (x) so that when p 0 (x) appears with an argument, it is to be understood as the infinite barrier, 
zero-temperature solution of Eq. (26.5). Clearly, 8p must depend on both temperature and field simultaneously, but it is 
sufficient to treat each effect separately with respect to how 8p(x) modifies p a (x). 

Consider temperature first. The temperature effects treated here are a consequence of the presence of higher momen¬ 
tum components in the supply function (another temperature effect which gives rise to changes in the work function 
<t> on the order of k B T is due to the thermal expansion of the bulk material and are described in ref. [164], but are not 
considered further here). Because of the thermal tail in the supply function, higher momentum hk states are present 
and increase the density near x = 0, the maximum of which is pushed away from the barrier by a length n jk p , as in 
Figure 26.1. The shape of the density near the origin therefore has a thermal component which can be approximated as 
follows. Subtracting the infinite barrier, zero temperature integral of Eq. (26.5) from the temperature-dependent integral 
defining total density, and using the identity 


s-ln(l +e*) = ln(l +e~ s ) (26.39) 

for | = s{k) = fih 2 (kp - k 2 )/2m results in 

8p «-^sin 2 (k y x) x {J " In (l + r*®) dk- In (l + r*«) dkj (26.40) 

where sin 2 (kx) ->• sin 2 {k P x) is evaluated at the maximum of the integrand and pulled out of the integration because it is 
slowly varying by comparison to the In terms. Do a change of integration variables to s and ds/dk = ph 2 k/m -» ph 2 k p /m 
because it, too, is weakly varying by comparison to the In term. What is left is therefore 

Sp ” {wr) / 11,(1 +nds 


(26.41) 
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The integral is half of the familiar Riemann zeta function f(2)/2 = n 2 /\2, and so putting the parts together, 

P(x) _ 


= W + 2 (4) (l - cos(2k p x)) 


(26.42) 


For metals, the size of pp makes the correction term very small, so that the finite temperature density p(x) is not visually 
distinguishable from the zero-temperature result for typical temperatures and metal-like p (unless p is small, T is high, 
or both), yet another reminder that the electron gas in a metal is well-approximated by its zero temperature limit. As far 
as excitement goes, that was meager. 

Consider field next, where the prospects for exciting behavior are better [338, 347]. Instead of a rectangular barrier of 
thickness L and height V 0 , now consider a triangular barrier of height V 0 and field F. If the barrier is sufficiently high, then 
Bi(z) » Ai(z) when z = {k 2 0 - k 2 )// 2/3 = x 2 /f 2/2 (Section 18.2), so that for under the barrier, the analog to Eq. (26.29) for 
triangular barriers is 

-ikBi(z)+f 1/3 Bi'(z) 


r(k < k 0 )« 


(26.43) 


ikBi(z)+f l / 3 Bi’(z) 

Because Bi'(z)/Bi(z) « -/ _1 / 3 Di(l,z) (remember that s = -1 and c = 1 for the triangular barrier conditions under consid¬ 
eration) and 


Di(l ,z)«/ ] 


" 1 - 4z 3/2 ^ 


z J__ 

" 4jc 2 


(26.44) 


^ 4 K 2 {ik+K)-f 


K °’ 4k 2 (ik — k) +/ v ' ' 

so that |r| = 1 for these conditions as well, and as a result the effect of the barrier is to introduce a modified phase factor 
2cp'. Watch closely: if we let 

c = «-jL <*•«> 

then substitution into r gives the hauntingly familiar result for the triangular barrier 

M {26A7) 


m = - 


which is just Eq. (26.33) with k —<■ k', meaning all the prior findings can be carried over by making that replacement. In 
particular, whereas before for the wide and high rectangular barrier (p « he/( k 2 + k 2 ) = ktc/tf so that x 0 « K/k 2 « 1 /k 0 , 
now for the high triangular barrier with field / = 2 mF/h 2 , 


for' _ 4 k 2 (4k 3 -f) 

' k ' 2 + k 2 ~ (4k 3 -f) 2 + 16k 2 K 4 


(26.48) 


The variation of both k 0 x 0 and k 0 x' 0 with k is shown in Figure 26.7. That variation suggests replacing x 0 and x' 0 by a 
constant value, specifically l/k 0 , as a reasonably good (although not great) approximation, primarily for its considerable 
ease of use. 


x° 



k/k F 


Figure 26.7 Variation of k 0 x a (for F = 0 eV/nm) and k 0 x' a (for F = 5 eV/nm) for copper-like (n = 7 
eV, ® = 4.5 eV) and ad hoc {pi = 4> = 1.5 eV) parameters, where x 0 = k/1? 0 and F 0 is given by 
Eq. (26.48). 
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An integration over the force F(x ) from x = -oo to the surface at x = 0 gives the change in potential due to the density 
oscillations of Eq. (26.5). Using Poisson's equation after integration by parts [85, 348, 349], 

AO = - J F(x)dx = j xd x F(x)dx 

= J xd 2 <i>(x)dx = 4jcQ J x{p e (x) - Pi(x))dx (26.49) 

In light of Eqs (26.5) and (26.10), this is an excellent result: using p 0 = k i P /3n 2 and Eq. (26.18), it is the same as 

4a ' =4 “ i ’-{s](/° 3 l( s 7 

Qk P / „ 

= — v -(32-3n 2 ) (26.50) 


EXAMPLE: Find AO for a bulk electron density characteristic of copper. 

SOLUTION: For copper, p = 7 eV, so that k F = yj2mp/Ti = 13.5546 nm' 1 (or p 0 = k 2 /3n 2 = 84.1088 nm' } ). 
Therefore, 2 

AO = 32 ~ (0.359991 eV nm)(13.5546 nm -1 ) = 0.116063 eV 

32p 


Even before the satisfaction of the previous calculation fades, apprehensions arise: temperature must make a difference 
because the higher momentum states are weighted more as p = 1 /k B T decreases in the supply function/(&). Because 
incident waves with higher k penetrate deeper into a barrier of height V 0 = h 2 k 2 0 /2m, they cause x { to shift from its T = 0 
location, and even if this only by a small amount, that will change AO. Moreover, application of an external field induces 
a shift in the electron density and affects where x { (the origin of the background positive charge) is said to be. That shift 
gives rise to a slab of electrons not matched by the background positive charge and which therefore contributes to the 
barrier height. If that shift in the origin of the background positive charge compared to where it is at F or T = 0 is <5x,-, 
then the induced change in barrier height AO is 


•■wa: 


dx” J dx' = —- Q&yf&q) 2 


(26.51) 


Investigating these effects using a simultaneous solution of the wave functions for each k and the corresponding 
phase factors, then integrating to obtain the density variations from deep in the bulk to the surface takes some doing 
[188, 338, 347], A simpler model on which to base the analysis is to be welcomed even if it is less accurate. 

The hyperbolic tangent model (or "tanh model" for short) [173, 342, 350] is already useful in relating the work function 
<J> to the electron density variation via the exchange-correlation potential [340, 351], but it can also be used to explore 
the sensitivity of the electron density to temperature effects. Ignoring the oscillations in p(x) in Figure 26.4, a squinty look 
at the curve just makes out a hyperbolic tangent, giving the approximation its name. In practice, though, the relation is 
easier to represent as 8 


P a (x) = 


Pa 


1 + exp | kk F (x - Xj)\ 


(26.52) 


where 2 is a parameter to be determined. The infinite barrier approximation affords a good place to start, and for it, first 
let X = X 0 and, second, determine X g by demanding that d x p(x) = d x p a (x) atx = % where p(x) is from Eq. (26.5). They are 
tedious but straightforward derivatives, and when the dust settles, 

4— |f C0S ( A > + 24( ^, 4 3) sin(A) (26.53) 


s The connection is because (1/2) [l - tanh(x)] = 1/(1 + e 2x ), as easily shown. 
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where A = 2 k p x t . In light of Eq. (26.18), however, 2 k p x t = —(3n/A), and so 


64a/2 , , 

4 = (—3k 2 + 12>r + 16) = 2.48711 


(26.54) 


How Eq. (26.52) behaves using X 0 from Eq. (26.54) was shown in Figure 26.2 by the gray line, indicating that the 
approximation captures (more or less) the behavior in the region near the barrier. The reason for its consideration can 
now be revealed: it allows for a simple approximation to AO via Eq. (26.49) as 




(26.55) 


where the second form follows by the assumption that no net charge difference exists so that the total integral over p a is 
the same as that over p t . As a result, the integrations become straightforward after switching to s = X(x — x { ) to obtain 



(26.56) 


Those integrals are old friends (Eq. (A2.7)): the first integral is the same as the second but with s -*■ -s, and with p 0 = 
kp/3n 2 , all simplifies to 



(26.57) 


EXAMPLE: Find X for r s a„ = 0.2,0.3,0.5 [nm] as in Table 1 of Appelbaum and Hamann [351], who use the hyper¬ 
bolic tangent model to evaluate the exchange-correlation terms to explore the image charge potential. 
SOLUTION: Observing that k F = (9^/4) 1 / 3 /(r s a 0 ), then k P [1/nm] = 9.5958, 6.3972, and 3.8383. From Appelbaum 
and Hamann's values of (l/2k F ) [nm] = 0.0626, 0.0626, and 0.0610, respectively, it follows that X = 1.6647, 2.4971, 
and 4.2710. 


The manner in which the hyperbolic tangent approximation is used affects the values of X that are thereby useful. 
Why is this useful? It is because Eq. (26.57) captures how AO scales with / and L. When a barrier thins (L decreases) 
and/or a field is stronger If increases), then greater penetration of the barrier occurs, and as a consequence, X decreases 
in magnitude, resulting in an increase in AO. To say more or to quantify the account, the density must be found from 
a thermal distribution of wave functions properly matched at the barrier's edge, and the barrier need not be abrupt (as 
rectangular and triangular barriers are). Likewise, the self-consistently matched image charge potential [340, 341] or its 
more rudimentary and routine form of Eq. (11.25) are relatively abrupt as well. When wave function penetration into 
the barrier is then accounted for, the shifting position of x, in response to the changing density is accounted for, and then 
the small changes to the work function barrier can be evaluated numerically [188, 347], 
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CHAPTER 27 


Many-body effects and image charge 


All many-body effects are contained in the exchange and correlation contributions to ji and in their effect on the barrier potential A<fr. 
In particular, the image-force effect on ® may be regarded as contained in the disappearance of part of the correlation energy when one 
electron is moved away from the metal surface. 


- Norton D. Lang and Walter Kohn [348] 


The electron density and the potential under which electrons move is clearly related. In the bulk, electrons that seem¬ 
ingly move around in a quasi-free manner kept the physics simple, but when those electrons get to the surface, that 
quasi-free narrative must be reworked: electrons interact with each other and with the ion cores around which they 
move, and the interaction of all the electron plus ion components affects the dynamics. 

To describe a group of N electrons (e) interacting with ions (b), the Hamiltonian of their interaction must be specified, 
meaning that the various contributions to the total energy must be accounted for: the kinetic and potential energy of 
the electrons and their interactions, the interaction between the electrons and the ions, and even the energy of the ions 
themselves. Assuming the ions are represented as a smeared-out distribution of positive charge characterized by p + (r), 
these three components can be expressed as 


= KE + + V e _ b + V b 


(27.1) 


where the e and b subscripts denote "electron" and "background", respectively. In the jellium model, where the back¬ 
ground positive charge is uniformly distributed, the last two terms do not figure prominently in the story line. 1 The ones 
that do are 



(27.2) 


(27.3) 


where e is a small convergence factor which eventually is taken to zero (e -» 0) at the end of the calculation, and where 
the zero temperature limit is implicitly assumed so that the occupation probability from the Fermi-Dirac distribution is 
unity for all electrons in the sum - a good approximation that significantly eases all the integrals and summations. A 
proper account of the so-called exchange-correlation potential is involved (see refs [337, 348, 352]; the correlation part 
is a rather thorny problem in many body statistical physics, and relies on the summation of a large class of Feynman 
diagrams [64, 85], The present treatment aims to be more empathetic to those not acquainted with the formalism. The 
best place to start, of course, is with what is known already, namely, the kinetic energy term. 

27.1 Kinetic energy 

Sense sure you have, 

Else could you not have motion ... 


- William Shakespeare 2 


lr The present treatment is purposely simplified from the operator notation from whence it comes and the contributions not important to the 
present narrative have been hidden. See [85] and references therein. 

2 Ref. [37]: Hamlet, Uf.iv.72-73. 
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The kinetic energy of Eq. (27.2) has been glimpsed in Eq. (7.4), and so tackling it is straightforward. The approximation 
of the summation by an integration causes KE to be approximated by 


KE 


_2 r r f^ Ank 2 dk = h k F 
(2 jc) 3 J 0 2m 10 7t 2 m 


(27.4) 


The integration is over all of k space, and so KE is the kinetic energy per unit volume. It is agreeable to represent it that 
way by making use of p 0 = kj/^n 2 , so that 

KE = p 0 e k (27.5) 


where e k refers to the average kinetic energy of the N electrons. Using p = h 2 k 2 /2m and p 0 = kj/hiz 2 , it is seen that 
Eq. (27.4) is the same as 

KE=^pp 0 (27.6) 

and so e k = 3/</5. The other terms will unfold in a similar manner: the integrals treating the potential energy terms will 
be the product of density with an average energy, with the average energy part labeled e with a subscript denoting its 
origin, for examlple e k for kinetic energy. 


27.2 Exchange energy 


If a system in atomic physics contains a number of particles of the same kind, e.g. a number of electrons, the particles are absolutely 
indistinguishable one from another. No observable change is made when two of them are interchanged. This circumstance gives rise to 
some curious phenomena in quantum mechanics having no analogue in the classical theory... 

- Paul A.M. Dirac [40], p. 207 

The interactions between the N electrons contained in Eq. (27.3) is simpler in k space: letting R = r t - fj, then a con¬ 
tinuum version of it is 

f ^j M dR=2n J Re- tS dR J e ikRs ds = ^ ^ (27.7) 

where s = cos 9. Well, almost - it is a bit more complicated than that. Because I? is a distance between two particles, 
the operator associated with V ex (call it V ex ) acts on the ket |r,-, r ; ) to get to the left-hand side of Eq. (27.7) with R = 
f i - fj. Transforming to the momentum representation, terms of the form (A:,, k 2 \ V |k 3 , k A ) will have to be evaluated. A 
quantum mechanical field operator formalism [64, 78, 97, 353] leads to the Feynman diagram approach to evaluating 
the contributions, but the outcome can be appreciated without as much trouble [85]. The problem with the ket vector 
|/q, k 2 ) is that it seems to treat the electrons as distinguishable, and yet they are not. Wave functions for multi-particle 
statistics are constructed differently than simply gluing kets together, as in |k[) |k 2 ) = \ k } ,k 2 ). Specifically, many particle 
wave functions must show the proper symmetry for fermions and bosons in that if any two particle labels in the ket 
(e.g., kj and kj) are interchanged, then the wave function must be either symmetric (for bosons) or antisymmetric (for 
fermions) [65], That is, Py is a permutation operator that swaps k t and kj in an N-body ket such that 

Pi\k x ...k r -kj--- k N ) (±) = ±| V ' V '' V '' *v) (±) (27.8) 

with the + associated with bosons and the - associated with fermions. A rapid method of making the proper fermion 
kets is to make use of so-called Slater determinants, which, for fermions, are defined by (in the case of two particles) 

!£>] = ^ (l * it2> ' Mi>) (27 ' 9) 

and where the IV-particle version has instead N rows and columns, and a coefficient of 1 /\[n. A particle exchange is 
then tantamount to an interchange of columns in the determinant, the consequences of which are that the determinant 
changes sign. P 12 acting on |kjk 2 ) ( i clearly now gives P 12 |AqA: 2 ) < J = -|k,k 2 ) ( \ 
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As a consequence, an exchange term associated with the energy of interaction amongst the electrons arises. It will be 
the e -*■ 0 limit of 

V a = ~ 4Q 7T^ f &1 / dk 2 - -^- f PD Wi)f PD (E 2 ) (27.10) 

( 2 *rJ J \k y ~k 2 \ 2 + e 2 

In the zero-temperature limit, f PD (E) -*■ &(g - E). Introducing the difference and average wave vectors k 
k' = (k y + k 2 )/2, respectively, the integration over k! can be written as 

80(k)S J dk , &(g-E ] )@(g-E 2 ) 

= J + 

since the denominator of Bq. (27.10) is independent of If. This integral takes some getting used to. Were only one 0 factor 
present, then the interpretation is easy: J dr©(r- \x-y\) is the volume of a sphere of radius r that has been displaced 
from the origin by a distance y. But because two 0 functions are present, their joint action denotes where two such 
displaced spheres actually overlap. Hence the problem is one of finding the overlap region of two spheres, as indicated 
in Figure 27.1. Finding the cap section of a sphere (the shape of the piece that results when a bit of a sphere is sliced off, 
as it were) is a routine problem in calculus, and 80(k) is simply twice that. Therefore, 

80(k) = 2zkp©(2k F - k) f ” sin 3 0 d0 (27.12) 

Jo 

where 0 m (k) = axccos(k/2k p ), and the @(2k p - k) factor insures that k/2 does not exceed k P , as insisted upon by the original 
0 functions of Eq. (27.11). The integral gives (2-3 cos 6 m + cos 3 0 m ), or, using the definition of 0 m , 

80{k) = ^ (4 k F + k) (2 k F - k) 2 @(2k F - k) (27.13) 

Observe that the volume of a sphere, 4^k 3 /3, results when k= 0, as it should. The remaining k integration is straight¬ 
forward, made easier by taking the e -*■ 0 limit before performing the integral, or 

y ^ = -4Q7r [ j^-^SOimnedk^-^-k* (27.14) 

(2 icf Jo k 2 +e 2 w 3 * 

Now, as with Eq. (27.5), it is desired to represent V ex as p g e ex and as can easily be shown, 

,27 ' 15 > 


= ky-k 2 and 


(27.11) 


where a 0 is the Bohr radius (Table 2.3). 

A more commonly encountered nomenclature involves defining the quantity 
holds one electron, so that the electron density can be expressed as [64, 78, 354] 


1 4 , , 3 

— = ^{r s a o y 


f-L 

k F a o 


as the radius of said sphere that 

(27.16) 



Figure 27.1 Overlap of two spheres, as in Eq. (27.11), which gives the volume of the overlap region. The green line of thickness dk rotated about the k x 
axis is the volume element (a disk) of the integral over dk in Eq. (27.11), with cos 6 = k/2k F . The radius of the green element is simply k F sin 6. 
Therefore, the volume element is jrfc| sin 3 0 dB. 
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(r s is dimensionless) to obtain 


KB + V ex = R y p 0 
= RyPo 


| 2.2099 _ 0.91633 | p (e 


(27.17) 


where Ry = 2Q/a 0 = 13.606 eV and the remaining appalling mess of factors have been distilled into their numerical 
equivalents in the parentheses. 


27.3 Correlation term 


...a given electron in a dense electron gas acts to polarize its immediate surroundings: it pushes other electrons out of its way until its 
associated screening cloud possesses a charge nearly equal (and opposite) to its own. The quasi particles (electrons plus screening clouds) 
interact via an effective short-range interaction of the order of the interparticle spacing, in contrast to the original long-range Coulomb 
interaction. 

- David Pines [355], p. 11 


The electron is sometimes described as carrying a positive hole around with it.. .A part of the potential due to this type of interaction can 
be seen in the classical image force that acts on an electron as it approaches the surface of a conductor... this interaction, of course, cannot 
be determined by classical methods when the electron is within a few angstroms of the surface. 

- G.A Haas and R.E. Thomas [135], p. 124 


The exchange potential energy is not the last of it: there are other Feynman diagrams associated with Eq. (27.3). A 
small r s expansion to augment Eq. (27.17) is good for a high-density electron gas (see, for example, Feynman [64]), but a 
typical metal like copper has r s = 2.6756 when p = 7 eV, and therefore is not small in the required way. As a result, many 
other Feynman diagrams need summing. In the asymptotic limit when r s becomes large, the electrons crystallize into a 
lattice [353]: Wigner proposed an early form of the so-called correlation energy V cor (perhaps Feynman's tongue-in-cheek 
appellation of "stupidity energy" [64] to designate all the rest that is hard to know is superior) but a best fit to the data 
of Table 1 of Ceperley and Alder [356], reproduced here in Table (27.1) and based on simulation, is given by 




where c 0 = 0.8740, c, = 3.3618, and c 2 = 2.946 correspond to a least squares fit, 3 as shown in Figure 27.2. 


Table 27.1 First two columns: ground state energy of electrons as a function of r 5 , where 
e tot = (KE + V ex + V cor )/(R p a ) given in Table 1 of Ceperley and Alder [356], Second two 
columns: various metals and their associated r s from Table 1 of Kiejna [349], 


r 5 f tQt Metal r s 


1 1.1740 Al 

2 0.0041000 Zn 

5 -0.15120 Pb 

10 -0.10675 Mg 

20 -0.063290 Cu 

50 -0.028840 Au 

100 -0.015321 Ag 

Na 

Rb 

Cs 


2.07 

2.30 

2.30 

2.65 

2.67 

3.01 

3.02 

3.28 

3.99 

4.96 

5.23 

5.63 


3 Compare to the values of c 0 = 0.862849, t, = 3.22016, and c 2 = 3.03546 by Kiejna and Wojdechowski [352], which result in visually very 
similar relations. 
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Figure 27.2 Numerical data from Table 27.1 compared to KE + V a , V m , and their sum, as calculated using Eqs (27.17) and (27.18). 


EXAMPLE: Find r s for copper. 

SOLUTION: For copper, p = 7 eV, so that k p = yptmjt/h = 13.5546 nnr 1 . Therefore, from Eq. (27.16) 

= _T«92_ . 2 . 6756 

s V 4 / k F a 0 (13.555 nmr 1 )(0.052918nm) 

Compare to Table 27.1. 


27.4 Core term 

The assumption of a uniform positive background charge distribution to represent the lattice of positive ions that compli¬ 
ment the smeared distribution of electrons has the virtue of simplicity, but not accuracy. A more faithful approximation 
is to consider the sphere of radius r s to be composed of an ionic core (the nucleus and the electrons of the lower orbits) 
and an electron that is smeared more or less uniformly throughout it [164]; a metal is then a lattice of such spheres. Such 
an approximation is quite good as long as the so-called Wigner-Seitz sphere [357] is surrounded by other such spheres 
making up the bulk of the material (near the surface, the charge distribution within the sphere deforms and a dipole 
contributes that becomes a part of the work function). The configuration will give rise to a self-energy of the spherical 
electron cloud, and an energy associated with the interaction of the electron cloud with the core. Extended further, an 
assembly of non-overlapping atomic potentials centered on each atom, otherwise known as "muffin-tin potentials" (an 
array of depressions in an otherwise flat surface) can, for example, explain features of the field emission total energy 
distribution that the free electron model cannot [358]. The present model, though conceptually similar, is simpler in its 
application and therefore more of an introduction. 

Therefore, for an electron in the sphere of radius r s a 0 add to Eq. (27.17) a self-interaction term e ee and an interaction 
with the ion at the core e d . These energies contribute to Let the electron in the Wigner-Seitz sphere be a uniform 
density of charge, and the core be a bare ion. The electron self-interaction term can be evaluated using familiar methods 
[99]: inside a uniform gas of charge, a test charge at a radius r feels only the net charge inside that radius, as the cumulative 
effect of all charge outside that radius cancels. As a result, a differential shell of electron gas of volume 4 nr 2 dr at a distance 
r from the origin experiences a charge of magnitude qp 0 (r/r s a 0 ) 3 , where r is the (dimensioned) radial distance to the shell 
from the origin. Remembering Eq. (27.16) for p g , the self-interaction term is then 

£ <* = f Aicrl dr= (27.19) 

The interaction with the ion core is different: first, the interaction energy is negative, and second, all the ion charge is at 
the center, so 

e d = - f ’ “ y-p g 4nr 2 dr = ~jRy 


(27.20) 
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Table 27.2 Values of sphere radius r s and ionic radius q used in Figure 27.4 from ref. [86] (r ; = a,/a 0 and 
similarly for r s , where a Q = 0.052918 nm is the Bohr radius and a,- is the physical radius). Li-Cs are labeled 
"alkali" and Cu-Au are labled "other" in the figure. 


Li Na K Rb Cs Cu Ag Au 


1.29 1.83 2.51 2.80 3.16 2.55 2.38 2.59 

3.25 3.93 4.86 5.20 5.63 2.67 3.02 3.01 


Thus, the sum of the energy components (except for e cor to save space, and because it is not affected by including e ee and 
e ei ) is now 4 

£ k + £ ex + £ ee + £ ei = R y 

Reinserting e cor and defining E £ = e k + e ex + £ cor + e ee + e d gives 

4 = 2.2099 _ 2.7163 _ 0.8740 _ 

R y rj r s ^ + 3.3618^ + 2.946 

for which the kinetic repulsion is balanced by the potential attraction. Stability occurs at the minimum, which is at 
r s = 1.588 or r s a 0 = 0.08403 nm. Although this number is comparable to the radius of the monovalent metals as in 
Table 27.2, it is smaller and independent of the metal species, a bad sign given that metal atoms of course have different 
radii. Something has been overlooked, and that something is the ionic core. 5 

A modification nees to be made to the calculation by letting the core be of a finite size specified by r i a 0 (the i subscript is 
for "ion"). Of all the ways to account for the core, assuming it to be impenetrable is easiest in that no part of the electron 
cloud gets past the boundary at q, a useful albeit draconian "effective potential" [78], Now the integrals are considered 
afresh, after it is carefully observed that the density p 0 has changed: the exclusion of the electron from the core region means 
the core volume must be excised from the volume that the electron occupies. Therefore, let \/p' 0 = (4n/3)(r^ - r 3 ). Now 
the integrals are 6 


f 2.2099 0.91633 , 



To leading order in (r,-/r s ), their combination is 



(27.23) 

(27.24) 


(27.25) 


Observe that the second 
becomes 


acts to increase the kinetic energy by tightening up the space that the electron can occupy. E ( 
= e k + e ex + £ mr + e ee + e ei (27.26) 


then /R y is shown in Figures 27.3 and 27.4 for various values of q (the algorithm being given in Section A3.20), where 
the inclusion and exclusion of Eq. (27.25) is governed by C, being equal to 1 or 0, respectively. The points labeled "Data" 
in the later figure are for various metals. 7 It may seem that the total exclusion of electrons from the core is too Draconian, 
so that softening the value of C t may mimic that - although the correspondence is improved, the grounds for doing so 
are ad hoc. a softening of the core potential requires a better model, the one given next being a suitable template. 


4 It bears repeating because it is easy to overlook: r has dimensions of length, but r s = aja 0 and r ; = a i /a 0 below are dimensionless. 

5 Calling the core an ion is not the best use of words: what is meant is the atom inside the Wigner-Seitz cell apart from the last orbiting electron 
under consideration. It is a terminology of convenience to suggest that an atom stripped of its last electron has a certain size, even if it is stuck 
in a lattice of similarly afflicted atoms. 

6 At the risk of irritating the reader (again), remember that r has the dimensions of [nm], but that r t and r s are dimensionless. 

7 Data is from Table 9 (ionic radii, p. 78) and Table 1 (Fermi parameters, p. 150) of ref. [86], 
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Figure 27.3 The variation of Z e /R y as a function of r s for various values of r ( after e te and (Eq. (27.25)) are included in X £ . A minimum as a function 
of r t is evident, and the variation of its location with r t is shown in Figure 27.4. 



Figure 27.4 Value of r s that minimizes X e /R y (Eq. (27.26)) as a function of r v “€ ( = 0" neglects e ee + “C ( - 1" includes it. Other single valency (z = 1) 

metals (e.g., copper, silver, and gold), require additional considerations [78]. Data from ref. [86]. Treating C, as adjustable (C t = 0.4 line) improves 
correspondence, but requires justification. 


Reconsider the core model afresh by adopting the methods of scattering theory [135, 357-359], Given the symmetry 
of the Wigner-Seitz sphere, the wave function inside, u{r ) exp(ik • r), must be matched to outside, exp(ik ■ r ). The Bloch 
functions u(r) (recall Section 23.2) are given by spherical Bessel functions;',(x) and «,(x) with l = 0, where j 0 {x) = sin(x)/x 
and n 0 (x) = - cos(x)/x: they are solutions to Helmholtz's equation and describe radial waves. If the core were of zero 
size (Vj —f 0) then u(r) would only be given by j ; (r) as it alone vanishes for r —<■ 0, but for finite size the n^kr) solutions 
contribute so that 

u(r) =j 0 (kr) + ka n 0 (kr) (27.27) 


where k and a will be determined by boundary conditions. The first boundary condition is that the derivative of the wave 
function be continuous across the sphere boundary, and that in turn demands that at r = r s a 0 , 


j' 0 (kr s a 0 ) + ka n' 0 (kr s a o ) = 0 


f 0 (kr s a o 

n' 0 (kr s a 0 ) 


(27.28) 


For the ground state, the small kr limits of the spherical functions give 


krcos(kr) - sin(fcr) ^ * p r 3 

kr sin (kr) + cos (kr) 3 


(27.29) 
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or, using r = r s a 0 , then 

< 27 - 30 » 

where the introduced length a can be related to phase shift 8 0 in scattering theory [41]. Using the first Born approximation 
[359], the phase shifts are given by 

tan5, = -| jkJ^ j[(kr) 2 V(r)r 2 dr (27.31) 

so that a = k~ l tan S 0 for k -* 0, and so 


The small k limit 
results in 


mc&r 1 ) 




(27.32) 


be taken immediately, and the result is a = a 0 {r 2 - r 2 ), which, when coupled with k 2 in Eq. (27.30), 


h 2 k 2 

2m 



(27.33) 


This is simply the leading order terms from Eq. (27.21). An adjustable C i is then seen to be analogous to reducing the 
ability of the ion core to keeping the electron out, thereby causing the kinetic energy correction term to be smaller. 

Two questions are left. First, what about polyvalent atoms? The theory can be modified by replacing r s —► r s z~ y ^ in 
the derivations [78], where z is the valency number. 8 Second, why does the model work so well for the alkali and not 
for other (valency = 1) metals like copper, silver, and gold (Cs, Ag, Au)? It is because of the (implicit) assumption that 
there is no overlap between the ion cores, 9 whereas for Cu, Ag, and Au, the d shell electrons do overlap on adjacent 
ions; moreover, in polyvalent metals, the electrons occupy more than one Brillouin zone, among other defects of the 
model [135], 


27.5 Exchange-correlation and a barrier model 

We should like to point out that it is possible, formally, to replace the many-electron problem by an exactly equivalent set of self-consistent 
one-electron equations. 

- Walter Kohn and Lu Jeu Sham [360] 

The passage of the electron from metal to vacuum affects the approximation that the potential is modified by an image 
charge term near the surface. It is useful to imagine the opposite problem of an electron getting out by imagining what 
goes on when it tries to get in the surface of a metal [104]: as the electron approaches the surface, the correlation term 
reduces the density of charge about the incoming electron, essentially creating a correlation hole that, by the time the 
electron enters the metal, has a charge about equal and opposite to that of the electron. Consequently, the plummeting 
of the image charge potential -Q/x to -oo is sidestepped. The indistinguishability of the incoming electron to those in 
the metal also gives rise to an exchange contribution that, when coupled with the correlation interaction, matches up 
with the image charge potential further than a few angstroms from the surface of the metal. 

To find the variation of the effective potential, the variation in electron density serves to specify it for a single electron. 
That this is so, as shown by Kohn and Sham [360], is because the exact ground state energy is a functional of density 
and can be written as 

E g = KE[p] + J V(r)p(r)df+2Q J j P ^^ dfd? + EJp\ (27.34) 

If the exchange-correlation potential V xc is defined as 

V XC (P) = j-E xc = j- {pejpj) (27.35) 


8 The key phrase is "in the derivations" because making replacements at the end misses the subtle changes. 

’Explicitly, the assumptions are that the electrons can be treated as plane waves over most of the Wigner-Seitz sphere, and that the ion cores 
do not have appreciable overlap for adjacent atoms. 


www.ebook3000.com 




Many-body effects and image charge 361 


then the density p which satisfies these equations is that given by the one-particle Schrodinger's equation 

{-^v 2 + V(f) + y«(r)| yrffi = e Wk (f) (27.36) 

where V(r) contains the familiar components of barriers and electrostatic contributions from Poisson's equation. Solving 
these equations is the artistry of density functional theory and doing so is very effective in modeling the behavior of surface 
barriers [361]. 

V xc accounts for (most of) the changes in passing from the high electron density of a metal to the emptiness of a 
vacuum (r s -> oo). Seeing how that unfolds is edifying, with a good example being sodium (Na). Sodium is unique in 
this regard, as r s (Na) is close to the value which which minimizes the energy in Figure 27.2 or right near the cross-over 
of Figure 27.4 where the ion core effects are minimized (thereby enabling their neglect) and the potential energy can 
therefore be treated without the e ee and e d corrections [362], It is expected that 


= -/*-($- AO) 


(27.37) 


where AO is an electrostatic contribution from Eq. (26.50). The zero of energy is chosen so that V xc (r s -* 0) = 0. First 
switch from density p to r s by 

d_ = / ^V 1 _d_ 

dp \ dr s ) dr s 


3 4 a 
r 0 K — 


(27.38) 


and then apply it to V xc (where momentarily the s subscript is suspended for simplicity) to get 
/ 18\ 1/3 1 _ 1 U5J r + 2.94I6-0~ + 2.2095 
(r+3.3618Vr+ 2.9460) 2 


EXAMPLE: Estimate O for sodium (Na) assuming r s = 3.93 (corresponding to p = 3.2446 eV). Compare to the 
accepted value of O(Na) = 2.29 eV. 

SOLUTION: Using Eqs (27.39) and (26.50) for V xc and AO, respectively, 

O = V xc - p + AO 

= (5.2650 - 3.2446 + 0.0790) eV = 2.0995 eV 

which good to within 10%. 


That the exchange-correlation contribution accounts for the image charge behavior can be seen by a simple calcula¬ 
tion for sodium. Using the algorithm of Section A3.21, the results of which are shown in Figure 27.5, already shows 



Figure 27.5 Simple exchange-correlation potential for sodium parameters compared to the image charge approximation using the algorithm and output 
of Section A3.21. 
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Figure 27.6 Sum of the exchange-correlation potential and the potential from Poisson's equation. The origin is set at the edge of the background 
positive charge and is determined by the insistence on global charge neutrality. Based on Figure 2 of ref. [347] for molybdenum parameters. 



fairly good agreement between the exchange-correlation potential and the image charge approximation for a field of 
F =2 eV/nm. Numerical evaluations of the exchange-correlation potential coupled with Poisson's equation, and the 
determination of the location of x, (the edge of the background positive charge due to the positive cores), followed by 
summing the various components, yields a representation much like Figure 27.6 [347, 363]), showing the contribution 
of the various components. 

Such acrobatics drives home rather forcefully that the work function is not a fixed quantity: it depends on how the 
electron density varies near the surface, on the nature of coatings like cesium that coat the surface, of which the crystal 
face is present, and even on features (such as surface corrugation [85, 135, 364, 365]) that have not as yet been treated. 
Therefore, look askance at the idea that the work function is an unchanging input factor when approaching the emission 
equations. That seems to cast doubt on how much confidence to put into those equations, but that doubt is premature 
and the next section considers an approach. 
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CHAPTER 28 

An analytic image charge potential 


Garnet: Don't answer the question they 're asking. If a dishonest man has formed the question, there will be no honest answer. Answer 
the question beneath the question. The equivalent question. Answer the question really asked. And answer it with your life. 

- Bill Cain 1 

Questions of utility broached at the end of both Sections 26.4 and 27.5 remain regarding how to account for the 
variation of V{x) near the surface. The question "What does the emission barrier look like?" is not easily answerable: the 
image charge approximation so widely used is bereft of effects associated with wave function penetration of the barrier 
and thermal expansion of the lattice, and so is silent about changes to the work function that are a consequence. The 
evaluation of those effects is numerically achievable but not practically easy, and are increasingly complicated the more 
the potential is faithful to the variations in electron density and how it changes in response to field and temperature, not 
to mention coatings, crystal face, and non-planar structure. To model experimental data becomes difficult. 

In practice, though, what the barrier looks like is not really the question being asked. Given the utility of the canonical 
equations in modeling experimental data, the question behind the question is, can those equations be retained with, at 
worst, some "effective" parameters used within an analytic image charge barrier? That is an unambiguous question, and 
one worth answering. The answer is "Yes." 


28.1 Work function and temperature 

Einstein: You say that as if it were nothing. 

Mobius: How else should I say it? 

- Friedrich Durrenmatt 2 

The temperature variation of the work function <t> is empirically known to vary with temperature, although the 
variation is not large. Over a good temperature range, the variation is more or less linear, or 

®(D = O 0 + &T (28.1) 

where a on the order of k B [ 135, 366] (the tilde on it is to help prevent its being confused with the fine structure constant). 
That behavior is manifested in numerical simulations [347] in which the exchange-correlation potential and Poisson's 
equation for the proper treatment of the dipole term are self-consistently solved, as in Figure 28.1. Clearly over the range 
typical of thermionic dispenser cathodes (800-1200 K), the linear approximation is reasonable to account for changes 
on the dipole term wrought by increased penetration of the barrier by higher energy electrons. This is not the entire 
story, however. Thermal expansion of the bulk material, different surfaces associated with different crystal faces [364], 
and other physical effects bring in their own contributions [164], so that a can even be negative, as shown for some 
materials in Table 28.1. For example, for copper, the measured value of a/k B was (-5.69, -5.60, -4.74) [100], (-4.48) 
[110], (-3.45) [221], and (-2.76, -3.19) [111], where the crystal face is in square brackets, and each number refers to a 
different measurement [367], the data for which are shown in Figure 28.2. 


28.2 Work function and field 

No one can be a great thinker who does not recognize that as a thinker it is his first duty to follow his intellect to whatever conclusions it 
may lead. 

- John Stuart Mill 3 


1 Bill Cain, Equivocation Dramatists Play Service, 2014, Act n, Scene 3. 

2 Friedrich Durrenmatt, The Physicists (trans. James Kirkup). New York: Grove Press, 1964, Act II, p. 74. 
3 John Stuart Mill, On Liberty. New York: Liberal Arts Press, 1956, p. 41. 
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Figure 28.1 Work function corresponding to the exchange-correlation potential and dipole contribution for cesium-like parameters. The applied field is 
set at 40 MV/m, based on Figure 5 of ref. [347], The slope of the red line corresponds to a = 0.1288 k B . The dashed grey line to guide the eye 
corresponds to <6(T)[eV] = 2.056 - 0.051/^7 + 1.6 (k B T) 2 . 


Table 28.1 Work function and temperature: the value of a in Eq. (28.1) for various metals. 
Adapted from ref. [135] for representative cases except for Cu [367], Values of a can vary 
depending on crystal face and method of measurement. For a number of elements, the quoted 
value is representative, not unique. 


Name 

Atomic number 

Symbol 

<b 0 

a/k B 

Barium 

4 

Ba 

2.3 

4.31 

Carbon 

6 

C 

4.39 

1.46 

Silicon 

14 

Si 

4.02 

2.24 

Chromium 

24 

Cr 

4.58 

0.52 

Manganese 

25 

Mn 

3.83 

0.95 

Iron 

26 

Fe 

4.31 

0.52 

Cobalt 

27 

Co 

4.4 

0.78 

Nickel 

28 

Ni 

4.89 

-1.46 

Copper 

29 

Cu 

4.4 

-4.74 

Strontium 

38 

Sr 

2.3 

0.43 

Yittrium 

39 

Y 

2.95 

0.17 

Niobium 

41 

Nb 

3.95 

0.26 

Molybdenum 

42 

Mo 

4.55 

-0.34 

Silver 

47 

Ag 

4.31 

0.09 

Hafnium 

72 

Hf 

3.6 

1.21 

Tantalum 

73 

Ta 

4.33 

0.22 

Tungsten 

74 

W 

4.5 

0.13 

Rhenium 

75 

Re 

4.93 

0.34 

Iridium 

77 

Ir 

5.4 

-0.26 

Platinum 

78 

Pt 

5.03 

1.72 

Gold 

79 

Au 

4.25 

0.13 

Thorium 

90 

Th 

3.38 

0.39 

Uranium 

92 

U 

3 

2.33 


The data points of Figure 28.1 were evaluated using an assumed field of 40 MV/m. When the data points are evaluated 
using a higher field of 100 MV/m, the fitting lines as a whole are shifted to a different <t> 0 . In other words, the field 
F makes a difference in the fitted value of the zero-temperature work function <t> 0 . Alternately, holding temperature 
constant for evaluations analogous to that of Figure 28.1 results in Figure 28.3, using the value x 0 = \/k 0 = h/\flmV 0 
with V 0 corresponding to the maximum of V(x) (for the image charge potential, it would be p + </>, but in the simulation 
V(x) differs from V img (x)). This dependence is not entirely unexpected: recall from Section 26.1 that changes in barrier 
height correspond to a rigid translation of the density by an amount corresponding to x 0 as in Eq. (26.37), to a good 
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Single Crystal Copper 


Crystal Face 

Figure 28.2 Variation of a/k B for crystal faces of single crystal copper (adapted from Table 2 in ref. [367]). 



approximation. As the image charge potential and the onset of the external field are with respect to the surface associated 
with the positive background charge in the jellium approximation, both shall be affected, and so there are good reasons 
to expect Q/x -*■ Q/(x + x 0 ), an expectation that is put on a good foundation from a more complex analysis of the image 
charge potential using Thomas-Fermi theory [368]. But Figure 28.3 is something different: the present result suggests 
that O(F) -¥ 0(0) + Fx 0 . 

Grouping all the field and temperature effects (the term in Eq. (26.51) is included implicitly) together in one equation, 
a reasonable expectation is that an analytical potential barrier might look like [85, 347] 

V(x)~ ii(T) + ® 0 + aT-F(x-x g )- (28.2) 

but if an effective work function of the form 

G> eff = O 0 + 2Fx 0 + aT (28.3) 

is proposed, then 

+ + (28.4) 

This is a fairly convenient outcome: it means that the canonical equations can still he used with just a minor change in 
the parameter associated with the work function, because the form of the image charge potential is retained through 
a redefinition of position via x-* x + x 0 , which affects neither the barrier height for a given work function <t> e ^ nor the 
calculation of the Gamow factor for the tunneling probability. That is, although J{F, T) will change, the means to calculate 
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Figure 28.4 Comparison of the numerically evaluated V(x) including exchange-correlation and dipole effects for a field of 4 eV/nm, compared to the 
equivalent analytical image charge potential for molybdenum parameters. Based on Figure 4 of ref. [347]. 


it using, for example, the general thermal field equation, will proceed no differently than before. However, x 0 must still be 
determined, although a reasonable expectation is that it is close to the value given by the analytic image charge potential. 
In fact, a comparison of Eq. (28.4) to the numerically found potential profile is shown in Figure 28.4 and the agreement 
is fairly good in terms of barrier height and shape for energies near to the Fermi level or the barrier maximum. Observe, 
though, that the analytic image charge barrier is more unfavorable to emission than the standard image charge barrier. 


28.3 Changes to current density 






-Horace 4 


The changes to the estimation of current density using the canonical equations can be traced to the behavior of 
of Eq. (28.3). The impact is relatively straightforward to ascertain. Let <t> 0 be the fixed work function and <t> = <t>^ be the 
effective work function modified by temperature or field effects. Consider the effect of each one applied separately to the 
limiting cases of thermal emission (via the Richardson-Laue-Dushman (RLD) equation of Eq. (12.3)) and field emission 
(via the Fowler-Nordheim (FN) equation of Eq. (13.25)). 

• Changes to RLD current density The dependence of the Richardson-Laue-Dushman on work function is simple, 
and so a comparison of <t>) to J SLD (J, O 0 ) is most easily considered by examining their ratio, or 


JjaniT^o + aT) 


= exp 


(28.5) 


and therefore a constant change in the current density is obtained, for example for tungsten the current is 88% of 
what it would be if the work function had no temperature variation. 5 Because the change is constant, the effect is 
subsumed in the value of the work function. 

• Changes to FN current density For field emission, the work function is more pervasive throughout the equation, 
appearing in the coefficient, in the factor v(O), and in the exponent in the factors of Eq. (13.25). Additionally, x 0 is 
dependent on the height of the barrier: to leading order it may be evaluated from the simple image charge barrier 
by x 0 « ti/yjlmtu + 4>) with r/> = <T>, - \JaQF. Even so, the evaluation of J m (F; ®) is nuanced, as in the algorithms of 
Section A3.22 used to generate Figure 28.5. The primary impact on ln( J/F 2 ) vs (1 /F) will be a shift in the intercept 
plus a small change to the slope. 

The consequences of a temperature- and field-dependent effective therefore are masked in the traditional repre¬ 
sentations of thermal and field current density. Where the changes become more consequential is near the thermal-field 


4 "I am not what I used to be", Horace, Horatus Flaccus, Carmina: Liber TV, http://www.thelatinIibrary.com/horace/carm4.shtml (The Fourth Book 
of the Odes of Horace: Ode I. To Venus). 

5 The temperature change of uCH for tungsten is too insignificant to treat for its effect on J(T). 
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Figure 28.5 Consequences of a field-dependent work function ®(F) = <£ o + 2 Fx 0 (F) with x 0 = h/(2m(ii +1p) 1 ^ 2 and the approximation 0 = d> 0 - QF, 
as calculated using the algorithm of Section A3.22, for copper-like parameters. 


transition as a consequence of the changes wrought on the 6(E) factor in the Kemble approximation. Away from that 
region, the analytical image charge potential allows for a more or less standard usage of the canonical equations with at 
worst a willingness to be flexible with what <t> is, with being a field and temperature dependent quantity. With 
in hand, though, use may be made of the usual emission formulations. 




PART V 

Application physics 


We shall not cease from exploration 
And the end of all our exploring 
Will he to arrive where we started 
And know the place for the first time. 

- T.S. Eliot 1 


X T.S. Eliot, Little Gidding, No. 4 of Four Quartets. Orlando, FL: Harcourt Brace, 1943, Poem V. 
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CHAPTER 29 

Dispenser cathodes 


The field of thermionic electron emission is often considered a “mature" technology in which only incremental progress, if any, can he 
made. However, in recent years with the advent of modem surface analytical capabilities that enable one to better understand the physics 
and chemistry of emitter surfaces significant improvements have been made in thermionic electron sources... 

- Richard E. Thomas, John W. Gibson, George A. Haas, and Richard H. Abrams Jr. [369] 

The dispenser cathode developed by Phillips Laboratory [366], engineered to continually dispense low work function 
coatings such as barium 1 throughout its uncommonly long life, was a groundbreaking achievement. No wonder, then, 
that the physical processes behind its operation received great attention [228, 366, 369-380]. Although developed as a 
thermionic source, its potential use as a photocathode [381-386] came to be appreciated based on its ability to rejuvenate 
the low work function coating on which its performance depends. 

The surface of a dispenser cathode is shown in Figure 29.1, revealing an assortment of tungsten grains pressed together 
with pores visible between the grains. Impregnants containing barium are such that when heated, barium diffuses up 
and out through the pores in a process generally attributed to Knudsen flow [371]. The grain sizes vary, but if the area 
of a representative number of them are converted to equivalent circular diameters via A = {n/4)D 2 , the distribution of 
diameters can be used to infer a mean size, as in Figure 29.2. Similarly, the separation between the nearest-neighbor 
pores is well represented by a log-normal distribution (similar to a normal or Gaussian distribution with x ln(x) in the 
argument [387]) given by [382, 388] 

L(x; ti,a 2 )= —exp { - -^ [(In (x/j,)) 2 ] } (29.1) 

\Jljcox 1 2a ’ 

The analysis of a typical sintered random pore dispenser cathode gives a grain size of approximately 5-6 pm on average, 
and a pore-to-pore separation of 2.5 pm. The surface of the dispenser cathode, though machined flat, nevertheless has 
microscale surface roughness to it. Therefore, the apparent work function of the cathode depends on how coatings come 
out of the pores, how those coatings spread across the surface, how they evaporate away, and how they depend on the 
way in which the cathode evolves with usage. 


29.1 Miram curves and the Longo equation 

The data are presented as “Miram plots" which are normalized plots of emission versus temperature at various current densities... Such 
a plot is basically a “fingerprint" for any given cathode; it allows meaningful comparisons between cathodes as well as of the change of 
emission of a given cathode with time. 

- Morris Feinleib and Michael C. Green [389] 

For a dispenser cathode, higher current density operation comes at the cost of a shorter lifetime, so that knowledge of 
where the optimal operation point is for dispenser cathodes is exceedingly important, particularly for applications such 
as space-based traveling-wave tubes (TWTs) [378], where the prospects of replacing a failed cathode are non-existent. 
As in Figure 29.3, changing the current density from 0.75 A/cm 2 to 3 A/cm 2 (a factor of 4x) reduces the lifetime of an 
M-type cathode from 30 years to 1 year: as a rule of thumb, each 40 °C increase in temperature approximately halves 
the cathode's lifetime. As a result, even though thermionic cathodes can be driven harder (e.g., 460 A/cm 2 has been 
achieved by scandate dispenser cathodes [375]), the preference is often to hold them to a more modest 1 A/cm 2 or less. 

Thermionic cathodes are generally run space-charge limited (SCL; apart from being the highest current deliverable 
across a gap, SCL operation smooths the noisiness of the cathode [221], making other benefits accrue), but the CL 

1 Why the work function is reduced is the topic of Section 31.8. 
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Figure 29.1 Lett, the surface of a dispenser cathode (image courtesy of N.A. Moody (UMD / LANL)). Right, grain outlines and pore locations for analysis 
in Figure 29.2. A figure based on the left image appears as Figure 12 in N.A. Moody, K.L. Jensen, D.W. Feldman, E.J. Montgomery, and P.G. O'Shea, 

J. Appl. Phys., 102, 104901 (2007), and is reproduced with permission of AIP Publishing. 



Separation (pixel) Grain |ndex 

Figure 29.2 Left, determination of mean distance between pores of Figure 29.1 as fitted to the log-normal distribution of Eq. (29.1). Right, area of the 
grains of Figure 29.1 (right) represented as an equivalent circular diameter in microns, where the labeling is from smallest area to largest. 



Time [hours] 


Figure 29.3 Lifetime versus operating current density for typical B- and M-type dispenser cathodes. Based on Figure 5.27 of A.S. Gibnour, Microwave 
Tubes, Artech House, Dedham, MA, 1986, in turn based on a graphic by M.C. Green. 


relation of Eq. (16.15) is temperature independent. Increasing the temperature beyond that just required to reach J CL is 
wasted effort: no more current can be drawn across the gap, and if it is generated anyway, the excess is forced back to 
the cathode. A convenient method to see where source-limited current such as J RIT) transitions over to space-charge-limited 
current is given by a Miram plot, in which the ratio of the current density with J CL is given, so that when the relation 
approaches 100%, no further increase in temperature improves J. The preferred operation point of the dispenser cathode 
is then at the "knee" of the curve. Knowing how the location of that knee progresses and what causes the sharpness of 
its bend over time is desirable. 
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Figure 29.4 A typical Miram plot for experimental data from a dispenser cathode (data courtesy of M.C. Green (Varian Medical Systems) based on 
M. Feinleib and M.C. Green, Technical Digest of the International Electron Devices Meeting, 30, 314, 1984. 


A quintessential Miram plot 2 is shown in Figure 29.4. Such plots are reminiscent of an empirical formula relating 
the current density J of a thermionic source to the two relevant limiting forms of thermal (J R/D ) and SCL (J CL ) current 
density. The Longo equation, often used in life-test models of barium dispenser cathodes, relates them by 


J" GW + Va) a 

Consequently, if y = J/J CL and x = J R/0 /J c/ , then a Miram plot represents 


y[x(T)\ = 


(1 


(29.2) 


(29.3) 


The original empirical Longo equation set a 1; in the later incarnation that is Eq. (29.2), a is referred to as a "shape 
factor". If T 0 is the temperature for which Jrid(T 0 ) — Jcl' then a convenient representation becomes 


x(T) = 



(29.4) 


where B = 4>/k B T 0 = (O - V 4QF)/(k B T 0 ). For example, if T 0 = 1273 K and <p = 2 eV, then B = 18.762. The insertion of 
Eq. (29.4) into Eq. (29.3) forms the basis of Figure 29.5. 

What then if the work function evolves over time, as it does for aging dispenser cathodes? Physically, "aging" is 
noticed by the requirement of a higher temperature T' 0 to achieve J CL’ and it is straightforward to infer what change 
in cj)' causes this. Setting J R/0 (O, T 0 ) = J R/D (®', T' 0 ) (both are equal to J CL ) is solved for 4>' in terms of $, T 0 and T' and using 
Eq. (12.3) as 3 

4) + 2k B T 0 \n[^)\ (29.5) 


0 } 


Small changes in ® over time therefore do not significantly change the qualitative shape of the Miram curve, as Figure 29.6 
shows (although the knee does become a bit sharper). But change is there: the value of T' 0 has increased, which accel¬ 
erates the aging of the dispenser cathode if the same level of current density is demanded, because the loss of barium 
is accelerated. On a standard Miram curve like Figure 29.4 (rather than the scaled Figure 29.6), this is manifested as a 
shift of the knee to the right, and therefore a monitoring of the knee position is a means of monitoring the health of the 
cathode. 


2 This figure by M.C. Green and K.L. Jensen provided the image used on the plaque of the first International Vacuum Electron Sources Conference 
Miram Award (2012) received by Professor Yiman Wang (Beijing University of Technology) in recognition of her meticulous work on scandate 
cathodes. 

3 Remember that 0 = 0- \/I QF; there is no point distinguishing what the field F is, as it is not changing, so that the change in 0 is the same as 
the change in O. 
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Figure 29.5 A Miram plot (Eq. (29.3)) using the Longo equation (Eq. (29.4)) for various values of a, with T 0 = 1273 K and 0 = 2 eV. 



T/T 0 

Figure 29.6 Change in the Miram curve for a change in 0 of 10% (from 2.0 eV to 2.2 eV) for an initial T 0 = 1273 K and a = 1. Observe that were the x 
axis a temperature scale, the red curve would be undesirably shifted approximately 114 K to the right (T' 0 = 1387 K) by Eq (29.5). 


EXAMPLE: Let T' 0 be 10% larger than T 0 = 1273 K for 0 = 2 eV. What has the work function 0' changed to as a 
percentage of 0? 

SOLUTION: Using T' g =l.lT 0 , then 

0' = 1.10 + 2.2k B T 0 ln(l.l) = (2.2 + 0.023)eV 

therefore, 0' is 11.15% larger than 0. Generally the second term is smaller than the first, so that, crudely, the 
percentage change in T 0 corresponds to the percentage change in ®. 


How quickly J transitions from J mn to J CL is governed by a, which is affected by work function evolution due 
to coverage. Causes of its variation are therefore complex, so much so that Vaughan [390] argued that there is no 
theoretical justification for a (which he called n), entailing that it is simply a fitting parameter for which "good" cathodes 
are characterized by 6 < a < 10, with larger a causing the "knee" of the transition to sharpen. A number of factors 
typically affect the knee [391], but three are particularly important for the relationship between the Longo equation, 
the Miram curve, and the transit time model of Section 16.3: 

• cathode evolution changes the degree of coverage and therefore the work function 

• emitted electrons have a thermal spread in launch velocity (initial velocities are neglected in Eq. 16.32) 
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Figure 29.7 Current density from a Ba dispenser cathode after different operational lengths is compared to Vaughan's "good cathode" curve to J ajI , J a , 
and the predictions of a transit time model using the ballistic t o and full t in solving Eq. (16.33). Based on Figure 2 of K.L. Jensen, J. Lebowitz, Y.Y. Lau, 
and J. Luginsland, J. Appl. Phys., Ill, 054917 (2012) using data digitally extracted from Figure 4 of R.T. Longo, J. Appl. Phys., 94, 6966 (2003). 


• dispenser cathodes are made by "sintering" (pressing together under temperature) tungsten grains, so that many crystal 

faces are exposed to the surface [392], each of which when covered by barium exhibits a different work function. 

Such factors introduce velocity spread into the beam, which in turn causes a spread in transit times, by creating 
changes in the accelerating field near the surface of the cathode that can be affected by either surface structure 
(protrusions) or different crystal faces (work function variation). The scale of variations in ® limits the length scale 
associated with field variation near the cathode, and the ® variations are associated with the average pore-to-pore 
separation, which depends on the type of dispenser cathode: a conventional dispenser cathode has randomly spaced 
pores [372], whereas uniformly spaced pores characterize controlled porosity dispenser cathodes [392], There are also 
length scales determined by the variations due to machining [281] and crystal grain size [84], Variations in velocity 
after electrons are emitted are due to undulations in field, so the length over which those changes can be induced 
is related to the feature sizes, as further away the field homogenizes (becomes more uniform). Even though feature 
sizes are hundreds to thousands of times smaller than the anode-cathode (AK) gap (distance from anode to cathode), 
non-uniformity contributes to the form of the knee, but in ways difficult to quantify outside of simulation. 

A softening of the knee accompanies aging for dispenser cathodes. Longo made measurements of cathodes at different 
times in their life, generating the curves of Figure 29.7. The two data sets shown, the first corresponding to 7012 hours 
(0.8 years) with a » 1.8 and the second corresponding to 25142 hours (2.9 years) with a » 2.5, show a rather pronounced 
rounding of the knee that accompanies aging. To relate this to transit time, non-uniformity due to randomly spaced pores 
and patchy emission [219], and their relation to the impact of space charge in the anode cathode gap, recall that charge 
in the gap suppresses the surface field F at the cathode, thereby affecting the Schottky barrier lowering factor (the surface 
field is governed by Eq. (16.33)). When the ballistic transit time t 0 of Eq. (16.24) is used to find F, the bright red line 
results; if the transit time is given by Eq. (16.32), then the darker red line results. The lower temperature region of theory 
is thereby brought into correspondence with data, but near the knee other factors contribute. The theory curve is rather 
close to Vaughan's "good cathode" implying that "goodness" is correlated with a uniform emission surface (an implicit 
assumption in the one-dimensional models). 


29.2 Diffusion of coatings 

Theoretical treatments of the diffusion of single adatoms on clean metal surfaces are difficult in that it appears to be necessary to include 
both the relaxation between substrate atoms and the adatom, and also the variation that can occur in the bond strength per bond when 
the number of nearest neighbour atoms is varied, if agreement between theory and experiment is to be obtained. 

- Martin Prutton [343], p. 149 

Because thermionic dispenser cathodes are operated at high temperature, mobile barium atoms migrate across 
the surface rapidly (as would cesium atoms) and give rise to a low work function coating. The physics of diffusion 
and evaporation of Ba and Cs coatings on metals at high temperature [373, 393-396] is therefore a central feature 
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of the operation of dispenser cathodes (such processes also affect the operation of field emission [397, 398] and 
photoemission [380-382] cathodes). Atoms giving rise to surface coatings are provided through randomly spaced pores 
at a temperature-dependent rate, whereas uniform emission implies uniform coatings (surface density in atoms per 
unit area), which in turn reflects a balance between how material diffuses away from the pores and how fast material 
evaporates from the surface. The combination of evaporation, diffusion, and flow dictate the "lifetime" of a dispenser 
cathode. Thermionic dispenser cathodes die by barium exhaustion, 4 by which time the barium impregnates have run out 
of their chemical reactions and the amount of barium that can make passage to the surface becomes insufficient to pull 
a desired current from the cathode at a preferred temperature [366, 378], 

Dispenser cathodes will therefore be the occasion to explore diffusion, evaporation, and flow, but the related problem 
of work function reduction due to degree of coverage of the coatings themselves shall instead be taken up in Section 
31.8. This creates a small problem, as it is useful to know why coverage and work function reduction are related at all. The 
very simple model behind Eq. (21.55) is adequate for now, and it suggests that the coating of cesium or barium causes a 
work function reduction in proportion to its fractional coverage factor 6, with 6=1 corresponding to one atom per unit 
area. Herein is an ambiguity that occasionally surfaces: what is the unit area for one atom on the surface? Analogous 
to setting the volume per atom (of, say, cesium) V = (2r) 3 to correspond to Mp, where M = 132.905 grams/mole and 
p = 1.87 grams/cm 3 to obtain r = 0.245 nm (close to the covalent radius 0.253 nm of cesium), the bond length of Cs-Cs, 
or 2 r = 0.5309 nm [84], serves as a good choice to determine the unit area because on the surface Cs is not packed into 
the same lattice structure it experiences in bulk, but conforms in part to the potential peaks and valleys of the underlying 
substrate and to the proximity of its nearest neighbors in a manner suggested by Figure 29.8. The bond length-derived 
value comports well with experiment, and so the surface density a and the coverage factor 6 are chosen to be related by 



where r c is half the bond length, although taking it as the covalent radius is more convenient. 


29.2.1 Point source model 

...the main concern of physics is not with drunks who stagger home from lampposts. ..Instead of talking in terms of a drunk taking steps, 
let us revert to the less alcoholic vocabulary of physics and think of a particle performing successive steps .... 

- Frederick Reif [51], pp. 6-7. 

Two methods to treat diffusion offer a nice contrast to each other: an analytic approach by assembling solutions to 
the diffusion equation, and a numerical approach to mimic the actual migration, or random walk, of the atoms across 



Figure 29.8 Schematic representation of spherical adatoms (cesium or barium) resting on the corrugated potential surface of a bulk material like 
tungsten. Two fictitious lines, one red and one blue, provide a reference point for atoms to hop off and on to (they can also move along the line, but 
that does not change the number density), as suggested by the red and blue arrows, for the purposes of investigating the relationship between hopping 
and diffusion, codified in the rate of change of the number densities p n and p n+1 . 


4 Cesium dispenser photocathodes end their useful life analogously, but for brevity one or the other is explicitly mentioned and the other 
implicitly understood. 
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the surface of Figure 29.8. The analytic model speaks for itself: find the characteristic solutions to the diffusion equation 
and assemble them to represent a distribution, then start the timer and watch. The analytic approach is well-suited to 
incorporating parametric models of evaporation and sources. The numerical model treats the atoms as a gas that can 
migrate, and tracks them as they hop over potential barriers between adjacent sites (diffuse) or hop off the surface 
altogether (evaporate), with the energies associated with the later typically a factor of 5x or so larger than the former; by 
way of example, consider BaO, for which the diffusion activation energy is 0.7 eV, but the binding energy is about 2.1 eV 
[366], The inter-atomic potentials affect the barrier heights over which atoms jump and the well depths that keep them 
from escaping [398], features that become important in the lattice gas model of Section 29.2.2. Jump probabilities then 
are synonymous with random steps. 

As the surface atom (taken to be either barium or cesium, but referred to generically as an adatom) vibrates back 
and forth, it strikes the boundaries that confine it. Those boundaries due to the substrate atoms (taken to be tungsten) 
themselves vibrate, and so the energy of the adatom varies with each collision or interaction. If all the surface atoms 
along a line are imaginatively strung together like pearls on a chain, visually suggested by the red and blue lines marked 
Pj and p i+i respectively in Figure 29.8, with the subscript indexing the chain, then the number of atoms on each chain 
can be kept track of by the value of p at a time t = t n ; let it be indicated by a superscript p(t n ) = p n . In the amount of time 
taken for the adatom to typically strike the barrier and bounce back, or Af = t n+1 - t n , let the probability that the adatom 
instead hops the barrier to the adjacent site be given by p. The future value of the number of adatom atoms is kept track 
of by monitoring those that stay put, adding the ones that hop on from the adjacent chains, and subtracting the ones 
that hop off. In short, 

PT 1 =P?+ f(P" +1 " p") + ~(£i " P”) (29.7) 

where each difference indicates the net loss or gain from a changing of atoms between two adjacent chains, and where 
a factor of 1 /2 appears with p because half the atoms are on average headed to the (z + 1) line and half to the (z - 1) line. 
A regrouping of terms transforms that equation into 

PT X ~P n i = |(<+i " 2 Pi + (29.8) 


which is exactly the form of the Euler or the upwind differencing scheme (UDS) for the approximation of a derivative in 
time (left-hand side) and the central difference scheme (CDS) for the second derivative in space (right-hand side) using 
the finite difference methods of Section Al.3.2. In fact, if the spacial separation between the lines marked p t and p i+l in 
Figure 29.8 is Ax and a diffusion constant 


(29.9) 


is defined, then the continuum form of Eq. (29.8) is 




(29.10) 


If the "number of atoms" term is scaled by the length (or unit length) of the chain, then p t is a line number density, 
but the treatment is equally applicable to an area number density for the diffusion of two-dimensional sheets through a 
three-dimensional volume. As will be investigated shortly, extensions to higher dimensions is straightforward, amount¬ 
ing to d\ -> V 2 , the resulting equation dubbed Fick's (second) law and the associated D called the Fick's law diffusion 
coefficient. More importantly, this D is the chemical diffusion coefficient D to distinguish it from the tracer diffusion coefficient 
treated in Section 29.2.2 and designated D* [399], 

The diffusion equation of Eq. (29.10) has much in common with the time-dependent Schrodinger's equation and 
the heat conduction equation. No surprise, then, that methods to deal with these equations have common elements 
(e.g.. Green's function approaches [400]). The point source solution has an advantageous viewpoint that recurs in point 
charge models for the field emission of Section 30.3. It involves finding the solution to 


-zz(x, t) = D——u(x, t) 


Use of the Fourier transform representation 


(29.11) 
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results in 5 

for which the solution is 


d t u(k, t) = -Dk 2 u(k, t) 
u(k, t ) = exp(-Dk 2 t)u(k, 0) 


(29.13) 

(29.14) 


The reason for calling the method under discussion the "point source solution" is now revealed by the initial conditions: 
assume that whatever is diffusing is initially all piled up at the origin x= 0, that is, assume u(x, 0) a S(x). Then from 
Eq. (8.74), 

u(k, 0) tx e ikx (29.15) 


Inserting u(k, 0) into Eq. (29.14), that in turn into Eq. (29.14), and lastly demanding that uix, t ) be normalized so that its 
integral over all x is unity (more on which near Eq. (29.20)) results in 


u{x, t ) = exp -- 

yVof \ 4 


From these, a general p that satisfies Eq. (29.10) can be approximated by 


(29.16) 


p(x, 0 » X w i u ^ x ~ x i’ $ 


(29.17) 


where Wj are weights and Xj are suitably chosen positions. 

Adding dimensions is straightforward. In Cartesian coordinates, more dimensions simply adds to the Cartesian 
Laplacian (see Section Al.3.1), so that in two dimensions d 2 d 2 + d 2 , and so on for three dimensions. The method of 
separation of variables then posits solutions to be 


u 2 (x,y, t) = u(x, t) u(y, t) (29.18) 

u 3 (x,y, z, t) = u(x, t) u(y, t) u{z, t) (29.19) 


for two and three dimensions, respectively. Consider two dimensions (it is easier, and the generalization to three 
dimensions is straightforward) and return to the normalization question that should have been taken on in Eq. (29.16). 
There are two ways to express the normalization of u 2 \ 


J [ u 2 (x,y, t) dxdy = (J u(x,t)dx) = 2 k J u(r, t)rdr 


(29.20) 


where the first form is made possible because the integrals over x and y are identical, and the second because r 2 = x 2 + y 2 
(a trick familiar from Eq. (5.11)). The second form, though, is easier because of the transformation 




(29.21) 


where a = \/(ADt), and so the normalization of u(x, t) in Eq. (29.16) follows straightforwardly. 

Solutions to configurations modeled by Eq. (29.17) can apply to experiments designed to infer the approximate mag¬ 
nitude of D for adatoms diffusing over the surface of tungsten, needed to quantify what temperatures are most desirable 
to balance the speed of restoring the surface to a uniform condition with the losses to evaporation that result when 
higher temperatures are used. Dispenser cathodes used as thermionic sources do not generally have such a problem: the 
high temperature of operation insures that barium or cesium spreads rapidly, but for dispenser photocathodes that are 
operated at room temperature or so [381] the concern is more relevant, as the reduction of the "rejuvenation time" of 
a cesium dispenser photocathode is desirable even as the evaporation losses that attend heating are not. 

The migration of coatings can be observed, 6 as shown in Figure 29.9, by evaporating cesium through a square mesh 
onto the surface of 70% dense sintered tungsten and monitoring its evolution using a photoemission electron micro¬ 
scope. Insofar as work function ®(0) depends on coverage 9 and the quantum efficiency of emission (which is measured 


5 There are assumptions required to go from Eq. (29.12) to Eq. (29.13), such as that d t can be brought inside the integral, and D is independent 
of position [60, 400]. 

6 Images courtesy of E.J. Montgomery and D.W. Feldman (UMD), and J.L. Shaw (NRL). 
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Figure 29.9 Images taken using NRL's photoemission electron microscope (PEBM) by E.J. Montgomery, D.W. Feldman, and J.L. Shaw of cesium 
deposited through a mesh grid onto the surface of sintered tungsten. The approximate time between images is 8 minutes. Each image is 512 pixels 
across, or approximately 125 |rm. Images courtesy of E. Montgomery (IREAP, UMD, January 2012). 



Figure 29.10 Images of Figure 29.9 processed by Algorithm A3.23. The thin white line at horizontal pixel 205 in each marks the pixel column 
represented in Figure 29.13. 


by the brightness of the image) on <t>, the brightness is a proxy measure of the coverage, and so its evolution over time 
reveals the diffusion process. The images are approximately 125pm across and the grid regions about 25 pm on a side. 
The temperature was held below 100 °C to reduce loses due to evaporation. The dark areas between the squares or, even 
better, between adjacent rows show clear evidence of brightening over the 8 minute duration between the two images, 
corresponding to Cs diffusion across the 10pm scale in minutes (the bright regions dim as well). Digitally processing 
the images shows the effect better in the color renditions of Figure 29.10, where the brighter intensity regions are more 
visible. The intensity as a function of position for a vertical line drawn through the images can be extracted and rep¬ 
resented: the lines chosen corresponded to the x = 205 pixels, faintly visible as a vertical white line. The image itself is 
approximately 512 pixels across, with each pixel corresponding to roughly 0.25 pm. The intensity along the line at two 
different measurements separated by 8 minutes is shown in Figure 29.11. The regions of greater intensity diminish, and 
those of weaker intensity increase, conforming to how the cesium is expected to diffuse. The script taking the images 
and rendering them as well as extracting the data along the line is given by the code in Section A3.23. The rapidity with 
which the peaks diminish and the valleys fill serves to quantify the value of D using the point source solutions. 

Such behavior can be crudely modeled by preparing an array of point sources that mimic the experimental arrangement 
and then assessing how the magnitude decreases as time progresses. For example, consider the configuration defined by 

p(x,y) oc ^ u(x + kA,t){u(y- A, t) + u(y + A, 0} (29.22) 

a configuration which creates two parallel columns of seven sources a distance 2A apart, where the units of A and time 
are chosen so that 4D = 1. Figure 29.12 shows the evolution of Eq. (29.22) for the time steps 0.4, 0.8, and 1.2. A slice 
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Figure 29.11 Intensity along the white lines of Figure 29.10, in which the numerical values of the intensity along the 205 pixel column are shown as a 
function of their pixel position. Because the images do not overlap, the pixel coordinate of the 8 minute data was shifted until there was overlap at the 
leftmost minimum. 




Figure 29.13 The numerical values of p along a horizontal slice through the center of the images is represented in Figure 29.12. 


through the horizontal center shows the diffusion in an alternate manner as a cross-section: it is seen how rapidly the 
sharp peaks spread, and the center valley literally fills in, as in Figure 29.13. Consider the behavior near the 425 pixel 
mark of Figure 29.11: A « 50 pixels = 12.5 pm, chosen for the low concentration and reasonably clear signature. Thus, 
for cesium on tungsten for T < 373 K and for the 8 minutes = 480 seconds of the experiment, then a crude estimate 
for D is 


£>~ 


(12.5 cm) 2 
4(480 s) 


= 8x lO -10 


cm 2 


(29.23) 
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Figure 29.14 Le 

potential points corresponding to evaporation. Right, based on Figure 1 of Gomer [399]) a Lennard-Jones potential normal to the surface on top and 
the surface undulations along the surface on bottom. E d refers to the activation energy to get over a barrier or to a higher potential position. 


How good is the approximation? An empirical Arrhenius relation due to Langmuir and Taylor [393] for the value of D 
for cesium diffusion on tungsten gives 7 

D lt = O t e~ tE * (29.24) 

where p = 1 /k B T, D 0 = 0.1995 cm 2 /s and E d = 0.6072 eV (a value that compares favorably to that of E d = 0.586 eV of 
ref. [402]). The grounds for such a form will be taken up in Section 29.2.2. With these values, D lt (T) = 8 x 10 _, ° cm 2 /s if 
T = 364.4 K. Such numbers are compatible with the temperature of the tungsten in the experiment behind Figure 29.11, 
specified to be less than 100 °C = 373 K. The calculation, though, is more suggestive than accurate, given the immense 
subtlety that attends inferring diffusion parameters from experimental measurements (see Gomer [399]), where surface 
imperfections and movement of clusters of atoms complicate the analysis. 


29.2.2 Lattice gas 

Surface diffusion in adsorbed layers, a subject slighted in the past, has recently undergone a renaissance motivated both by the need for 
more information about transport processes on heterogeneous catalysts and by a concern with the dynamics of atomic jumps on solids. In 
principle at least, an adsorbed atom moving over a surface can be viewed as a convenient probe, both of interactions with the substrate 
and with other adatoms in its vicinity. 

- David A. Reed and Gert Ehrlich [403] 

When an Arrhenius relation such as Eq. (29.24) arises, the parameter E d is often referred to as an activation energy. For 
atoms hopping around the surface of a metal, its intuitively appealing interpretation relates to the size of the potential 
barrier that an atom must hop over to get to an adjacent site (or, as in Section 29.3, the barrier that must be overcome 
for the atom to be evaporated), as schematically suggested in Figure 29.14. Diffusion over the surface of a material can 
then be modeled by allowing atoms to migrate by simulating their jumps over a lattice, giving rise to lattice gas models 
[398, 399, 403-408] as a means of investigating surface diffusion phenomena. Actually, as Figure 29.14 suggests, there 
are two kinds of jumps possible: those that carry the adatom to an adjacent site (diffusion) and those that allow the 
atom to jump completely off the surface of the metal (evaporation). These atoms wiggle in a well often modeled by a 
Lennard-Jones potential of the form 

H(r) = 4e j (29.25) 

where r = r m is the closest approach for an atom with a zero total energy and -e is the depth of the well, as schematically 
shown in the top right plot of Figure 29.14. 


EXAMPLE: From ref. [402] for cesium, r 0 = 0.5336 nm at the potential minimum (their parameter e = 0.586 eV 
at 350 K is not needed here, but is comparable to the value of Langmuir and Taylor in Eq. (29.24)). Find r m . 


7 Nothing like Eq. (29.24) is found in ref. [393]. What is found (their Eq. (1)) islog 10 D = -0.70 - (3060/T), which is equivalent (see an excellent 
review by Antczak and Erlich [401]). The point here is to give their form in a way that introduces E d . 
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SOLUTION: Setting r m d r V = 0 at r = r 0 using Eq. (29.25) results in 



= 0 


Isolating r m results in 



(29.26) 


from which r m = 0.4754 nm. 


29.2.3 A simple model of the diffusion coefficient 

Into this Universe, and Why not knowing 
Nor Whence, like Water -willy-nilly flowing; 

And out of it, as Wind along the Waste, 

I know not Wither, willy-nilly blowing. 


Omar Khayyam 8 


What, then, should the diffusion term D look like? Because adatoms have energy, they have thermal agitation, causing 
them to rattle about in their confining wells, and the atoms of the underlying substrate do the same. Because these atoms 
are constantly bouncing off each other and the metal atoms, they are constantly exchanging energy. The probability that 
the adatom finds itself with an energy E d sufficient to jump the barrier to an adjacent empty site is therefore proportional 
to exp {-f)E d ). Then, from the discussion after Eq. (29.8), and letting Ax -> r 0 be the length of a lattice hop (from well 
region to well region), D can be written as 9 

D = ^ijv exp (~pE d ) (29.27) 

where 2A t -* 1/v is the round-trip time and v is therefore the frequency of the particle making that round trip. The 
coefficient of (1/4) reinforces that the adatom has one of four locations to jump to on a two-dimensional surface, as in 
Figure 29.8. Near r « r g , the Lennard-Jones potential is approximately parabolic, indicating that it is a harmonic oscillator, 
as in Section 9.3, for which a spring constant K can be defined [409], Taylor expanding V(r) in Eq. (29.25) about r & r 0 
gives 

V{r) « -e + 36^(r - r 0 f -e + ] -K(r - rf (29.28) 

Equating the last two forms shows that K = 72e/ r 2 0 , from which the ground state angular frequency is co 0 = sJl</M, where 
M is the mass. The oscillating adatom is therefore in a higher energy level tico = ( 2n + l)Two 0 /2, and the frequency v is 
v = w/2n. 


EXAMPLE: If cesium is treated as a harmonic oscillator in the Lennard-Jones potential well, find the energy spacing 
hoo 0 and estimate n. Use the cesium parameters r 0 = 0.5336 nm and E d = 0.6072 eV (after Eq. (29.24)), and take the 
mass M of a cesium atom to be 0.132905 kg/mole. 

SOLUTION: The energy spacing is 



Next, equating the two forms of D(T) of Eq. (29.24) and (29.27), and letting Tiv/2n = (2 n + l)hoo 0 /2, then 



from which v = 280 THz. 


8 Omar Khayyam (trans. Edward FitzGerald), The Rubaiyat of Omar Khayyam. Garden City, NY: Garden City Books, 1952, p. 149, Quatrain XXIX. 
9 When the hopping D is encountered again in Section 29.2.5, it will go by the name of D* (following Gomer [399]) and is called the tracer 
diffusion coefficient. 
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The notion of hopping over a barrier provides an intuitively appealing means of understanding the Arrhenius relation 
of Eq. (29.24). In a simple view, every time the adatom bangs against the edges of the Lennard-Jones potential, the 
thermal agitations of the lattice cause its characteristic harmonic number n to change (it switches, essentially randomly, 
to a new energy level E„ ), so that the probability that the adatom will surmount the barrier to the empty adjacent site 
is a reflection of the probability that its energy level exceeds the barrier height. As was repeatedly used in Chapter I, the 
probability that a particle is in an energy state E n = (2 n + \)Tim 0 /2 is proportional to exp (~pE n ). Therefore, the probability 
of a jump is the probability that E„ of the adatom exceeds the barrier height E d , where P n oc exp {-f)E n ). The sums are 
analytic and easy, resulting in 

Z” „ erK 

P(E n > E d ) = ^ 0 = exp (-pn 0 hco 0 ) (29.29) 

Z„=i e ~ pE " 

where n 0 is such that \n 0 + (\ /2)\hco = E d . Given the size of n 0 , this is about the same as exp(-fiE d ), just as the Arrhenius 
relation would have it. 


29.2.4 Modeling diffusion using a shuffle algorithm 


... accuracy for us means the absence of bias. Inaccuracy is then measured by the magnitude of the bias. 

- John Mandel [410], p. 105 


It is simply a question of sacrificing ideals to expediency. In Monte Carlo work justice will have been done if the final verdict is fair, i.e., 
the answers come out approximately right. 

- J.M. Hammersley and D.C. Handscomb [106], p. 26 10 


In light of Eq. (29.29), D(T) of Eq. (29.27) can be written as a product of a jump length squared, an attempt frequency, 
and the probability of a successful jump. A simple numerical model focuses on the last factor: a surface is populated 
with adatoms positioned on random locations, and then at each time step the probability that it will hop to a random 
adjacent site is calculated. If chosen, it jumps. Were the adatoms far apart such that rarely did an adatom have an adjacent 
neighbor, such a description would suffice, but as soon neighbors in adjacent sites becomes probable, a subtle problem 
in the simulation of the diffusing atoms can arise because of bias. Diffusion then becomes a convenient vehicle to discuss 
mitigation of that bias. 

Bias, by which is meant an unwarranted predisposition to an outcome, can skew results in remarkably indirect ways. 
It arises here when a method by which adatoms are chosen for jumping and updating show a preference to a particular 
direction. This can be seen using a very simple model of a linear chain of atoms rather than a two-dimensional surface. 
If all the atoms were far apart, then regardless of the order the atoms are chosen for updating, their options (move to 
the left, move to the right, or stay put) are the same. On the other hand, if all the atoms started in a line at the center 
of the string (so that only the leftmost and rightmost atoms can initially jump to an unoccupied site, all the rest having 
neighbors on both sides), the order they are picked matters: if the updating always occurs from leftmost to rightmost 
atom, a drift to the left occurs because jump sites preferentially open up to the leftmost atoms first. 11 The problem is, 
that which does not make a difference should not make a difference: physically, the order that the atoms are chosen should not 
matter to the outcome, so if it does something is wrong and a bias is shown. 

An obvious improvement would be to randomly choose the order that the adatoms were updated. This would provide 
a much better outcome because it would forestall a net drift, but a problem with randomly choosing the adatoms is that 
not all of them would be chosen with the same frequency: because they are chosen randomly, some would be chosen 
more often and some less often. That is, if N x atoms were on a chain and N t is the number of time steps, then the number 
of updatings to be performed is N = N t N x . For a randomly chosen order, though, the mean number of times that the 
atom; is updated would be (Nj) = N t , but there would be fluctuations about the mean number N t , and those fluctuations 
would become more pronounced the smaller N is. 


10 AIthough the quote was in reference to the generation of pseudorandom numbers, it reflects a viewpoint that applies more generally. 
u If this is hard to see, consider the problem in extremis: suppose the jump probability was 100% and the atom had to go to an available site if 
the opportunity permitted. Then, if the atoms were chosen sequentially from left to right, every one of them would have no choice but to jump 
left, and so the block of atoms would move left en masse, which is unphysical. 
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A way to insure that all the atoms are updated the same number of times N t , but that they are chosen in an order 
without bias, is to use a shuffling algorithm [411], A proper way to implement it bears little similarity to how a deck of 
cards is shuffled: cards experience a "riffle" shuffle, a method of randomization that, in the words of Knuth, is "miserably 
inadequate" as a technique of randomization and which further leaves a trace of the ordering, or a bias, in the cards that 
bridge players have learned to exploit when they "finesse". The card analogy, though, makes the discussion easier: the 
atoms correspond to cards, and the arrangement of a deck of cards corresponds to a particular order in which the atoms 
are selected. The shuffle algorithm proceeds by starting with the topmost card, then swaps its position with a randomly 
chosen card below it; the process is iterated with each card in the deck down to the bottom card. A version of the algorithm 
is part of Section A3.24. 

A simple algorithm for hopping using the shuffle algorithm constitutes the remainder of Section A3.24. It assumes a 
fixed jump probability p 0 and no interaction energy between the adatoms: if the jump is allowed, a direction is chosen, 
and if the site in that direction is empty, the adatom jumps. Periodic boundary conditions are assumed, so that an adatom 
that leaves one boundary re-enters the simulation region at the opposing boundary. A group of atoms, initially confined 
to the center of the simulation region, will spread their way out and fill the available space. The visual spread of the 
adatoms, by comparing the initial and final state positions, is shown in Figure 29.15. 



Figure 29.15 A two-dimensional area with 64 x 64 sites is initially populated with adatoms preferentially positioned near the center and marked as blue 
circles. At each time step the adatoms are randomly selected using a shuffling algorithm to determine if they make a hop, with the probability of a hop 
being p 0 = 0.2 At the end of the simulation, the adatoms are in the red dot final state positions. The simulation used Algorithm A3.24 with N = 64 and 
N s = 96, so that the number of time steps was 6 x N s , with 6 corresponding to the intermediate time steps shown in Figure 29.16. 



Figure 29.16 The distribution of radial distances from the center for the adatoms of Figure 29.15 for the time increments; x N with; e {1,6} and 
N s = 96, showing a spreading pattern compatible with diffusion. 
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29.2.5 Adatom motion and diffusion coefficients 

... explanations in engineering freely use vernacular language, no matter how flaky the explanation may seem to the logician or theoretical 
physicist. When the words are chosen carefully, they can insightfully convey what is going on in a human contrivance without equations 
or computer simulations, at least at the executive-summary level. 

- Steven Pinker 12 

In the hopping simulation of the previous section, neither the unit of the time step nor the site-to-site separation 
is explicitly given, both in some sense related to the magnitude of the jump probability p g . A constant p 0 implies that 
the conditions for all the adatoms are the same, but a moment's thought reveals just how tenuous this assumption is: 
adatoms in adjacent sites must surely affect the jump probability by altering the nature of the potential barrier between 
the adatoms, that is, the jump probability really should be coverage dependent, just as the work function will be found 
to be coverage dependent in Section 31.8. 

Appreciating that the concentration of the adatoms on the surface affects how the dynamics of their motion unfolds, 
revisit Eq. (29.7) with a more critical eye, based as it was on a dilute approximation: the jump probability p 0 reflects, 
as it were, the desire of the adatom to hop, but says nothing about whether it actually can. Were the site the adatom 
wished to hop to occupied, its ability to hop would be frustrated. The correction of this oversight identifies a key feature 
of a Langmuir gas, and brings a better derivation of Fick's law with it. Instead of using p 0 or v or the like, speak of a 
jump rate as a function of coverage T(0). Clearly, what has been discussed so far is r(0), or the jump rate in the absence 
of meddlesome companions occupying jump destinations. The probability of a jump r(0) has to be tempered by the 
availability of spaces to jump to. If all the spaces were occupied, then the rate would have to be 0, or T(l) = 0, and if 
none were, then r = r(0) as before. In short, the jump probability is related to the proportion of available sites. The 
simplest relation of the "effective jump rate" r(0) is therefore [399, 403] 

T(0) = r(0)(l - 0 ) (29.30) 

Next, the number of atoms n l along line 1 is a product of the width of the strip from -2/2 to 2/2 (where 2 is the distance 
from the initial jump site to the target jump site and is assumed to be the same on both sides), the number of sites N s per 
unit area, and the fraction 0 of the sites that are filled, or n ] = 20 1 N S , so that the rate of change of the number of atoms 
along that line is 

4m =ne 2 )WA (29.31) 

at 

The equation for the second line n 2 follows exactly the same kind of logic, and so is given by the replacement 1 <-> 2 in 
Eq. (29.31). The net flux J of adatoms is the difference between the change in n l and the change in n 2 , or 
dn , dn 2 

J= ^t~~dl = F(0)JV ^ [(1 “ W “ & - W] 

= -T(O)1V s 2(0 2 - 0,) « -T(G)tf(N s d x 0) (29.32) 

where N s d x 6 = d x C is the "concentration" gradient so that D = r(0)2 2 , and so, happily, observing the concentration evolve 
gives information about the isolated jump probability T(0), assuming the jump distance 2 is known. When Eq. (29.32) is 
joined with the continuity equation d t C + d x J = 0, the result is 

d t C = Dd x C (29.33) 

which hearkens back to Eq. (29.10). Well, almost. Reed and Ehrlich [403] draw attention to the transport of adatoms 
being driven by changes in the chemical potential p rather than the concentration C, but even before that, observe that 
when the hopping of an adatom was the focus, diffusion was in the language of the random walk of a single one, but 
when the discussion is about concentration (or chemical potential) gradients, then diffusion is in the language of a flow 
of adatoms. This suggests the diffusion coefficient for each way of looking differs in some fundamental way. Following 
Gomer, let D in Eq. (29.27) now be called D*, and reserve D for the term in Eq. (29.32). How does D differ from D*? 

Recall what D* represents: it follows an adatom around on a "random walk" over the surface. The random walk topic 
is a quintessential problem in statistical physics [51]. Just to be quick about it, consider the problem in one dimension. 


! Steven Pinker, The Stuff of Thought. New York: Viking, 2007, p. 226. 
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as the generalization to two dimensions is straightforward. Let a particle 13 have a probability p of taking a step Ax = v 0 At 
to to the right, and therefore a probability of q = 1 - p to the left at each time step. Of course, p = q, but the problem 
unfolds much better by not making that explicit until the end. The random walk problem is therefore concerned with 
two questions: first, how far away from the origin on average is the adatom at a later (usually long) time and, second, if 
the adatom does the walk over and over or, equivalently, many non-interacting adatoms all start out at the same time, 14 
what is the dispersion or their average distance (in particular, their root mean square distance) from the origin? 

A simple way to answer both questions uses the methods familiar from Section 4.4. The probability of a step left or 
right is, of course, unity (1 = p + q). So too is the probability of taking any combination of two steps left and right, or 
1 = (p + q) 2 = pp + pq + qp + qq, where each combination has been spelled out. For n steps, of course. 


*-Si 




(29.34) 


by the binomial theorem, where each term in the sum represents a particular order of events with Wjy = (k + j)\/k\j\ 
being the probability of a total of k steps to the left and; to the right, regardless of the order taken (so, for example, for 
n = 2, the probability W u is 1/2 that there will be one step to the left and one to the right, as seen by pq + qp accounting 
for two of the four possible combinations). Now consider 


n = n X W k , n - k V k <f- k = % fr+jWtj P k ^ nJ+k 

= X kW kj p k q’5 n j +k + XjjWy p k qi8 nJ+k (29.35) 


where is the ICronecker delta (unity if its arguments are the same, zero if its arguments are not). This appears to 
be an unnecessary complication, but consider what it represents: on the second line, the first term is the average ( k) 
number of steps to the right, and the second is the average number (j) to the left. But if p = q, the two sums are in 
fact identical because there is a one-to-one correspondence in their terms due to the symmetry of W k j and p k q> = p k+j , 
meaning that the average number of steps to the left is the same as the average number to the right, or (A) = k 0 = n/2, 
as intuitively understood (if the probability of going to the right on each step is p, then it is np for n steps). As a result, 
the mean displacement is zero because (k) - (j) = 0. Therefore, the adatom tends to hover around the origin on average. 

The dispersion is a measure of the deviation from an average, or (A k 2 ) = ((A - k a ) 2 ). Because 

((A - k 0 ) 2 ) = (A 2 ) - 2 k 0 (k) + $ = (k 2 ) -k ? 0 = (A k 2 ) (29.36) 

the task at hand is to find (k 2 ). Using the convenient trick 15 that k 2 p k = (pd p ) 2 p k (and treating q as independent of p for 
now), 

(k 2 ) = (pd p ) 2 £ W Kn _ k p k q n ~ k 

k =0 

= (pd p ) 2 (p + q) n = np(np + q)(p + q) n ~ 2 = np(np + q) (29.37) 

where the last step follows from p + q = 1. Now setting p = q = 1/2, then 

<k 2 > = ±n(n + 1) (AA 2 ) = (A 2 ) - <A> 2 = npq = ^ (29.38) 

The root mean squared displacement is therefore 

Ak mv . = V (AA 2 ) = yfn/A (29.39) 


13 The particle is a drunkard and the origin is a lamppost in the usual formulation, but such details invite many more that are better simply 
avoided. 

14 This is one of the details best avoided by not talking about drunkards: there is no need to consider why so many alcoholics would aggregate 
at the same lamppost, nor what failure of society led to so many to begin with. 

15 This trick can also be used to evaluate (A) = np, but variety in technique is good pedagogy. 
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The relation of the random walk to the tracer diffusion D* is by the two-dimensional generalization [399] 

A Trns = <|r(0 - r(0)| 2 > = (n x + n y )2 2 (29.40) 

where r(t) 2 = x(t) 2 + yif) 2 and x(t) = n x 2 with a similar equation for y(t). The number of jumps n x + n y is the product of 
the time t with the effective jump frequency v eff , and the effective jump frequency is simply the "try frequency" v 0 with 
the probability of success exp (-pE d ), so that 

A r 2 ms = A 2 tv 0 exp (~pE d ) = 4 D*t (29.41) 

The relationship between position r(t) and velocity v(t) allows for expressing D* in terms of velocity correlation functions. 
Using 

<|r(0 - r(0)| 2 > = ^ v(t’)dt’ • v{t")dt"^j 

= f Q dt' ^ • 0(f")> (29.42) 

Recall that (■ ■ ■) is an average, and in the absence of external forces for an equilibrium condition of a particle engaged in 
a random walk the averages depend on the time differences, not the times themselves, and so 


K(s) = (v(t) • v(t + s)) = (0(0) ■ v{s)) = K(—s) (29.43) 

/ dt' J dt"K(t' - t”) = 2 f ds(t - s)K(s) (29.44) 

Jo Jo Jo 

Although Eq. (29.44) can be shown more formally [51], it can be shown quickly if correlations in velocity are assumed 
to die out exponentially fast with a decay constant t, or K(s) « K 0 exp(-|s’|/-r). With that ansatz, direct integration of both 
sides of Eq. (29.44) shows equivalence. As a result 


D* = lim — f (t - s)K(s)ds = - f K(s)ds 
4 1 Jo 2 Jo 


(29.45) 


for one adatom. For N of them, 

D * = h%L {d]{Q) ' d}(t))dt 


(29.46) 


Now recall what D represents: it follows currents of adatoms over the surface. As a result [399] 

(29 ' 47) 

with AN 2 arising as a result of fluctuations of particles in the finite region under consideration, which is embedded in an 
infinite open system. If the particle velocities are correlated (the adatoms are affected by their neighbors), then further 
reduction of the equation is difficult and one is deep into the methods of Kubo-Green relations. Alas, as shall be seen, 
adatoms are in fact subject to the influence of their neighbors, but suppose that the interaction is small enough that 
treating the velocities as uncorrelated is a reasonable first approximation. The consequence is that sums of the form 

N N N N 

X u /C°) ■ Z v J (t) = Z ^ (0) • v i (t)+ Z u /°) • v kM < 29 - 49 ) 

J l J=1 / 1 tfk 


where the second summation is understood as a double sum but where the indices are unequal. Uncorrelated means 
that integrals over time involving the second summation are zero: compared to the reference velocity 0,(0), when t is 
sufficiently large that all memory dies out, v k (t) appears random by comparison, and sums over random quantities are 
on average zero. As a result, 

D_ _ _N_ ^ (N) 

D* ~ AN 2 (AN 2 ) 


(29.50) 


where the second form is a consequence of fluctuations in the finite region that is part of the infinite open system, so 
that average numbers of particles and their dispersion is what matters. 
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29.2.6 The Darken equation 


That's just the way it is/Some things will never change 
That’s just the way it is/Ah, but don't you believe them. 

- Bruce Hornsby 16 


Furthering an understanding about the way things are and how they change requires the methods of Chapter 5 
concerning energy and entropy, but with a variation. The energy levels are the diagonal elements of the Hamiltonian H, 
in terms of which the partition function can be written as 


Z = Trexp[- 


(29.51) 


where the trace TrM means the sum of the diagonal entries of a matrix M. 
then 

(N) = Tr N exp[—/?(H — pAQ] = t d„Z 
' ' Trexp P Z 


Introducing the temporary notation a = ppi, 
= d a In Z (29.52) 


It then follows 


d a (N) = jd 2 a Z-(^pj = {N 2 )-{N) 2 = {AN 2 ) 


(29.53) 


- = dJnO 


Using the last two equations in Eq. (29.50) gives 

AN 2 d a (N) N s d a 0 
~1T ~ (N ) ” 

where N s is the number of available sites and 9 is the coverage (fraction of sites that; 
is then 


D 


_ em 

' dlnd 


(29.54) 

occupied). The Darken equation 

(29.55) 


for constant temperature T. 

A "Langmuir gas" refers to the absence of interactions between the adatoms themselves or between the adatoms and 
the metal surface, for which the dependence of p on the coverage 6 arises from both the interaction U term and the 
entropy term TS in the free energy F of Section 5.3. Regarding the former, the energy will go as the product of e with 
the number of pairs of interacting adatoms, with a factor of 2 to prevent over-counting, or eN p /2 = (e/2 )y6N. Regarding 
the later, with S being the entropy, and with pi = (dF/dN) T A (constant area A is used instead of volume V for a surface 
layer) [403], the non-entropy parts of F give the usual chemical potential pr. it is the entropy part TS that gives the 
organizational dependence. Finding that term is a problem in combinatorics much like Section 4.4. The configurational 
entropy is determined by the number of ways to place N indistinguishable adatoms on N s available sites. From the defining 
Eq. (4.23), or recalling Eq. (4.26), 


^ ) 

Nl(N s — N)\) 


(29.56) 


= /r’ (-iV In(jV) - (N s - N) In (N s -N) + N s In(IV S )) 


(29.57) 


where the second line uses Sterling's approximation (Eq. (A2.4)) on nl = nV(n) (but neglects the ln(2;m) part as small 
compared to n). Writing the chemical potential /a as the sum of the usual contribution from the non-S part of F (call it n 0 ) 
and the part from TS gives 

MO) = Pho ~ Pd N {TS) = ppL 0 + In (29.58) 

Invoking the definition of coverage, or N = N s 6, results in 

pm = P» 0 - pd N (JS) = PFo + In ( ) (29.59) 

where pn 0 = PyeO 2 /2 from the U term, and so the Darken equation for the (non-interacting) Langmuir gas becomes 


D , dipm) _ y 

din 9 1-9 


(29.60) 


16 Bruce Hornsby, lyrics to The Way It Is, 1986. 
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where the (1 - 0) factor accounts for sites which are occupied and therefore excluded from being "jump-to" sites. For the 
Langmuir gas, D(0 ) = T{Q)X 2 (d(Pn)/d\nO) simply recovers D = r(0)A 2 as in the discussion following Eq. (29.32). 

If that were all, the effort expended to confirm what was known already would hardly have warranted the investment. 
But as the caution following Eq. (29.47) makes clear, adatoms do interact with each other and with the underlying metal 
surface. The energy of interaction with the metal surface (call it v 0 ) and the pair-wise energy of interaction with each 
other (call it e) are inescapably related to the coverage 9. This has two consequences: the interactions affect the diffusion 
(hopping) rate, but also affect the evaporation rate. The way to relate coverage to these interaction energies is to repeat 
the above analysis but attend to where those energies enter. It means, though, that the effort to find their contribution 
is harder than that for the Langmuir gas, although the Langmuir gas provides a useful limiting case. 

29.2.7 Adatom interactions 

To relate D to D* when the interaction energies (adatom with surface, or v 0 ; adatom with adatom, or e) are included, 
assume that the adatom interaction energy is short-ranged and occurs only for nearest neighbors, that is, when the atoms 
are on adjacent sites, but drops to zero for any separation further away [403, 412]. If there are N p pairs of such atoms, 
then a contribution of —N p e to the energy occurs. Assume also that the sites are on a square lattice, so that there are at 
most y = 4 nearest neighbors at the north, east, south, and west squares, as suggested by Figure 29.17. 

Each adatom on the lattice contributes an energy v 0 ; each adjacent pair of adatoms contributes an energy —e. The 
question becomes for such conditions, what is the most probable value of the coverage factor 6 [412]? This simple model 
constitutes the "quasi-chemical" or Bethe-Peierls approximation for the statistics of a lattice gas, and represents a tidy 
bit of statistical reasoning. It matters for understanding the evaporation rate in Section 29.3 and the work function in 
Section 31.8. The energy of the lattice in Figure 29.17 is 

E =v 0 ^r £S oX s i < 29 - 61 > 

where Sj = 0 or 1 for unoccupied or occupied sites, respectively. Let the probability that the center site is in a state s 0 -* s 
(ignore the subscript; if there isn't one, then the center site is being referred to) and has n nearest neighbors be Pis, ri). 
The sum over all values of s and n must be unity, or 

2>,«)=1 (29.62) 

Now comes the interesting approach: the probability of the center site being occupied is related to the energy associated 
with its occupancy and the number of nearest neighbors it has; the former is proportional to exp(-/?se) and the latter 
is a combinatorics problem familiar from Section 4.4: if there are n adjacent atoms, there are y ways to place the first 
atom, y — 1 ways to place the second, and so on down to y + 1 — n ways to place the nth. And finally, for every atom on 
the lattice grid, associate a value z = e fiv \ the so-called "fugacity approach" given its similarity to treatments of the grand 
canonical distribution in statistical mechanics [413] where v 0 here takes the role of n there, so 

Pis, n) oc 2 s -—- Z n e -Pns (29.63) 

nl(y - ri)\ 


1 



Figure 29.17 A central site labeled 0 and its y nearest neighbors. The occupancy of the sites will 
be given by s, = (0,1) for unoccupied or occupied, respectively, where./ s {0, y} denotes the site, 
y = 4 for a square grid of sites. 
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where the z s factor is due to the center site, the factorial ratio is due to the combinatorics, the z n factor is due to the 
occupancy of the surrounding lattice sites, and the exponential term is due to the interactions of the occupied surrounding 
sites with the center site. 

Consider the s = 1 case first. From the binomial theorem, the normalization of P( 1, n) is easily deduced, and so 


P(l,n) 


y\ z n e -nPe 

nl(y - n)\ (1 + ze~ p d r 


The same kind of deductions lead to the s = 0 case, or 


P(0, n) = 


y! e~ nPe 
n\{y - n)\ (1 + e~ Pe ) y 


(29.64) 


(29.65) 


Now two questions can be answered quickly, and motion towards a third begun, to give a measure of how adatom 
motion depends on coverage. First, for non-zero v 0 and e, what is the average occupation (0 S ) given the center site is in 
a state s, and, second, how does the jump rate T(fi) change? The third question, what is p(0), is sketched with a fuller 
account in ref. [403], For the first question, consider the s = 0 condition first, for which 


(0 o ) = i £ nP( 0, n) = ±zd z £ P(0, n) = ^ 


and similarly, for ( 6 1 ), 


= zq, where n = 


e~ Pe , so that 


<*!> = 


zr\ 

1 +Ztj 


The average occupation is the average of the s = 0 and s = 1 cases, or 


(29.67) 


(0 S ) = 0(0 1 ) + (1-0)(0 O ) 


(29.68) 


but because the center site is arbitrary, ( 6 S ) = 6 and so collecting terms, 


6 

1 — 0 


zd+zr,) 

l+z 


(29.69) 


which allows v 0 to be determined if 6 and e are known. It also has implications for the evaporation rate by solving for z 
by finding the correct root of the quadratic equation ri( 1 - 0)z 2 + (1 - 2 6)z -6 = 0: the correct root corresponds to z -> 0 
if 9 -> 0. Therefore, 


z(0) = 


-(1 - 26) + [1 - 4(1 - rf)6d - 0)] 1 / 2 
2nd-9) 


b{6) - 1+2 6 
24(1-0) 


(29.70) 


where b{6) = b( 1 —6) is the radical term. The symmetry in b(6) is interpretable as putting an adatom on being equivalent 
to taking a hole off, what Gomer calls a "particle-hole" symmetry. 

Diffusion is affected by the hopping parameter T(P), and that, too, depends on 6 and the energies v 0 and e. It will 
depend on the number of open sites (y - n), the probability of the center site being surrounded by n neighbors, and the 
jump rate associated with n given by T n = V{0)/r] n , or 


Ti6)=-Y J iy-n)Pd,ri)T n =- 


= r(0) (1+?rl = r ( o)^ 
(i +znY (i +zn ) 4 


(29.71) 


where the last form puts in y = 4 directly. 

The last consideration, that of fi{6 ), is quickly dealt with by observing that p relates to changes in energy from putting 
one more adatom on, which is the same as taking away one vacancy. This implies that the change to n is by a term that 
is proportional to ln(z(0)/z(l - 6)), or explicitly [403] 


/W) = M, + !n ( ln 


(b(6)-l+26)d-0) \ 
(bid) + 1 - 20)0 ) 


(29.72) 
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so that the modification to the Langmuir form of the Darken equation becomes 


Warn _ 

din 9 



(29.73) 


Look closely: when the interaction energy e = 0, then b(9) = 1, and the Langmuir gas results; also, the value 9=1/2 
holds a special place. But when there is interaction, the hopping rate changes as does v 0 . 

In short, the rapidity with which diffusion occurs depends on the fractional coverage of the surface. When these 
observations are incorporated into the lattice gas model and studied using Monte Carlo methods, the diffusion constants 
and hopping rates change in non-trivial ways [403], The purpose here, however, is to show that the presence of coatings 
changes how a submonolayer coating of adatoms diffuses, how they evaporate, and (when we return to the subject in 
Section 31.8) how large the work function ®(0) is as a consequence. 

29.3 Evaporation of coatings 

The rate of evaporation of atoms v a increases at any given temperature extremely rapidly as 6 increases. Thus at 1000° K. an increase of 
Ofrom 0.1 to 0.9 causes a 10 n -fold increase in v a . This indicates very strong repulsive forces between the adatoms, in agreement with the 
fact that these adatoms tend to be positively charged as indicated by the effect of the film in increasing the electron emission ...A single 
cesium adatom on a tungsten surface may be regarded as an ion held to the metal by its image force. 


- Irving Langmuir 17 [414] 


The mobility and evaporation of cesium atoms on tungsten filaments figured prominently in the development of 
surface chemistry for which Langmuir received the Nobel Prize in 1932. Although cesium (and barium, for that matter) 
hopped nicely across the surface, its tendency to wantonly abandon its outer electron to the tungsten metal meant that it 
wandered the surface more like an ion than an atom. Because it was in turn attracted to its image charge force, the barrier 
to its evaporation (motion perpendicular to the surface) was much higher than to its migration (motion along the surface). 
As the surface concentration 9 increases, the repulsion amongst the cesium atoms acts to weaken the potential holding it 
to the surface, and so a strong relationship exists between the evaporation rate, surface coverage, and temperature. The 
size of cesium compared to the underlying tungsten atoms meant that not much was required for it to get in the way of 
its neighbors, as suggested in Figure 29.18 using covalent radii from Figure 29.19. 

Langmuir [414] (and others afterwards [366,415]) deduced that the rate of evaporation would be dependent upon the 
number of nearest neighbors of cesium (for a monolayer of cesium on tungsten, y = 4, using the notation of Figure 29.17, 
but for submonolayer coverage, y will be smaller) and therefore that the evaporation rate, which is proportional to d9/dt, 
should behave as 



(29.74) 


an approximation often used in the modeling of the evaporation of coatings for dispenser cathodes. Integrating Eq. (29.74) 
results in 



Figure 29.18 (left) Schematic of cesium (large green) atoms atop a tungsten surface (small gray); (right) Cs and W atoms represented by spheres 
( r a = 253 pm, r w = 130 pm, as per Figure 29.19. The factor R will reappear in Section 31.8. 


17 Irving Langmuir, Nobel Prize talk, 1932. 
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Covalent Radii [pm] 


Figure 29.19 Covalent radii of the elements in picometers, excluding the noble gases and the lanthanum and 
ranges associated with the color scheme. 



Time [s] 


Figure 29.20 Evaporation rate versus coverage 0 for barium on tungsten at two different temperatures (T = 1273 K for filled circles, T = 1373 K for 
open squares), compared to Eq. (29.75) with y = 4 and a = 60 py [1/s], based on Figure 8 of Forman [416]. 


How does this fare for other coatings, like barium? Consider Eq. (29.75) compared to the data of Forman [416], as in 
Figure 29.20, 18 which shows a fairly good representation. Such a relation is found to adequately describe other coatings, 
such as barium and strontium, but Moore and Allison [415] go further, showing that 

-J—= m(-40v„O) - Erf(—4/?n„) (29.76) 

2k Tip 

where Erf(x) is the error function of Section A2.4, and with e = 3.68 eV and v 0 = 0.13 eV for barium. The agreement 
between their formulation and the data they show is very good when the values of e and v 0 are adjusted. 

As the temperature increases, neutral atom desorption gives way to the desorption of ions [393, 417]. The dynamics of 
the evaporation process were examined by Shih etal. [376] using temperature-programmed desorption studies to see how 
barium departs from the surface when more than a monolayer is initially present and the temperature is sufficiently high 
that when barium does evaporate, it does so in a charged state (Ba + ). Three distinctive peak intensities separated by time 
in Figure 29.21 are interpreted as multilayer desorption (greater than a monolayer of barium is on the surface), followed 
by monolayer desorption and then a transition to monolayer desorption with a different binding energy representing 
barium interacting with oxygen on the tungsten surface. 

Analogous to the hopping model of Section 29.2, evaporation can be treated as the escape of an adatom oscillating in 
a Lennard-Jones potential (Eq. (29.25)) where the evaporation rate E euap is the product of a probability of overcoming 
the barrier P, a surface number density factor a, a coverage fraction 0, and an inverse time between collisions with the 
confining potential (1 /t), or 

Eevap = - p ° 0 (29.77) 


ls Figure 8 in ref. [416] labels time in minutes, but that is incompatible with the description in the text, suggesting 0 = 0.3 after 5-10 minutes 
at T = 1373 K, indicating that t is in seconds. 
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Time [s] 

Figure 29.21 Evaporation of Ba + showing three distinct evaporation constituents: A, mulitlayer evaporation; B, monolayer Ba evaporation; 
C, evaporation of Ba and O. Based on Figure 2 of Shih etal. [376]. 


Figure 29.22 Comparison o 
bar represents the region o' 
of ref. [380], 



r which experimental data was taken by Taylor and Langmuir on which their projections w 


(symbols; the gray 
Based on Figure 9 


The probability factor P is analogous to Eq. (29.29). The potential well and barrier are a consequence of the charged 
nature of the adatoms, the 0-dependent v 0 and e familiar from the treatment of Eq. (29.55), and the dependence of 
the collision frequency on the coverage factor 0, a set of relations that must be solved self-consistently because of the 
collective dependence on coverage. Pan et al. [380] performed the self-consistent calculation numerically and found 
good agreement between the predictions of the model and the extrapolations of Taylor and Langmuir [418], as shown 
in Figure 29.22. 


29.4 Knudsen flow through pores 


This new theory ...states that a gas molecule, when impinging upon a solid wall, is reflected in a direction totally independent of the 
direction in which it approaches the wall, and that a multitude of molecules impacting on a section of a solid surface is sent out in 
accordance with the Cos-law as known for the radiation of glowing solid bodies. 

- Martin Knudsen [419] 19 


19 The translation is from ref. [420]. 
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The small, numerous, and labyrinthian tubes that make up the pores of a dispenser cathode are responsible for bringing 
up the coating material (typically barium in thermionic cathodes [366, 372, 378], but also cesium when a dispenser 
cathode is used as a photocathode [381, 382]) from the depths of the sintered porous tungsten. Although gaseous diffusion 
is involved, atoms constantly strike and stick to the pore walls, whereupon they evaporate again. The starting amount 
of impregnant (barium or cesium) in the porous matrix or reservoir, along with the rapidity with which it is transported 
through the pores, dictates the dispenser cathode operational lifetime [366, 378, 421], In contrast to the more familiar 
hydrodynamic flow of fluid mechanics, Knudsen flow deals with the physics of gas transport through tubes under low 
pressure such that the discrete nature of the atoms (or molecules) matters, and the tracking of their individual trajectories 
is followed. By such means, Knudsen in 1908 was able to explain anomalies that became apparent when the dimensions 
of the tube became small compared to the mean free path of the gas atoms. The concepts he introduced eventually found 
use in understanding gaseous diffusion processes [422] as used, for example, to separate uranium isotopes U-235 and 
U-238 in the 1940s [423] in the quest for atomic weapons. Knudsen flow has maintained its topicality up to the present 
day [424] and is even more relevant for understanding of the lifetime of the controlled porosity dispenser cathodes of 
Section 29.5. 

Of particular importance is the Knudsen number K n , which gives a dimensionless measure of what physics matters 
and when. K n is a measure of the ratio of two length scales: first, the mean free path l = (v)r where t is the mean time 
between collisions and ( v) is the average velocity, familiar from Section 3.2; and second, a characteristic dimension d of 
the enclosing chamber or volume, that is, / 

K„ = ~, (29.78) 

d 

It can be estimated by considering sequential collisions between two walls [425]. Imagine that an atom has just collided 
with one wall, has recoiled off the opposing wall, and is just about to strike the first wall again. An uncomplicated 
visualization is that of two walls as parallel plates, each of area A separated by a length L, but a more imaginative view 
realizes that the description is more general. The number of collisions is 1, but the time between collisions is twice the 
time it takes the atom to traverse the length L, or 8t = 2L/v cos 0, where v = (v) and 0 is with respect to the normal to 
the wall. For N = pV atoms, there are pVvcosO/2L = ipAv/2)cos0 collisions. Averaged over all angles, or {cos6) = 1/2, 
this gives the number of collisions per second as pAv/4. Comparing this to the total number of collisions per second as 
Nv/d results in 



(29.79) 


EXAMPLE: Find d for a cube, a sphere, a cylinder, and concentric spheres. 

SOLUTION: For 

• a cube of side a, V = a 3 and A = 6a 2 , so d = 2a/3 

• a sphere of radius r,V= (4/3)itr 3 and A = 4 nr 2 , so d = 4r/3 

• a cylindrical tube of length L and end radius r,V = Lnr 2 and A = 2nr 2 + 2 nrL, so d = 2 rL/(r + L) 

• two concentric spheres of radius a and b < a, the volume is V = (4^/3)(a 3 - b 3 ) and A = (4^/3 )a 2 + {4n/3)b 2 , so 
d = 4(a 3 - b 3 )/3(a 2 + b 2 ). 


The mean free path / takes more work by comparison: although the concept can be put on firm ground by considering 
Maxwellian velocity distributions and the probabilities of collision [51], a reasonable estimate is given by the following 
argument. For one kind of gas atom (cesium or barium) of radius r, two neutral atoms will collide if on closest approach 
they pass within 2r of each other. The cross-sectional area is therefore a = n{2r) 2 = 4nr 2 . The probability of an atom 
being struck by one other atom in a volume V = Al, where l is understood as the mean free path / = m, is the ratio of 
a to A. Therefore, the probability of being struck by one of N other atoms is No/A. Setting that probability to unity is a 
proxy for mean free path, with one small caveat: both atoms are generally moving, rather than one stationary and one 
moving. When both atoms are moving, their relative velocity v r is used in the definition of /, which must be related to 
the velocities tT 1 and v 2 of the two atoms. The average velocity of both atoms is the same. Therefore, 


(v 2 ) = ((?, - v 2 ) 2 ) = (v 2 ) + {v 2 2 ) - 2{v x • v 2 ) 
= (v\) + {v 2 2 ) = 2{v 2 ) 


(29.80) 
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Figure 29.23 Two reservoirs of different pressure, volume, and temperature (P, V, T) connected by a tube of length L and diameter d; the second 
reservoir, for instance, can be a vacuum. The flow of gas from one to the other is characterized as Knudsen flow. 

where because the velocities are randomly directed, the average (u, ■ v 2 ) = 0, and (v 2 ) is the average of the velocity 
squared of one of the atoms, or (v 2 ) = 2(v 2 ). This will introduce a \Jl in the velocity defining /. So, the probability of 
intersection is unity when 

Na=A=— (29.81) 

iy/2 

Letting N = pV, a = 4 nr 2 , and (from the ideal gas law) p = P/k B T, then 

k R T 

l=—p - (29.82) 

4\/2 nr 2 P 

Knudsen flow refers to the transport of molecules (or atoms) through a tube connecting two reservoirs, as in 
Figure 29.23. One of those reservoirs, say the one on the right, will be at vacuum for a thermal cathode, but keep 
the discussion general and simply assume P 2 «: P l = P. Let reservoir 1 contain liquid cesium or barium that has an 
associated vapor pressure that changes with temperature. A common representation of the vapor pressure from which 
P(T) is obtained is the Antoine equation 

logioCP) = A — (29.83) 

whre the values of A, B, and C are available in various databases. 20 For cesium, 

log 10 (P Cs ) = 6.5709 - T ™Hj 29 (29.84) 

for P in [Torr] and T in [K], whereas for barium, 

7599 4 

logioCJW = 6.9570 - T _ ™yi (29 ' 85) 


EXAMPLE: Let a tube fashioned after a dispenser cathode pore be characterized by a diameter of 0.4 pm and a 
length L » r. Let it be connected to a reservoir vessel, the pressure in which is characterized by the vapor pressure 
of cesium, as in Eq. (29.84). The radius of a cesium atom is 0.263 nm. Let the temperature T = 500 K. Find K n . 
SOLUTION: Because L » r, d = 2R. Use the ideal gas equation, p= pp with p = 23.209 [1/eV], Finally, observe 
that 1 Torr is 8.3212 x 10“ 7 [eV/nm 3 ]. Therefore, 

K = - = (1 6\[2nRr 2 pP(T)y l = 50.703 (29.86) 

a 

Thus, for the temperature and pressure chosen, the mean free path exceeds the diameter of the pore by over an 
order of magnitude. Were the cesium to be replaced by barium, a comparable K n value would be obtained for a 
much higher T of 1200 K. 


The mass flow rate Q through the tube is given by the product of the mass of the gas atom and a flux through an area 
A, as suggested in Figures 29.24 and 29.25. If the length of the tube is long compared to its diameter and to the mean free 


20 See, for example, the NIST Chemistry WebBook, http://webbook.nist.gov, which gives A for P [bar]; here, units are converted so that A gives 
P [Torr], where 1 bar = 750.062 Torr. 
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Figure 29.24 The center tube of Figure 29.23, through which gas flows from Reservoir 1 to Reservoir 2. The mass flow rate is Q. The tube is of length L 
and cross-sectional area A. Cesium atoms that evaporate off a small patch dS on the cylindrical inner surface S a distance z from Reservoir 2 travel to dA 
and contribute to the mass flow rate. 



Figure 29.25 Closer view of the relation of dS to dA in Figure 29.25. The z axis is along the tube axis. The area normal to dS defines the vertical p axis. 
The function /(0) describes the evaporation distribution. R is the radius of the tube. 8 is the angle between the vertical axis and the line / linking dS to 
dA. y/ is the angle between the vertical and the direction r to dA in the x-y plane at the entrance to Reservoir 2 such that dA = rdy/dr ; <p is also an angle 
in the x-y plane such that dS = Rdtpdz. The solid angle dSl originates at dS, encloses the line /, and ends at dA. Inset in upper left: green arrows are the 
direction of in-flowing atoms; dA is the shadow cast by dA cos 9 = (z/l)dA onto the plane A. 

path 1, then the likelihood that the atom had journeyed from Reservoir 1 to Reservoir 2 without encountering anything, 
be it sticking to the tube wall or colliding with one of its fellow travelers and getting knocked into the direction of a wall, 
is rather small indeed. Therefore, it may be assumed that all the atoms passing through the end of the tube through the 
window area marked A have done so after being evaporated off the tube wall. Calculating mass flow is then tantamount 
to adding up all the atoms that leave the surface element dS and pass through the window element dA. 

But there are matters requiring care. First is that the atoms do not approach dA head on, but at an angle: dA is analogous 
to a shadow of the window that the atoms pass through, the angle between the window and its projection dA onto the 
window area A, as in the upper left inset of Figure 29.25. Second, the total flux F into the surface dS has to be the same 
as the total flux coming off the surface, but, as Knudsen pointed out, the atom must be absorbed onto the surface and 
stay there long enough to lose all memory of its incoming direction, so that the distribution f{6, <p) is normalized to unity. 
Azimuthal symmetry of emission is reasonable, and Knudsen argued that a cosine-law applied to f(0), as was known 
to be the case for diffuse radiators in optics and which goes by the name of Lambert's cosine law: it results in the same 
flux per projected area as a function of angle 6 with respect to the surface normal (see the sub-figure in Figure 29.25). 
Therefore, setting /(#) = N c cos 0 



(29.87) 


or f{6) = (\ /n) cos 0. Such a distribution resembles the small blue-green sphere in Figure 29.25 hovering above dS. 
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Lastly, although the flux F into the wall looks like a constant, a moment's reflection shows this cannot be true if the 
reservoir pressures are different: near Reservoir 1, the pressure is higher, and so the flux must be higher. If the gradients 
are not large, 

F(z) = F(0) + z(d z F) (29.88) 


But remember that flux is already known, or 


T(x) = p(v+) = 


PP(z) 

{2nMpy/ 2 


P(Z) 

(2 nMk B T) 1 ! 2 


(29.89) 


where v+ is the velocity component into dS, and (ej) = {v)/4 for a Maxwell-Boltzmann distribution in velocities, as can 
be deduced in a variant to Eq. (3.16). 


EXAMPLE: Prove (v+) = (v)/4 for a Maxwell-Boltzmann distribution, where the + superscript denotes "in the 
forward direction". 

SOLUTION: There is no loss in generality in working in units for which the mass M is equal to 2. Thus/ MB (e) = e~ v2 
where v=\v\ and v 2 = v 2 + v 2 + v 2 . It follows 

, + , = Ip v x ^v x _ _j_ = / 0 °° v^e-^dv _ 

* f-<x> v x e ~ v *d v x 2 \[n' f^vZ-er'Pdv 

where the differing representations of the integrals involved is a consequence of the peculiar features of a 
Maxwell-Boltzmann distribution expressed alternately in Cartesian and then polar coordinates. Thus it follows that 
( v x) = (v)/4. 


Now the mass flow rate Q can be evaluated: it is the integral over the flux T(z)f(8) coming off of dS and directed 
through the solid angle d£l, or 

Q = M HI dSdClf(d) T(z) 

/■“ f* /2 f 2RcoS¥ rdrrcosy/ z 

= M J 2nRdz J dyr J -y ^ -T{z) (29.90) 


where use has been made of l 2 = r 2 + z 2 and dQ. ■ h = z/l. But observe that the integral over the first term in Eq. (29.88) 
(a constant) vanishes because the integrand is antisymmetric in z, therefore only the zd z T term survives. For the second 
term use the linear approximation 


d z T(.0) 


„ T 2 ~ T \ _ P2VP2-P1VP1 

J Z 2 ~ Zi 


L\JlnM 


(29.91) 


The remaining integrations are straightforward, and the general Knudsen equation for Q becomes 


Q = | {p 2 P\ /2 - P\P\' 2 } 


(29.92) 


For the special case in which the temperatures of the two reservoirs are equal, it simplifies to 


Qlr 1= r 2 = \y/2xMp^P 


(29.93) 


where A P is the pressure difference between the reservoirs. Below, making the assumption of equal reservoir tempera¬ 
tures immeasurably eases the discussion, and so Eq. (29.93) will be used for Q. Observe that the infinite tube, or L » 1, 
limit was used to eliminate the F(0) integral. When the tubes are of finite length, then the calculations are more difficult 
(426], 

An expression for Q using a dimensionless tube cross-section parameter W often proves convenient, the relation 
between W and Q being given by 

Q = M(v+)A(P 2 - PJW = —(v)A(P 2 - Pj)W 


(29.94) 
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Figure 29.26 Example tube cross-sections for the evaluation of W by Eq. (29.96), such that the height of each is unity. In addition to the circle (left), 
they are termed triangle, inverted triangle, diamond, and square in Table 29.98, for which (cosvd = 0.849, 0.721, 0.864, 0.916, and 0.790, respectively. 


Cast in this way, the orientation and shape of the tube, which matters in how the gas moves through it, is contained in 
the behavior of W alone, and it, being dimensionless, allows for convenient comparisons. W is seen to be defined by 

W = 2Ljf dACOSV/ ^2L (COS,y ) (29-95) 

where P A is the perimeter of the tube cross-section, A = J dA is the total cross-sectional area, and y/ is the angle directed 
to the location of dA in Figure 29.25. Observe that (cosy r) depends only on the shape of the tube, not its total area. An 
example helps dispel ambiguity, so consider the circular cross-section, or the first of the shapes shown in Figure 29.26, 
for which P A = 2nR, A = nR 2 , and 


(cos y/) = 


1-1% ™sy/dy/ fg Rmsw rdr 

I-in'ty fo Rcosw rdr 


— = 0.84883 
3 ji 


(29.96) 


where the limits of the r integration specify the rim of the tube. Thus, for a circular pore [427], 


W 0 


8R 
3 L 


(29.97) 


The circular pore, however, is unique: all the points along the rim for the evaluation of (cosy/) are equivalent. When 
the pore has a non-circular shape, however, the orientation of the pore matters, and where the origin is results in a 
different value of (cos yr). For example, the remaining pore shapes shown in Figure 29.26 can be put into integrals of the 
form 


(cosy/) = A 1 f dx f yciy 

A. Jy« V* 2 + y 2 


(29.98) 


where x a ,x b ,y a and y h depend on the pore shape and orientation and are given in Table 29.1. The numerical values of the 
integral of Eq. (29.98) are shown in the figure caption of Figure 29.26, thereby demonstrating that the value of (cos y/) 
depends on where the origin of the integration is. The shapes of Figure 29.26, though, were chosen with care to be the 
limiting (largest and smallest) cases, so the average over all origin choices should lie in the middle, say near the average of 
those limiting cases. The averages do not differ greatly from the circular case, and so for the purposes of a life-time model, 
using the circular result of Eq. (29.97) is convenient and approximately correct. Even so, given that the perimeter P pore 
and cross-sectional area A pore of the pore need not be circular in general, it is better to call those quantities out specifically, 
and finally represent Q as 

_ _^4 p 

Q « yf2MP P ° re L Pare AP(D (29.99) 


where AP(T) is the temperature-dependent pressure difference between Reservoir 1 and Reservoir 2 (taken as vacuum). 


Table 29.1 Pore integral limits for Eq. (29.98) (recall that the value of (cosy/-) 
is 0.84883 for a circular pore). 


Pore 

2^ 

x a 

x t 

y a 

y a (cosy) 

Triangle 

2-(/3 

0 

1/V3 

0 

1 - -\/3 x 0.721 

Inverted triangle 

(6/5) VI 

0 

1/V3 

0 

x/VI 0.864 

Diamond 

4 

0 

1/2 

X 

1-x 0.916 

Square 

2 

0 

1/2 

0 

1 0.790 
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In a conventional dispenser cathode, the tubes are hardly straight (as they are in the tubes of the controlled porosity 
dispenser cathode considered in Section 29.5). As long as the pore diameter is small with respect to the curvature of 
the tube, characterizing tubes by a tortuosity factor [428] is possible. A simple one-parameter model is suggested by the 
spring-like shape in Figure 29.27. The length of the tube is given by 

I = / 2 ”{ r2 + ( 2 ^) 1 } ‘*' = V 1 + ( t) J ,29 ' 10 °> 

where 2 = l/nr is the pitch. As the number of turns n -*■ 0, then I. -*• l, but for a very tortuous tube, L can become much 
greater than l. 


29.5 Lifetime of a sintered wire controlled porosity dispenser cathode 


The oldest hath home most; we that are young 
Shall never see so much, nor live so long. 

- William Shakespeare 21 

Thermionic cathodes operated SCL are at a temperature such that more electrons are emitted than can be pulled 
across the AK gap. The excess are turned back to the cathode. As a result, the electron cloud in front of the cathode 
smooths emission non-uniformity that may exist on the cathode surface. The random placement of the pores and 
the non-uniformity it entails is therefore not necessarily problematic. However, when dispenser cathodes are used for 
applications which are not run SCL, as for gyrotrons and gyro-amplifiers, [12, 13, 221, 429] or for photocathodes for 
particle accelerators and free electron lasers [380-382, 384], emission non-uniformity is undesirable so that the pore 
and grain structure may entail problems. Although uniformity can be induced by running the cathodes at a higher 
temperature to achieve faster barium diffusion and surface coverage, that comes at a cost of lifetime. By the very nature 
of the sintering process on tungsten grains, and the damage to the pores that results from machining the surface, a 
uniform placement of clear pores was not possible, for a while. 

Controlled porosity dispenser (CPD) cathodes developed by the Naval Research Laboratory [369, 370] showed proof 
of concept, with laser-drilled or etched holes approximately 5 pm in diameter making the pores. The CPD cathodes 
demonstrated promising performance in terms of uniformity of emission, although the pore size resulted in a more rapid 
depletion of the barium than desirable, and costs associated with making the pores was high. 

An innovative solution by Calabazas Creek Research (CCR) [392] wound tungsten wires about a quarter the thickness 
of a human hair 22 on a spool then cut cross-sections of a desired thickness and area to give the densely and uniformly 
placed pores shown in Figure 29.28. The simple and cost-effective technique was usable with reservoirs that can be made 
large, which provides practical advantages, a notable one being the absence of a tortuosity factor for tubes of a uniform 
length. Thus, the Q per tube is the same and a relatively simple estimate of estimate cathode lifetime can be made. 


21 Ref. [37]: King Lear, I.v.326-327. 

22 Human hairs vary, so what should have been said is a straight human hair from a Caucasian adult (East Asian hair has twice the diameter) 
that is black (80-100 pm) rather than red (50 pm) or flaxen (20-50 pm). But as tungsten wire thickness varies from 4 pm to 20 pm, a range in 
hair thicknesses to give the same ratio is permissible. 
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Figure 29.28 Left, SEM of a CCR controlled porosity dispenser cathode manufactured using 20 pm diameter tungsten wires sintered at 2348 K for 1.25 
hours, giving rise to approximately triangular pores 4 pm in extent. Image courtesy of R.L. Ives (Calabazas Creek Research), similar to Figure 9 of R.L. 
Ives, L.R. Falce, S. Schwartzkopf, and R. Witherspoon, IEEE Trans. El. Dev., 52, 2800, 2005 [392] and used with the permission of IEEE TED. Right, 
schematic details of the surface (top) and wires (bottom). 


Cesium in the reservoir is activated by heating: for dispenser cathodes, the reservoir is filled with cesium or barium 
producing compounds that liberate cesium or barium when heated. The vapor pressure relation is therefore not that of 
pure cesium or barium, but something less fulsome (a factor E d added to the vapor pressure relation will account for 
it). To evaluate the lifetime then, several factors must be known: the amount of cesium M 0 in the reservoir, its flow rate 
through one pore Q, the length L and area A pore of that pore, the number of pores N in the area of the cathode, and the 
operating temperature T and vapor pressure P(T) in the reservoir. The lifetime At is then specified by the relation [380] 


M 0 = NQAt 


(29.101) 


The factor N is determined by first relating the number of pores in the light-blue hexagon, or 6, to the ratio of the area 
of the hexagon A hex = 6\/3 r 2 to the area of the red equilateral triangle A^ = y/lr 2 for r being the radius of the wire, as 
suggested by the upper right inset diagram of Figure 29.28. Assuming that the area of the cathode is much larger than 
the the cross-sectional area of the tungsten wire, then N « A 0 /A^, where A 0 is the entire area of the cathode. 

The area of the pore A pore is affected by the sintering temperature and duration, but take it as A^ with three wedges 
("pizza slices") of area nr 2 /6 removed. Likewise the perimeter of the pore is three arc-lengths of nr! 3, or 23 


^* re =(V3—§)r 2 


(29.102) 


(29.103) 


Inserting all the factors into the expression for At results in 

3 M 0 l (6xk B T\ 1/2 

Af= 4(V^-W2))^ rAP(T) V^ 2 ' 


(29.104) 


for the CCR sintered wire controlled porosity dispenser cathode. Two features are important. First, although Q oc r 3 , the 
number of pores N oc r~ 2 so that lifetime is only inversely proportional to r. Second, if the pressure of the second reservoir 
P 2 is taken to be 0, or the reservoir is at vacuum, then it would seem that AP(T) = P Cs (X) from Eq. (29.84), but in light 
of the "cesium-producing compounds" nature of how cesium is generated in the first reservoir, a factor to account for a 
greater difficulty cesium may have in evaporating from other materials rather than itself is accounted for by modifying 
AP(T) to become 

AP(T) -► P Cs (T) e - E * lk B T (29.105) 


where E d serves the role as an activation energy that requires determination by comparison with data. 


23 The specification of a tube area and perimeter for non-circular pores has advantages over constructing an "equivalent" pore diameter, but 
the outcomes of the present approach to ref. [380] are comparable. 
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EXAMPLE: Estimate the value of E d for a sintered wire CCR cesium-controlled porosity dispenser cathode 
using the measured lifetime of 5-10 hours for the following parameters: cathode area A 0 = n(0.\7 cm) 2 , thickness 
L = 0.762 mm, radius of tungsten wire r = 23 pm, mass of cesium in reservoir M 0 = 7.7 mg, and temperature 
T = 773 K = 500 °C (measured values courtesy of Eric Montgomery, UMD). 

SOLUTION: First take E d = 0 eV. Recall that M= 132.91 grams/mole for cesium. Eq. (29.104) then gives 
At = 5.2192 seconds. Therefore, using the midpoint of the experimentally measured lifetime, e Ed/kliT = (7.5 hours)/ 
(5.2192 seconds), or E d = 0.57 eV. 


EXAMPLE: Using E d from the previous example, estimate the lifetime of a cesium sintered wire CPD cathode for 
a temperature of 250 °C. 

SOLUTION: For T = 473 K, P Cs = 1.9525 Torr and = 3.2432 x 10~ 6 . Using the cathode and material 

properties of the previous example, then At = 33.768 kHrs, or a little under four years. 


Extending such an analysis to conventional barium dispenser cathodes requires an estimate of the tortuosity factor 
as well as E d . Values of X of order unity and E d ~ 1.4 eV gave lifetime estimates that were comparable to conventional 
barium dispenser cathodes of 30 kHrs at 1273 K [380], 





CHAPTER 30 

Field emitters 


Field emitters are conventionally needle shaped in order to provide the required value of the electric field, i.e., 10 7 <F < 10? volts/cm, 
with convenient values of the applied potential. The emitter tip is usually a single crystal, approximately hemispherical with a radius of 
the order of 10~ 6 cm. Thus both </> and F vary appreciably over the emitting surface... 

- Walter P. Dyke, J. Ken Troian, Winthrop W. Dolan, and George Barnes [5] 

The analysis of field emission experimental data concerns matching current to applied voltage (I- V) rather than current 
density to surface field J(F), as in Section 13.3, where the emphasis on surface is not acknowledged nearly as often as it 
should be. Applying Fowler-Nordheim theory is nuanced [430], but as before, the simple approximate relations between 
current and current density of 7 = JA, where A is the area of emission, 1 and between applied voltage V and field of F = fl g V, 
where ft is a field enhancement factor, are oft-used relations that allow the Fowler-Nordheim equation (see Eq. (13.25)) 
to be morphed into the visually similar but nevertheless very different empirical equation of Eq. (13.30) - recall that it is 

7(V)=AV 2 exp(-|) (30.1) 

Although there are serious issues with making the I -*■ JA and V -*V/fi g replacements [431] so as to recover J fn (F), 
issues that shall be re-visited in due course, what is interesting here is why these approximations need to be made at 
all. Visualize holding two metal plates apart with with a dielectric material (in other words, a capacitor) and try to get to 
fields across the plates that are characteristic of field emission before suffering dielectric breakdown. The dielectric fails 
first, for example quartz glass (amorphous Si0 2 ) has a dielectric strength of roughly 160 MV/cm = 0.016 GV/nm [432], 
which is well below the fields needed in J PN to make it interesting. To get the field at the emission site up in the range of 
GV/m, emitters are often made into sharp points where the field is strongest at the tip. As a result, the area refers to the 
apex area (or some fraction of it), and the field enhancement refers to whatever processes cause fields about sharp points 
to be large. Therefore, it is the multidimensional nature of field emitters that really sets them apart, spawning the myriad 
technological gambits to make edges and points, and which causes the electron beams from field emission sources to be 
of a rather different nature than the planar thermal, photo, or secondary emitters they compete with. But more to the 
point here, it also makes the correct relation for I-V much more complicated than Eq. (30.1) would imply, makes field 
emitters very susceptible to space charge, and causes their performance to be undermined when groups of them are too 
close to each other. 

A few comments are in order. The methods to treat field enhancement have application to an interesting hybrid known 
as the Schottky emitter, wherein high temperatures are used for emission, but large fields are used to lower the emission 
barrier [154], The physics behind hot needle-like emitters therefore combines thermal-field emission behavior with field 
enhancement effects, so that though they are not explicitly discussed further here, the physics behind their operation 
is implicit in what is. Also, not all cold cathodes are pointy: some are flat and rely on an internal field emission model 
[150, 433-435]. Lastly, the effects of temperature always lurk in the background as when heat is unwanted, its dissipation 
through the narrow structures of many field emitter sources is problematic [230, 436-441]. In short, what is treated 
herein is not all that there is, but should be helpful for that which is left out. 


30.1 Field enhancement 

"Look aloft!" cried Starbuck. "The corpusants! the corpusants!" All the yard-arms were tipped with a pallid fire; and touched at each 
tri-pointed lightning-rod-end with three tapering white flames, each of the three tall masts was silently burning in that sulphurous air, 
like three gigantic wax tapers before an alter. 

- Herman Mellville 2 

1 Observe that the relation should be I = f JdA ns J apex A 0 , where A 0 is the "notional area'', but this gets ahead of the narrative. 

2 Herman Melville, Moby Dick. New York: Bantam Books, 1986, Chapter 119. 
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The typhoons in the Japanese seas that threatened the whaling ship Pequod in Melville's epic gave rise to conditions 
which caused corona discharges at the tips of the masts, in which strong electric fields ionized the air and created St. 
Elmo's fire (what the coffee-loving first mate Starbuck and others of his time called corposants although Melville used a 
different spelling). Given that the potential difference between cumulonimbus clouds and "ground" are several hundred 
megavolts, and that the clouds are within 2 km of the ocean surface, gradients on the order of 0.1 eV/pm result, whereas 
because the dielectric strength of air at normal temperature and pressure is about 3 kV/mm (P = 3 eV/pm), the fields at 
the mast tips must exceed that. 

The higher fields on the masts compared to those in the wide open are a result of field enhancement, whereby sharp¬ 
ened structures like the tips of masts, or lightning rods, or needle cathodes cause equipotential lines to bend in the 
vicinity of a sharpened point. A ship on the ocean distorts the equipotential lines in its vicinity that exist due to the 
voltage difference between the clouds in the upper atmosphere and sea level, similar to Figure 30.1. Electric fields are a 
measure of how many equipotential lines are crammed into how small a region: in the figure, the potential difference 
between adjacent lines (dashed blue or solid red) is 0.2 units (the unit being F 0 a as shown below). Near the ship mast, 
three red lines are crammed into the same vertical space as one blue line in the absence of the ship (blue dashed lines). 
In other words, the gradient is 0.6 energy units over one length unit for the red lines, but only 0.2 energy units when 
the ship is absent. Very crudely then, the field enhancement factor, or the ratio between the gradient in the presence of 
the protrusion compared to the field when the protrusion is absent is three times greater (and gets greater as one zooms 
in to the top of the mast itself as there are field enhancements atop field enhancements, as suggested by the Schottky 
hypothesis in Section 30.4). 

The equipotential lines of Figure 30.1 seem ad hoc, but there is a mathematical model behind them. The law of super¬ 
position entails that, as formulated by Eyges ([99], p. 62), 


Charge and charge alone can produce a potential. If in some problem we pinpoint all the different sources of charge and write an expression 
for the potential produced by each source, the sum of these partial potentials is the total potential that satisfies the boundary conditions. 

A total potential is therefore the sum of potential distributions due to individual charge distributions, a simple yet flexible 
model is useful from which to make estimates. The equipotential lines of Figure 30.1 are generated using two point 
charges of opposite sign placed in a uniform electric field at the points (z,p) = (±z p , 0) (the subscript p being for "point"). 
The reason for choosing two opposite charges rather than one is to insure that all points equidistant between the charges 
are at zero potential, which fits in nicely with the concept of the presumed ground, or in this case sea, level. The two 
charges form a dipole for which the effects on a background potential die out quickly with distance, but unlike usual 
dipoles a finite distance 2 z p separates the charges themselves. In this most simple of models, the equipotential lines are 
equivalent to the contour lines of V(z, p) given by, in cylindrical coordinates, 


V(z,p) = -F 0 z- 


yjP 2 + (z + Zp) 2 yjp 2 + (Z - Z p ) 2 


(30.2) 


where the gradient between cloud and sea is represented by the first term, and the dimensionless 8 represents the 
distortion caused by the ship or, more generally, the protrusion. If the relevant length scale of the protrusion is a 
(e.g., a is the radius of the crude hemisphere that incloses the ship in Figure 30.1), and if the ratio of the energy scale 



Figure 30.1 The potential difference between adjacent lines is 0.2 units: near the ship mast, three units 
(red lines) are in the same vertical space as one (blue dashed line) unit in the absence of the ship. The 
potential gradient, that is, the electric field, therefore has increased approximately by a factor of 
0.6/0.2 = 3. 
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48 Q/a to F 0 a is denoted by 


48Q/a _ 48Q 
F 0 a F 0 a 2 


(30.3) 


then 



(30.4) 


(30.5) 


Importantly, the u function is dimensionless. This interesting equation suggests that problems which scale such that 
the product F 0 a is constant have a commonality. Electric fields are the gradient of potentials, and field enhancement is 
the ratio of an electric field in the presence of a protrusion compared to that which would exist in the absence of the 
protrusion. Defined thus, field enhancement is a dimensionless relationship that is a function of the geometry alone, and 
it has the same value for different fields and length scales so long as X is the same for both conditions. On the axis, the 
field enhancement factor p = F/F 0 is a dimensionless ratio of fields, reflecting a convention to make the terminology field 
enhancement and p factor interchangeable. Note, though, that other conventions can be found. In the extended theoretical 
treatments of field enhancement for a variety of geometrical configurations by Edgcombe et al. [442-444] and Forbes 
etal. [445], a distinction is explicitly drawn between the dimensioned factor multiplying the anode potential, as done by 
Dyke and Dolan [155], and the dimensionless factor multiplying the background field as done here, and they prefer to 
use y = F/F 0 to represent the latter. The reader should beware of both the distinction and the nomenclature. Here, 



(30.6) 


where ha is the height of the protrusion (h is dimensionless), and d z u(z, p) = 1 when <5 = 0 (the protrusion-free case). 

For metal-like protrusions, the electric fields are normal to the surface, and so the gradients from which the field 
enhancement factors are determined should be calculated in the direction of the normal as well. Later, this complication 
will be respected, but here and now it is enough to observe that for the rotationally symmetric problem, the normal at 
the apex is along z, so that h ■ VV = d z V, a simplification that is too useful to ignore. 


EXAMPLE: For the u(z, p) function shown in Figure 30.1 (where z is the vertical and the units are such that a = 1), 
find the p factor knowing that z 0 = 0.5 and <5 = 0.3. 

SOLUTION: The field enhancement factor is the value of d z u(z,p) evaluated on the axis (p = 0) at the apex (z = h). 
Therefore first find h and then evaluate d z u(z, p) at (z, p) = (h, 0). Finding h requires solving 


u(h, 0) = 0 -* -h 3 + z%h + 28z 0 = 0 


for which h = 0.792739 is the real root (alternately, using the methods of Section A3.7.2, iterating h j+l = 
(28z 0 + Zghj) 1 ^ starting with h 0 = 0 and converging to 0.792739 after eight iterations). Using this and h 2 -z 2 = 28zjh 
from u(h, 0) = 0 gives 


-d x u(z,p)\ z=hl>=Q = 1 + — = 4.32123 


The field F tip at the apex of the hemispherical shape in Figure 30.1 is therefore 4.32 times larger than the background 
field F 0 , or F tip « 432F 0 . 


Setting up and solving electrostatic problems in this manner is an example of a summation problem [99], where the 
potential energy everywhere for an electron of charge -q can be evaluated by 



(30.7) 


The form of Eq. (30.7) holds true even if the number density p e (f) is a sum over delta functions J^jXj8(f - rf) correspond¬ 
ing to point charges, a viewpoint that becomes invaluable in Section 30.3. Contrast this to the separation of variables 
approach, where a sum over elementary solutions subject to boundary conditions gives the potential, a particularly simple 
example of which is the case of the hemisphere on plane considered next. 
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30.2 Hemispheres and notional emission area 

Our life is frittered away by detail... Simplicity, simplicity, simplicity! 

- Henry David Thoreau 3 

Just as the point charges of the previous section represented a particularly simple method of introducing field enhance¬ 
ment, the simplest of the expansion methods to do the same involves a hemisphere of radius a on a plane such that far 
away from the hemisphere the field F 0 is constant and in the z direction [446]. This elegant little problem is quickly 
resolved by expressing the potential everywhere in terms of a Legendre polynomial expansion [100], as is often useful 
for problems with spherical symmetry (remember the treatment of the hydrogen atom in Section 9.1). The hemisphere 
on a plane of Figure 30.2 has rotational symmetry about the z axis, so in spherical coordinates the solution everywhere 
can be expressed in terms of the Legendre polynomials of Section A2.5 as 

V(r,9) = £ (a/ + P,(cos0) (30.8) 

Far away from the hemisphere, the potential is that associated with a constant field, and so for r -* oo, V(r,9) -» -F 0 z = 
—F 0 rcos9. As a result, all A, vanish, save for A 1 = —F 0 because all higher P l involve higher powers of cos#. But the 
hemisphere, too, is held at ground, and so at r = a the potential again vanishes. Because all A, for Z ^ 1 are zero so too 
must B, be zero for all Z ^ 1 because there is nothing to cancel them otherwise. But, at r = a, it follows that S, = —F 0 a 3 
to cancel the A 1 term, and so for a boss (another albeit less exact name for a hemisphere) for which P 1 (cos 9) = cos 9 

V boss (r,9 ) = -F„rcos0 11 - (^) 3 | (30.9) 

where terms have been gathered up. The normal to the hemisphere is radially outward (h = r), and so the field enhance¬ 
ment factor along the surface of the hemisphere is given by 

0(0) = -yd r V(r,9)\ r=a = 3 cos 9 (30.10) 

The field enhancement at the apex (9 = 0) is therefore 3. One can verify independently that the same result at the apex 
is obtained in cylindrical coordinates for which r = y/p 2 + z 2 by finding d z V and evaluating it at p = 0. But here, the point 
is more general: the field declines (as cos 9 for a hemisphere) as one moves off-axis from the apex of a rotationally symmetric shape. 
The rapidity with which the field declines is a measure of the curvature of that shape. 

The last point deserves dwelling on because of its potential for confusion (pardon the pun). Field enhancement 
is frequently and implicitly treated as being the ratio of the on axis field on the protrusion to the background field 
[286, 431, 445, 447] (often with the background field being that due to a parallel plate, for which F g = V anode /D, where 
D is the anode-cathode (AK) gap), but doing so can be possibly misleading [448], It is not simply the apex of the emitter 
which figures in field emission: the shape of the protrusion matters, and for that reason understanding that 0(9) varies 
over the emission site, or tip, has pedagogical value. 



Figure 30.2 A conducting hemisphere on a plane subject to a background field F = F 0 z. Both the hemisphere and the plane are held at ground. Red 
lines are the equipotential lines; blue dashed lines would be the equipotential lines if the hemisphere were absent. 


5 Henry David Thoreau, Walden, Or, Life in the Woods, Dover Thrift Editions. New York: Dover Publications, 1995, p. 59. 
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The shape of the protrusion matters because it affects how much of the surface is apparently emitting. Often the emis¬ 
sion area £2 is crudely estimated as the ratio of the total current I 0 from the protrusion with the apex current density J(F tip ), 
or £2 = I 0 /J{F tip ), but this overlooks the fact that the current density varies over the tip. That crude approximation can 
be put on better footing: the notional emission area ("notional" being something that only exists in theory or imagination) 
[449] should be something like 

£2„~— J(F)d£l (30.11) 

J ( F tip> Jboss 


where the integration is over the surface of the hemisphere and the subscript “n" for notional emphasizes that it is not 
the true area of emission, even though it and the true area scale the same with field. Thus, the so-called crude estimate 
and the notional area are alike. The active emitting area, though, is larger. As an estimate of it, suppose the active area 
is defined as the area from which 90% of the emission comes, and further assume that J(F) falls off as a Gaussian in the 
axial coordinate p (which is a better approximation than it appears if p = a sin 0). Then the active area is defined by an 
axial coordinate p a such that 

[~ a e.xn(-ap 2 )2np dp 

-= 0.9 (30.12) 

Jo exp(-ap 2 )2np dp 


or p a = \/ln(10)/a. It follows that £2 = ln(10)£2„ = 2.3 £2„, where £2„ = it/a in this approximation. Therefore, the active 
radius is some 52% larger than the notional radius, but both scale as 1/a. 

Armed with that understanding, revisit the hemisphere and pin down what its notional emission area is. As with p a 
above, a "notional angle" 0 n can be defined such that [346, 450] 


w)= S =(cos8)! ~' exp 


r 12 , 

= / 2n a 2 P{0) sin OdO 

B o $ 3 / 2 (1 — cos 0 )\ 


(30.13) 

(30.14) 


where the field emission current J(F) is given by Eq. (13.25) and P{6) acts as a weighting factor. Making a change of 
variables x = (1 - cos 0)/ cos 6 and letting ri = cos 0 n , then 


l-«-/ 


<i+*r 

®V2 


(30.15) 

(30.16) 


Because b is large, the approximation (1 + x) 4 v » e (4 v)x can be made for small x, and so for a hemisphere, 

ri = cos6>„ « + y (30.17) 

The area of the hemisphere is Inc?-. Therefore, letting g(F tip ) denote the fraction of the hemisphere that would emit if the 
current density across it were J(F tip ) with F tip = 3 F 0 , the notional area is then 2na 2 g(F tip ) or 

UV = f J(m))dn = 2na 2 g(F 1ip )J(F tip ) (30.18) 

g(F) = f (1 +x) 4 - v e~ bx dx« b + \_ v (30.19) 

where the behavior of g(F) is shown in Figure 30.3 for various values of <t>. For low fields and using the small angle 
approximation cos 0 « 0 2 /2, then 9 n « y/2g(F). 


EXAMPLE: Find the notional angle g = cos 0 n and the fraction of surface emitting factor g(F tip ) for a hemisphere 
using ® = 4.5 eV and a background field of 2 eV/nm. 

SOLUTION: The field at the apex is F tip = 3F 0 = 6 eV/nm. Using Table 2.3, the factor b = B 0 &/ 2 /F tip = 10.868 for 
® = 4.5 eV, and v = ( &Q/9K)\j2m/<& = 0.77281. Therefore ri = 0.92905 and 0 n = 10.9°. Lastly, gOF 0 ) = 1/i.b + 4 - 
v) = 0.070947. 
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Figure 30.3 The behavior of #(F) for a hemisphere for various work functions assuming that J(F) is given by Eq. (13.25). Because J PN is not defined past 
p = ® 2 /4Q, the lowest work function <6 = 3.0 eV line is dashed after the maximum field of F mwc = 6.25 eV/nm (F nwx = 11.11 eV/nm and 14.06 eV/nm 
for <t> = 4.0 eV and 4.5 eV, respectively, and so are not visible in the range shown). 


The prototypical manner in which current from a protrusion is wrested from the current density equations of elec¬ 
tron emission is encapsulated in Eqs (30.18) and (30.19): current density from differential emission elements has to be 
summed over the surface of the entire emitting structure. Although differing shapes will give rise to differing g(F), the 
algorithm of creating an area factor apart from the apex current density is the means by which Eq. (30.11) is realized. 

Figuring prominently - or at least not unobtrusively - in the equation for total current from the hemisphere in 
Eq. (30.18) is the radius a of the hemisphere, which will become the radius of the emission site in due course when 
quirkier shapes than hemispheres are considered. Backing out the radius of curvature from the equipotential lines of 
V(z, p) is therefore necessary. If z s (p) is the equation of an equipotential such that V(z s (p), p) = constant, then a Tay¬ 
lor expansion of V(z,p) near the apex (z = a, p = 0) can be used to find a from the behavior of V. The surface of the 
hemisphere is defined by the equation z 2 + p 2 = a 2 , or z s {p) = -\Ja 2 - p 2 , that is, z s (p) is a function of p 2 itself. Inserting 
Eq. (30.18) into the Taylor expansion 

V(z, p) = Via, 0) + {z - a) d z V(a, 0) + ^p 2 d 2 V(a, 0) (30.20) 

where V(z,p) = V(a, 0) along the surface and d p V(a, 0) vanishes at the apex of the emitter, results in the relation 
z s = a- (p 2 /2a), which is recognized as the leading order expansion of z = \J a 2 — p 2 . That relation is a direct consequence 
of d z V(a, 0) = -3 (familiar from the field enhancement calculations) and d 2 V(a, 0) = -3/a. As a result, the radius of the 
hemisphere is related to the gradients of V(z, p) evaluated at the apex of the emitter r s (0) = z 0 by 


d z V(z 0 , 0) 
djV(z o ,0) 


(30.21) 


Although this very important relationship is trivially satisfied for the hemisphere, it will be far more consequential when 
the shape of the emitter is more engaging. More importantly, it will allow I boss (F tip ) to be applicable to general emitters, 
that is, if the field enhancement and the radius of curvature of the apex of a field emitter is known, then Eq. (30.18) can 
be used to approximate the current from it as long as a suitable g{F) can be found. 

That the treatment of notional emission area has been firmly rooted in the usage of the Fowler-Nordheim equation 
for J(F) is worth keeping in mind. For cold emission, if approximately 11% of the surface field emits, then g(F) = 0.11, 
which implies 0 n « 30°. A hot hemisphere is more difficult [451]: the entire surface of the hemisphere can contribute to 
thermal (or even photo) emission, so that J GTP (F, T ) ofEq. (17.61) should be used in Eq. (30.18) under such circumstances. 
But that is for another time. 


30.3 Point charge model 

We are what we pretend to he, so we must be careful about what we pretend to 


- Kurt Vonnegut 4 


4 Kurt Vonnegut, Mother Night. New York: Dell Publishing, 1966, p. v. 
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Whatever shape the field emitters take on when pressed into the service of devices, it is not the shape of two charges in 
the configuration of a dipole, or a hemisphere on a plane. Those models are learning tools: they are analytically tractable 
so that methods can be judged against them. Although real emitters are far more devious, those methods can be brought 
to bear to reveal hidden features. The advantage of the dipole problem is that the shapes it can treat are malleable and, 
once chosen, the field enhancement factor is dictated as in Eq. (30.4) by u(z, p) regardless of F g = W a /D, so that I(V a ) 
curves can be obtained. The advantage of the hemisphere problem is that notional area factors can be readily obtained. 
The task now is to combine those advantages. To do this, pretend that the protrusion is composed of a distribution of 
point charges cleverly arranged. 

A straight up extension of the dipole problem places additional charges above and below the z = 0 plane. That is, the 
u(z, p) of Eq. (30.4) is generalized to n charges and their images. Usage of the images ("dipole model") is better for treating 
Spindt-type field emitters which have a more conical shape; using only one sign of charge ("monopole" model) results 
in emitters resembling Taylor cones [230], To keep the narrative simple, because conical field emitters for r > 0.35 are 
better described by the dipole model, and because a generalization to a line charge model for wire-like emitters follows 
naturally from the dipole model, fix attention to it, although there are some quirks associated with it for which one must 
be on guard, for example for too small a charge the equipotential surfaces begin to deform into a floating sphere model 
which looks more like a balloon tethered to the ground plane: although seemingly unphysical, it resembles the "floating 
sphere" model of field emitters [452], 

There is the matter of what a now represents. When n = 1, it was comparable to the radius of the hemisphere-like 
enclosure. To retain a as a measure of the apex radius, however, there is utility in subscripting the radius of the n = 1 
point charge, and so refer to it as a 0 . Therefore, let 



(30.22) 



(30.23) 


so that the u(z,p) of Eq. (30.4) is really u Y {z,p). There are many ways to choose the Aj and Zj, but only some will result 
in emitter shapes that bear any resemblance to conical Spindt-like field emitters [175] (or, for that matter, melted Taylor 
cones [31]). The aesthetics of their choice affects the possibility of analytical solutions and is the essence of the point 
charge model [230, 346, 439, 453, 454], 

The parameter governing the magnitude of the ;'th charge, or A jr appears to be independent of the location Zj, but 
the two are in fact intimately entwined. A thought experiment shows why. Let n point charges plus their images across 
the z = 0 plane be used to model an emitter: if the last point charge A n is placed at z n too far away from z n _ x , then its 
contribution to the zero-equipotential surface is an unphysical disembodied sphere floating above the other point charges. 
In order to get shapes that correspond to physical emitters, the Zj must be chosen strategically and the Aj then adjusted 
so that the zero equipotential mimics physical surfaces. Their joint specification can be suitably chosen by appealing to 
"self-similarity". This is accomplished in two steps: 

• First, the (j + l)th point charge location bears the same relation to the ;th charge location as the ;th charge does to the 

(j - l)th, enforced by demanding that 



(30.24) 


for all j, with ? 0 = 0, where the dimensionless r should not be confused with rr. 
• Second, all Aj are set so that z n+1 is on the zero equipotential, or 

u n(z„+ 1 » 0 ) = 0 


(30.25) 


These two requirements are sufficient to determine all Zj and Aj in terms of the number of charges n and the 
"self-similarity" factor r. Superficially, it seems that r < 1, but in fact r is not so constrained: it shall be seen that r can take 
on any value (and in fact some interesting shapes result by having it large). But it is true that for conical or ellipsoidal 
emitters nominally meant to resemble Spindt-type field emitters that taper down to a point, r < 1 is a better choice. 

The first requirement is easy: z l corresponds to the radius scale of the emitter, or z x = a 0 . For; = 2, then z 2 =z x + a 0 r = 
fl 0 (l + r). By induction, it follows that 



(30.26) 


where Eq. (A1.4) is invoked and the dimensionless term Sj(r) is introduced and defined by this relation. For convenience, 
the r argument of Sj is suppressed below. Observe that the nth charge therefore sits at the apex of the {n - l)th line, 
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Figure 30.4 Placement of the first five point charges, and the lines they create, with the lines and the point charges of the same color, for r = 0.75. The 
Hth charge sits at the apex of the tn - l)th line. The z axis is along the horizontal and the p axis is vertical, so that the figure is to scale. 


as in Figure 30.4. The second requirement is a bit more tricky. It is tantamount to requiring 


■ = 2M 



(30.27) 


If these equations are numerically solved for small r, it is rapidly found that Aj becomes exponentially small as j -*■ n, 
which unpleasantly complicates the numerical arts. Rather than endure an ugly determination of A jr it is better to coax 
out the small nature explicitly, and so let 2 ; (r) = r j Pj(r) where now P ] is to be determined and Aj calculated from it. 

But now for the artistry: a rather pleasant and straightforward substitution using Eq. (30.26) shows that the following 
relation holds 

r j S n+l _j = S„ +1 - Sj (30.28) 


which eliminates the numerically problematic part of the denominator of Eq. (30.27). Lastly, observe that n can take on 
any value, in fact S n can simply be the nth component of a larger vector S of length N (and similarly P n with P), with N 
greater than any value of n likely to be encountered in practice (N = 24 works well). This means that a matrix equation 
of the form 


y -p 

M„j 


' Vi-A +w * 


(30.29) 

(30.30) 


where 0 n j = 1 if j < n and 0 otherwise, is to be solved. Observe that y is a lower triangular matrix L, making the matrix 
equation to be solved a subset of the LU-decomposition method (where LU denotes lower triangular-upper triangular 
and refers to when y = LU) [91, 236], and therefore easily implemented using numerical packages. 

An iterative approach, however, is possible from the boundary condition equation m(z„ + 1 ,0) = 0, which can be written 




S„ +W (S,- + s„ +1 ) 


(30.31) 


Taking a cue from knowledge that the matrix solution involving y involves a lower triangular matrix, it is evident that 
solving Eq. (30.31) sequentially for P n is possible. Isolating P n gives 
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from which it is straightforward to show 

Pi 
P 2 

after which recourse to numerical evaluation of the sums for n > 2 is preferable. 

The definitions of field enhancement and apex radius for the point charge model now have to be updated. The deriva¬ 
tives are straightforward but involved, and so only the final form for the dipole case is provided (the monopole case being 
in refs [453, 454]). For field enhancement. 


-(l+r)(2 + r) 

(r 3 + 2 r 2 + 4 r + 2)(r 2 + 2r + 2) 

~>(r- 1 - lVr2 4 - r-l- 7) 


(30.33) 

(30.34) 


„ = -a 0 d z u n (x,p) I z = Zb+1 ,„ =0 
4 PjS,S n+ , 




pi P[S n+l _j(S n+l + S,)] 2 


(30.35) 


Likewise, the apex radius is 

v — 

a„ i d z u n | ^prUs^s^+s,) p 

“o ~ % d 2 p u n | z=w=0 “ 2 » r± ¥^ 3 ^,> 

p{ rV IS„ +1 _j(S„ + 1 +Sj)P 


(30.36) 


A numerical evaluation of the various terms is given in Table 30.1 (see also Section A3.25) and shown in Figure 30.5 
of the case of r = 0.75. Strikingly, even though the height of the emitter never exceeds z x = 4 a 0 (a series which pays 
homage to Eq (6.13)), the field enhancement keeps growing as the sharpness of the protrusion increases (a n decreases). 

The trend that P n exponentially increases, and a n decreases, with n is even more pronounced for smaller values of 
r, as shown in Figure 30.6, behavior that figures markedly in Section 30.4 and in studies relating electron emission to 
heating through changes in /?(r) (e.g., Figure 2 of ref. [439]). On the other hand, as r-» 1, that behavior is broken so 
that P n increases more slowly and a n actually increases itself, with n, behavior that leads to the line charge model of 
Section 30.6 (again, recall the special status of x = 1 in Eq. (6.13), alluding to the special status of r = 1 as a transition 
value). The manner in which the point charges build up - or not - reflects the geometrical structure that they are tasked 


Table 30.1 Point charge model evaluation of S n ,P n ,p n and a n for the dipole point charge model 
for the value of r = 0.75 and shown in Figure 30.5. Observe that 5 n (r) -> 1 /(I - r) as n <x>. 


n 

S„(r) 

P„(r) 

P„(') 

a„/a o 

1 

1.00000 

2.40625 

3.96970 

0.708589 

2 

1.75000 

1.72054 

4.88324 

0.629186 

3 

2.31250 

1.53834 

6.04215 

0.506569 

4 

2.73438 

1.39196 

7.40333 

0.397150 

5 

3.05078 

1.25858 

8.99928 

0.307404 

6 

3.28809 

1.13530 

10.8780 

0.236114 

7 

3.46606 

1.02189 

13.0979 

0.180418 

8 

3.59955 

0.918287 

15.7287 

0.137344 

9 

3.69966 

0.824194 

18.8528 

0.104258 

10 

3.77475 

0.739115 

22.5685 

7.89674x1 O’ 2 

11 

3.83106 

0.662430 

26.9925 

5.97072x10” 2 

12 

3.87329 

0.593466 

32.2639 

4.50808x10 -2 

13 

3.90497 

0.531542 

38.5484 

3.39983x10 -2 

14 

3.92873 

0.475995 

46.0437 

2.56161X10" 2 

15 

3.94655 

0.426207 

54.9855 

1.92855x1 O’ 2 

16 

3.95991 

0.381600 

65.6549 

1.45101 xIO- 2 

17 

3.96993 

0.341648 

78.3876 

1.09114x10~ 2 

18 

3.97745 

0.305872 

93.5839 

8.20161 xIO" 3 

19 

3.98309 

0.273840 

111.722 

6.16251X10" 3 

20 

3.98731 

0.245162 

133.372 

4.62896x10" 3 
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Figure 30.5 The parameters of Table 30.1 shown graphically for r = 0.75. The exponential behavior visible after n > 3 for P n and a n is characteristic of 
r < 1 behavior. 



Figure 30.6 Field enhancement P n (thick solid lines) and apex radius a n /a 0 (thin dashed lines) as a function of n for r = 0.25,0.50,0.75 and 1.00 (red 
circles, green squares, light blue diamonds, dark blue triangles, respectively). With the exception of the r = 1 line, all show exponential behavior. 

with imitating. But points are not the only distributions of charge that can be pressed into such service. For example, 
rings of charge of increasing diameter with differing charge can more faithfully model conical emitters [455], showing 
the power of the technique when harnessed to numerical methods to determine the optimum charge distribution more 
complex than points but which share the rotational symmetry of the protrusion. In the same spirit (although predating the 
introduction of the point charge model) modeling emitter structures by charged spheres has application to open and closed 
carbon nanotubes [456], Such analytical and semi-numerical methods allow for the evaluation of the field enhancement 
factor, particularly where numerical methods alone cannot capture the very sharp rise in field enhancement, such as 
when a very small protrusion sits atop a larger one, as considered next. 

30.4 Schottky's conjecture 

Great fleas have little fleas upon their backs to bite 'em, 

And little fleas have lesser fleas, and so ad infinitum. 

And the great fleas themselves, in turn, have greater fleas to go on, 

While these again have greater still, and greater still, and so on. 

- Augustus De Morgan 5 


5 Augustus De Morgan, A Budget of Paradoxes. Waxkeep Publishing, 2015, p. 377 (first published in 1872). De Morgan was a British mathematician 
and logician. He contributed a section on "mathematical induction" (a term he coined) to The Penny Cyclopedia, a source used by Herman Melville 
while composing Moby Dick. 
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In Uber kalte und warme Elektronenentladungen 6 [138], Schottky imagined a hemisphere much like Figure 30.2, on top of 
which another smaller hemisphere sits. Consideration of the equipotential lines led him to conclude that" ...wir erhalten 
dann eine nochmalige Verdoppelung des Potentialgradienten" 7 or, as synopsized by Stern et al. [285], 

If the emitting area lies on a second hemispherical boss lying on the first and is small compared with it, then p = 3 2 , and so on. 
[emphasis added] 

where Stern refers to the "extra factor due to surface irregularities", ox field enhancement, as p. In other words, the field 
enhancement of the top hemisphere (a factor of 3) multiplies the field enhancement of the lower hemisphere (a factor 
of 3). In short, as long as the topmost protrusion is small by comparison to the bump on which it rests, field enhancement 
factors are multiplicative, a speculation noted and sometimes dubbed Schottky's conjecture in later studies [32, 457]. Math¬ 
ematically, if p a and p b are the field enhancement factors of protrusions a and b on otherwise flat planes, where a is small 
by comparison to b, 8 then 

Ptotal = Kh (30.37) 


But why stop there? As with De Morgan's fleas, there can be bumps on bumps, and the interest is in how they stack 
up. The point charge model allows for an investigation of the effect. For larger r, understanding the behavior of P„(r) is 
required. A numerical evaluation shows that an exponential behavior of P n « P 0 e~ an (where a is defined by this relation 
and is not the fine structure constant) seen in Figure 30.5 is more generally apparent, as in Figure 30.7. Clearly, a is a 
function of r, as implied in Figure 30.8 for the combination l/(re“), a combination that will prove useful below. To see 
how this helps demonstrate Schottky's conjecture, consider the behavior of Eq. (30.35) in greater detail. 

The first bump to which all other bumps cling is characterized by n = 1: let this primal bump, as it were, be designated 
by p 0 . After explicit expansion and collection of terms, and using Eq. (30.33) to start the iteration, it is 


(30.38) 


Next, insert the approximation Pfir) = P 0 e~“ ] into Eq. (30.35) along with S n (r) from Eq. (30.26) to obtain (where the 
arguments of the Sj and Pj have been suppressed, and where the o subscript on P 0 is not the n = 0 version of P„(r) but 
rather a fitting coefficient) 


?„(r) = 1 + 4P 0 £ 


(1 - r) 2 (1 _ r » + i )(1 _ r j )e -gj 
(1 - r"+W) 2 (2 - r i - r n+i ) 2 ri 


(30.39) 



Figure 30.7 Symbols: numerical evaluation of P n (r) for various r as labeled (0.05, 0.10, 0.25, 0.50, and 0.75). Lines: least squares exponential fits for 
P„(.r) * 


6 On cold and warm electron discharges 

7 “ ...we. then obtain a further doubling of the potential gradient" 

8 Miller et al. argue that when "the half-width of the macroprotrusion is less than the height of the microprotrusion'' [457] then deviations from 
Schottky's conjecture occur, so small relates to that. 
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Such an equation is a discomforting mess and calls for approximations. As the small r is the situation of greatest interest, 
observe that the 1/r 7 term is largest for j approaching n, but under such circumstances r 7 «: 1 when n is appreciable. 9 
Also observe the combination e~ aj / r 1 that naturally occurs. Therefore, make the replacements r" +I -*■ 0, 1 - r> -* 1, and 
2 — r 7 -> 2, and make use of 

A p(r) = (30.40) 


to find 


re a(r) 

fi n (T) = 1 + P„( 1 -r) 2 X: 


A /? 7 


f (l_ r „ + W )2 

which is more accessible. The summation part can likewise be manipulated to give 

j .=Arj^- 2 =^- 2 C n ir) 

(1 - r" +l_7 ) 2 p (i - r 7 ) 2 (1 - r) 2 ” 

Now recall the fit of A/?(r) shown in Figure 30.8. For « = 1, Cj(r) = 1, whereas for n -*■ oo, then 
C M (°) = £ 2 l ^:e 2 

cji) = £r 2 = f(2) = | 


(30.42) 

(30.43) 


(30.44) 

(30.45) 


where L'Hopital's rule has been used for C^fl), and Rieman's zeta function of Eq. (A1.7) makes yet another appearance. 
The point is, approximating C n (f) as independent of n really is not a bad approximation. Supposing one is bold enough 
to do so, what happens? One finds 

f) n (r) « 1 + P 0 C(r)Ap(r) n « P 0 C(r)Ap(r) n (30.46) 


But that means 


p n (r) a A p P n _ y (r) 


9 Why the caveat about "appreciable"«? Remember, the interest is in bumps on bumps and the assumption of a perfectly flat base plane is different 
than that, therefore what is of interest is a few bumps out, say n = 3 - 4, where (0.25) 3 = 0.015625 and (0.5) 4 = 0.0625, thereby approaching 
what is meant by "negligible." Naturally, departures are expected when r approaches unity, but let's see how far the analysis can be pushed. 
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Figure 30.9 Symbols: Numerical evaluation of P n (r) (circles) using Eq. (30.35) and P (r) using Eq. (30.32). Lines: Eq. (30.48) using the parameters of 
Table 30.2. Schottky's conjecture works well throughout, although discrepancies are becoming visible by r = 0.75. The algorithms of Section A3.25 


Table 30.2 Parameters for P n (j) and p n (r) approximations. Least squares estimates of a where 
P n (r ) fa P 0 e~ an . The factors A p = 1 /(re") and p o from Eq. (30.38) for using Eq. (30.48) are also given. 


r 

<*(r) 

P 0 (r) 

mr) 

PJr) 

0.05 

2.3461 

6.2397 

1.9148 

22.5122 

0.10 

1.6892 

3.3736 

1.8466 

12.5238 

0.25 

0.8789 

1.8943 

1.6610 

6.5556 

0.50 

0.3549 

1.7033 

1.4025 

4.6000 

0.75 

0.1110 

2.1839 

1.1932 

3.9697 


is a good approximation. Setting p x {r) = p 0 of Eq. (30.38) for the primary protrusion, then 


4 +1 (r) « A p k p 0 (30.48) 

where k = n — 1 is the number of additional bumps after the primary one. The Schottky conjecture (as Stern et al. [285] 
phrased it) has now been demonstrated: each additional bump past the primary one adds another factor of Aft. But 
notice that the primary bump has a different genesis than all the other bumps, 10 a sort of nuanced modification of what 
Schottky said. 

Does Eq. (30.48) work? Actually, using the algorithms of Section A3.25 it works surprisingly well, as shown in 
Figure 30.9. 11 In fact, although departures are visible by r= 0.75, the approximation nevertheless remains good up 
to that relatively large value, although much beyond that, and certainly by the time r -*■ 1, the approximation becomes 
poor. As a result, for the point charge model Schottky's conjecture works for values of r larger than might be thought 
applicable. 


30.5 Assessment of the tip current models 

Nothing is too wonderful to be true if it be consistent with the laws of nature and in such things as these experiment is the best test of such 
consistency. 

- Michael Faraday 12 


10 The bump it sits on is perfectly flat, after all. 

11 Recall, though, from Section 30.3 that for r < 0.3 in the dipole model under strong field, the emitter shape is more like the floating sphere 
shape than a conical emitter shape; this does not change the multiplicative field enhancement, but does affect the visualization of what it 
represents. 

12 Bence Jones. The Life and Letters of Faraday (V.2). University of California Libraries, 1870. p253 (Entry: March 19, 1849). 
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The hemisphere model maintains that the current from a field emitter can be known if the field enhancement at its 
apex and its shape (or, more accurately, its radius) is known. The audacity of that proposition motivates testing how 
well such a model describes actual field emission data before embarking on models based on line charges, ellipsoids, and 
boundary element methods. Consider then the data of Figure 2 of Schwoebel etal. (ref. [458], but see also refs [459,460]), 
which subjects two micro-fabricated emitters to pulsed current processing, where the heating induces blunting of the 
emitters. That the tips of the emitters are complex (showing several crystal planes of molybdenum subject to "events" 
corresponding to migration of contaminants, and so on) is evident from field emission electron microscope (FEEM) 
images [461], What matters here, though, is that changes to the apex radius (particularly when heated) occur when 
the sharpness of the tip reflects a balance between surface tensions, which act to blunt the tip, and high fields, which 
tend to sharpen it [462]. During "field forming" the tips were conditioned (blunted by "thermally activated field assisted 
self-diffusion") to putatively bring the emitters to the same radius that balanced surface field and surface tension. The 
surface migration drove emitters to emit uniformly, desirable because emitters of the same radius and material perform 
comparably, and uniformity of emission over an array, wherein the current load is shared evenly amongst the emitters, 
contributes to operational reliability. Individual emitters of this type have, remarkably enough, generated a total current 
of up to 3 mA from a single emitter; at a more modest 300 pA/tip a triangular array of such emitters separated by 4 pm 
would achieve a current density of 2165 A/cm 2 . 

Can the current as a function of voltage for these emitters be predicted? This is a different question than projecting 
performance based on line fitting of I-V data and extracting parameters from it. The present question concerns using 
one I-V data point to find the radius for the data shown in Figure 30.10 and then using the calibrated model to predict the 
remainder of the I-V curve. The correspondence of the I-V curves with the experimental data, differing only by the value 
of the apex radius describing differing emitters, gives a measure of the model's fidelity. 

A complication is that the emitters are gated, in which a close proximity gate (a layer coplanar with the emitter tip 
having a gate hole through which the tip protrudes [15, 175, 447, 463-465], as in Figure 13.1: the emitter shown there 
is comparable to the ones generating the data of Figure 30.10) is used to generate the high apex fields using a reasonable 
and achievable gate potential. A relation between the apex field and the gate potential is therefore required. Such a 
relation is available from boundary element simulations (Section 30.8.2) and given in Eq. (30.124) (after Eq. (2) of 
ref. [466]). From it, the apex field F tip can be related to the gate voltage V g by the relation F tip = P g V g , where fi g has units 
of [1/length] and is a function of apex radius a s , gate radius a g , and cone angle 0 C . 

The theory lines of Figure 30.10 may then be determined using Eqs (30.18) and (30.19). Parenthetically, Eq. (30.19) 
should undergo modifications when the prolate spheroidal models of Section 30.7 are used, as the form of g{F) will 
alter, although the alteration is weak by comparison to the other terms of I boss when a s is changed. Calibrate the theory 
by finding the value of a s using one I-V data point, the other parameters of gate radius a g and tip cone angle 9 C being 
determined from examination of the field emitter (e.g.. Figure 13.1) or known, as with the work function <t> = 4.3 eV 
of molybdenum. Although different crystal faces of molybdenum exhibit different <t> and a tip exhibits several faces, the 



Figure 30.10 I-V data from tv 

to those shown in Figure 13.1. Symbols correspond to experimental data; lines correspond to the theoretical model of tip current. Data points digitally 
extracted from Figure 2 of P.R. Schwoebel, C.A. Spindt, and C. Holland, J. Vac. Sci. Tech., B21, 433, 2003 (ref. [458]) as in Table A3.2. 
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value of O = 4.3 eV for molybdenum is neither the lowest nor the highest of the values listed in ref. [135] and is the 
recommended value of ref. [467]. A method for determining a s from a single data point is presented in the algorithm of 
Section A3.26. The results of such an analysis are seen in Figure 30.10: the agreement is rather notable. Note that the 
agreement for the 12 line would be improved by using a larger work function <t>, as would be the case for a different 
crystal face or contaminant, but that has not been done. 

The ability to predict the remainder of the I-V curve from given parameters and the extrapolated apex radius for a 
single data point bears favorably on the usage of the field enhancement plus emission area factor method of estimating 
the current of a tip. Although p and g(F) will change, the methodology of integrating current density over differential 
surface elements is the manner by which total current from a multidimensional field emission structure can be predicted. 

The effects of field enhancement, geometric variation, and notional area can start affecting how Eq. (30.1) for I(V) 
begins to emerge from Eq. (30.18) for the current over a rounded emitter apex. Couple that with the changes wrought 
by transmission probability effects (thermal, Kemble, and so on) leading to the general thermal-field equation of Eq. 
(17.61) and it is clear that more prefactors start to aggregate on the basic form of the Fowler-Nordheim equation J fn (F) 
(or even I(V)). The prefactors are not insignificant: they result in variations of O.lx to lOx the simple form [186]. Forcing 
all such changes to be carried by field enhancement and apex radius (or energy slope factor p F ) is likely asking too much, 
so understanding the origins of the variation by recourse to simple models is very useful. 

30.6 Line charge models 

It is the grand object of all theory to make these irreducible elements as simple and as few in number as possible, without having to 
renounce the adequate representation of any empirical content whatsoever. 


- Albert Einstein 13 


Reflecting the scientific penchant for keeping matters as simple as possible - be they suppositions or models - goes 
by the name of the principle of ontological economy, or, in Latin, entia non sunt multiplicanda praeter necessitatem ("entities 
are not to be multiplied beyond necessity"), a phrasing that is erroneously referred to as Ockham's razor. 14 After a point 
of charge, the next simplest arrangement is a line of charge. But there are different kinds of lines: the simplest one 
is a line of constant charge per unit length. Such a model has much in common with the prolate spheroidal models of 
Section 30.7 in that are alternate means of modeling conical-like emitters. A modification of almost comparable simplicity 
is to let the charge density of the line be varied linearly over its length (tapered). The tapered line charge models become 
useful for modeling wire-like emitters (carbon fibers or nanotubes). Each is considered here. 

30.6.1 Simple line charge model 

A line of constant charge per unit length qX in a background field produces a potential everywhere given by 
[99, 173] 15 



(30.49) 


where Q 0 (x) is a Legendre function of the second kind as in Eq. (A2.21), and the ellipsoidal functions u and w are 
related by 


u(z, p) ± w{z, p) = \Jp 2 +(z± L) 2 
ufz, p)w(z, p) = Lz 


(30.50) 


(30.51) 


13 The Herbert Spencer lecture, delivered at Oxford, June 10, 1933, reproduced in Albert Einstein, Ideas and Opinions. New York: Crown Pub¬ 
lishers, 1954, p. 272. 

14 As Anthony Flew observes, "These actual words are not in fact to be found in the extant works of William of Ockham." Antony Flew, A 
Dictionary of Philosophy. London: Macmillan, 1979, p. 253. 

15 The notation follows ref. [173], but unfortunately some of the symbols and function names have appeared previously with different meaning. 
Cave, lector. 
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and where w(z, p) will not reappear until Section 30.6.2. As with the point charge model, at the apex of the emitter z = z 0 , 
the total potential vanishes. The potential everywhere for a line of charge in a constant background field is then 


V(z,p) = F 0 z 0 


Q 0 [L/u(z,p )] 
Q 0 [L/u(z o , 0)] 


F 0 Z 0 U c (z,p) 


(30.52) 


where U c (z,p ) (the c subscript is for "constant") is the line charge analog of u n (z, p) in the point charge model: do not 
confuse u n (z,p ) in the point charge model with the analog of u(z, p) here, as they mean very different things. Observe 
that, as with the point charge model, an overall energy scale F 0 z 0 exists, multiplied by a dimensionless function that will 
govern the nature of the field enhancement factor p, but in the place of n and r of the point charge model the simple 
line charge model is now specified by z 0 and 8 defined by the relation 


(30.53) 


where 8 acts like the fraction of the emitter above the extent of L and therefore is expected to bear a very simple relation 
to the apex radius a when <5 is small. Examples of typical choices of l/Q 0 (z o , 0) are shown in Figure 30.11. Observe that 
in terms of 8, 


The evaluation of p requires knowing d z u(z, p) evaluated at p = 0 and z = z 0 . But uiz > L, 0) = z, and so d z u(z, 0) = 1. As 
a result, for <5 1, 


F, jp 


-F 0 z 0 d z U c (z 0 , 0) = -F 0 Z 0 d z 


F ° + (z 2 - L 2 )Q 0 (L/z o ) 

* F °( 1 + 8 ]^ 2 / S )) 


Qq(L/z) \ I 
Q 0 (L/z o ))\ z=Z ' 


(30.55) 


a finding that will bear great similarity to the field enhancement for a prolate spheroidal geometry. It is often suggested 
that field enhancement scales as the inverse tip radius, and the present result shows that this is mostly true, but not 
quite: it is a little worse than inverse tip radius because of the factor ln(2/d) in the denominator. In the opposite limit, 
when 8 -*■ oo, where Q 0 (l/(1 + <5)) -*• 1/8, then p = F tip /F 0 -> 2, which is recognized as the point charge model composed 
of a single charge at the origin; a single charge has a smaller p than the hemispherical result of F tip = 3 F 0 , a result of the 
equipotential lines having a softer bend near a point charge in a background field in contrast to a hemisphere. 

The apex radius is more involved because of the need to consider d 2 u(z, p). It is wise to proceed in stages in evaluating 


d z U c (z,p) I 
d pU c (z,p)\ z=ZsP=0 


(30.56) 


The numerator is straightforward from the experience in evaluating F tjp . The denominator takes care. Using 


d z U c (z,p)\ z=Zo , p=0 

dju c (z,p) u w=0 


(Z 2 0 - L 2 )Q 0 (L/z o ) 


(Z 2 ~ L 2 ) 2 Q 0 (L/z 0 ) 


(30.57) 

(30.58) 



Figure 30.11 Equipotential surfaces of the i 


: line charge model for various 1 /<2 0 (?„, 0). The size of the view region is 2 L x 2 L. 
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Figure 30.12 Field F Up and radius a as a function of 8 = ( z 0 /L ) - 1, represented as KF^/F^ - 2] (left axis, red circles) and ( a/LS) (right axis, blue squares). 

where in the second line use has been made of the following relations, valid when p -*■ 0 and z -* z 0 after the 
differentiations: 


Assembling the pieces, and substituting 


d p u(z, p) -*■ 0 

d 2 u(z,p) -»■ Z ° 

Zo~ L 

z 0 = L(1 + S), the apex radius a is then 


_ <5(2 + 6) 

1 +.5 4 


r <5(2 + <5) F 

1+-5 


(30.59) 

(30.60) 


(30.61) 


Interestingly, a/L -*■ 28 for both 8 0 and 6 —> oo. The behavior of a/(LS) and F tij ,/F 0 as a function of 6 is shown in 

Figure 30.12. 


EXAMPLE: Although a(8) can be evaluated once <5 is specified, consider the reverse problem: find 6 given that 
a = 0.011,0.11, andL. 

SOLUTION: The iteration method of Section A3.7.2 works well. Reconfiguring Eq. (30.61), define the (n + l)th 
iteration as 

= (a\ _ (1 +S n ) 2 _ 

” +1 \l) (2 + 6„)(l+6 n + 6„(2 + 6 n )Q 0 (l/(l+6„))) 

Starting with 6 0 = 0, by n = 4 the iteration has converged to S 5 = 0.0048697,0.043843, and 0.36290 for 
a/L = 0.01,0.1, and 1.0, respectively. 


30.6.2 Tapered line charge model 

How sharper than a serpent's tooth it is... 

- William Shakespeare 16 

The line charge model of Section 30.6.1 was akin, both in methodology and in the shapes of the equipotential surfaces 
to which it gave rise, to the "monopole" point charge model. For the point charge model, the Taylor-cone-like shapes 
of the monopole model gave way to the conical emitters of the dipole model, so that the analog for the changes to 
be wrought on the line charge model are to make the charge below the boundary at z = 0 of opposite sign to the line 


16 Ref. [37]: King Lear, 1.4.193. 
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Figure 30.13 Equipotential surfaces ot the tapered line charge model for various l/(z o Q 0 (L/z o ) - L). THe size of the view region is 2 L x 2 L. 


charge above z = 0. But if the purpose is to model wire-like emitters (and it is the purpose to model such emitters), then 
something more is required: the charge density on the line charge has to be tapered, from a maximum value at the apex 
of the line charge to a vanishing value at z = 0. This will avoid spread-out and odd-shaped bodies in favor of slender and 
long equipotential surfaces like those of Figure 30.13. The cost is a minor increase in complexity. 

For a wire of charge for which the charge per unit length X -* X 0 z/L varies linearly over the length of the wire 
(a "tapered dipole" model), then the potential V(z, p) in cylindrical coordinates for a single wire is 


4QX 0 f L 

<z - p) = -rL 


y/p 2 + (z- z') 2 


(30.62) 


where now V(z a ,p) = 0 identifies the apex and defines z 0 . In terms of the u(z,p) and w(z, p) functions of Eq. (30.50), this 
integral is 

V(p, z) z zQ 0 (L/u(p, z)) - w(p, z) 

TT = ~ + zaw-l <m63) 

As with the point charge model, the factor F 0 z 0 sets the energy scale. As with the constant line charge treatment, again 
let = L(1 + S). Then 


p=yd z V(z 0 , 0) = 


5(2 + 6) [(fi+l)Q 0 ( I ^)-l] 


(30.64) 


Similarly, for the apex radius, the derivatives of Eq. (30.56) take a bit more effort, but gratifyingly, after simplification. 


a 


L5 


2 + S \ 
1+5/ 


(30.65) 


which is seen to be the first term in the constant line charge radius of Eq. (30.61). 


EXAMPLE: For the constant line charge model, p -*■ 2 as LI a -*■ 0, reflecting a single point charge. Find the 
analogous result for the tapered line charge model. 

SOLUTION: The presence of the additional (-1) in the denominator of Eq. (30.65) complicates matters for the 
5 —► oo limit, as it requires more terms to be evaluated in the Taylor expansion of Q 0 [ 1/(1 + 5)]. For large 5, using 

<2 0 [1/(1 +5)] =i-J| ln(l + 25~ l ) « S” 1 - 5~ 2 + 1<T 3 - 26~ 4 

where terms up to the fourth order in (l/<5) must be retained, then asymptotically, pm 3 + 30<5 2 -*■ 3, or the 
hemispherical result. This limit is used in the vertical axis of Figure 30.14. 


30.7 Prolate spheroidal representation 

The point is that the coordinate system should he chosen to fit the problem, to exploit any constraint or symmetry present in it. Then, 
hopefully, it will he more readily soluble than if we had forced it into a cartesian framework. Quite often "more readily soluble" will 
mean that we have a partial differential equation that can be split into separate ordinary differential equations, often in ‘standard form " 
in the new coordinate system. 


George Arfken [60], p. 72 
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Figure 30.14 Field F tip and radius a as a function of 5 = (z„/£) - 1, represented as - 3] (left axis, red circles) and ( a/LS) (right axis, blue squares) 

when considering the difference between 0 and its large 8 limit. 

Poisson's equation is easiest to treat when a coordinate system is such that separation of variables can be used, relying as 
it does on the Laplacian V 2 . There are a fair number of coordinate systems to chose from. 17 Of them, the prolate spheroidal 
coordinates are the most useful for treating emitters shaped like needles (ellipsoids of revolution) or cones (hyperbolas of 
revolution), where, because rotational symmetry is presumed for now, only two of the orthogonal coordinates matter in 
the discussion. The Laplacian and gradient operator in the prolate spheroidal system is determined in Section Al.3.1.2. 

Why pursue yet another representation of a field emitter? Simply, because in orthogonal coordinate systems the emitter 
surfaces are taken to correspond to when one of the coordinates is held constant, making finding the surface trivial. The 
field lines, in turn, are along the other coordinate and can be found with little effort. To emphasize the point, if one 
coordinate corresponds to the equipotential surface, 18 the other coordinate corresponds to the field lines and this can be extremely 
useful. Knowing the surface and the field means that an exploration of the notional emission area factor g{F) becomes far 
easier, so that an understanding of the relationship between the large-scale geometry and the fraction of the tip emitting 
can be explored. 

The advantages of the prolate spheroidal approach to field emission structures has been often considered [197, 
468-470], with various ways to represent the coordinate system; here, specifying the coordinates by angles (r\,v) is 
preferred, 19 and their relations to the cylindrical coordinates ( z,p ) are 


p = L sinh(?/) sin(b) 
z = L cosh (17) cos(h) 


(30.66) 


(30.67) 


where L is a length scale (the more general case is given by Eq. (A1.64)). Hyperbolic surfaces are defined by constant v, and 
ellipsoidal by constant q, both of which are shown in Figure 30.15. The hyperbolic representation allows for a close-spaced 
anode along z=0«» = njl, whereas the ellipsoidal representation allows putting in a constant background field F 0 , 
but this apparent freedom is rather inflexible: the hyperbolic model is exact only for a diode geometry with the anode 
a fixed distance away and related to the cone angle, whereas elliptical models are exact for an ellipsoidal emitter all by 
itself in the presence of a uniform field. The prolate spheroidal treatment therefore does not describe arrays, but it does 
reveal how the fields along the surface F(q, v), and particularly the apex field F tip , depend on geometry. 

Using the separation of variables methods that were so useful for treating the hydrogen atom of Section 9.1 can be 
used here. Poisson's equation can be recast in the form of 



(30.68) 


where Vir), v) = U(q)W(v). As the sum of the U and W terms must be 0, they must each equal the same constant but of 
different signs. Let that constant be n(n +1), where the fact that n is an integer follows from analogous considerations 


17 The second edition of Arfken [60] contains many, but later editions treat fewer. 

18 Or, more precisely, "if the surface of the conductor corresponds to holding one coordinate fixed 

19 In the cited sources, the angles are sometimes (a, 0), but as these symbols have been terribly overworked already, a small change in notation 
is deemed helpful. 
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Figure 30.15 The prolate coordinate system: grid lines are in gray. A hyperbolic surface (constant v) is blue and pointing down: its "anode" would be the 
z = 0 plane, and field lines would correspond to the solid gray lines. An ellipsoidal surface (constant»/) is red and pointing up: field lines (in the absence 
of a background field F ) would correspond to the dashed gray lines. 


that determined / in /(/ + 1) in the hydrogen atom was likewise an integer. The value of n is determined by the boundary 
conditions one wishes to impose. 


30.7.1 Hyperbolic emitter 

For a close-spaced planar anode at v = jt/ 2 and a conical emitter defined by v = v 0 , then for the n = 0 case and using 
Eq. (A1.72), 

(sin vdj + cos v d,)W 0 (p) = 0 (30.69) 


This is true for W(v) equal to a constant, but it is also true for W(v) <x Q 0 (cos v), as can be shown by substitution and using 
d v Q 0 (cos a) = -1/sino. The most general solution then is the sum of the constant solution and the solution proportional 
to Q 0 (cos v ) set to satisfy the boundary conditions that the zero equipotential is at v = v 0 , and so 

Q 0 (cosb) \ 


W 0 {v) = V„ I 1 - 


(30.70) 


Q 0 (cosv 0 )J 

The result is independent of ri as along the equipotential lines V is constant, and rt marks where along the equipotential 
line the point of interest is situated. The ^-dependence of the field along the surface arises because of the gradient operator. 
From Eq. (A1.74), 

v-F = F v (n, v) = -—' -— d o W 0 (t>) (30.71) 

L(sinh »7 + sinm) 

Recall that, as per the methods of Section 30.2, the apex field F tip = F( 0,v o ) is given by 


yj sm 2 v 0 + sinh 2 »7 
V„ 

F ti = ---— 

LQoCcosUjfsin^ v 0 

Similarly, Eq 30.56 in prolate spheroidal coordinates gives the apex radius 


where the tip-to-anode distance is D = L cos v 0 . In the limit that a <K D, or tan 2 o 0 «: 1, make use of 


Qofcosu,,) = - In 


1 / (1 + secw „) 2 \ 
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Figure 30.16 The factor F Up D/V 0 for the hyperbolic diode configuration, where D is the anode-cathode separation (tip-to-anode) and V 0 is the anode 
potential. o 0 is the cone angle in degrees. Symbols use Eq. (30.73) and the red line corresponds to the asymptotic limit of Eq. (30.76). 


and cosv 0 « 1 to find [197, 470, 471] 




2V 0 

a ln(4Z)a) 


(30.76) 


the performance of which is shown in Figure 30.16. The field enhancement factor is therefore comparable to the 
commonly used form of p = l/ka, where k is called the macroscopic component of p [472], with macroscopic being with 
reference to a by the ratio D/a in the logarithm term: macroscopic features do not affect field enhancement with anywhere 
near the impact that tip radius does [473], a consequence of the equipotential lines being more dependent on the charge 
distributions closest to them compared to the ones further away. 

The impact of field variation as a function of rj now allows the notional emission area factor g(F) to be evaluated for 
the hyperbolic diode emitter. Compared to Eq. (30.19), more work is required: the current density v 0 )) over a 
ribbon of area dA must be integrated over the surface. For the hemisphere, dA = 2nr 2 sin Odd, a well-known result, but a 
generalization to it is needed for surfaces that are rotationally symmetric but for which all points on the surface are not 
equidistant from a center (hence rd6 is no longer the width of the ribbon making dA, its length being 2nrs\n6 - recall 
Figure 21.3 in Section 21.2). The problem is to obtain the area of a ribbon along the edge of the disk, such that the 
width of the ribbon is at an angle to the plane of the disk. If the disk is of radius p and has a width dl, then dA = Inpdl. 
The element dl, in turn, is the hypotenuse of the amount the ribbon changes in the vertical dz and radial dp, or 


dl = y/dz 2 + dp 2 


(30.77) 


dA = 2izp ( ) dr\ = 2 kL 2 sin P sinh r\yj sin 2 o„ + sinh 2 >7 dr\ 

\dn) v 

The integration of current density over this differential element gives the hyperbolic diode g(F) factor as 

0 yj l +x(x+ 2)s\n 2 v g 

where F tjp = F(0,v o ) and (1 + x) 2 = 1 + (sinh r\j sin v 0 ) 2 . For small x, 


am * 


(30.78) 


(30.79) 

(30.80) 


(30.81) 


is a reasonably good approximation. Compared to the curvature of a hemisphere, the curvature of the hyperbolic cone 
is more gentle and this is reflected in the 4 in the denominator of Eq. (30.19) being replaced by 4 -*• sin 2 o„. This is not a 
huge change when compared to the magnitude of b = B 0 /F tip , but it is non-trivial, and it shows the extent to which the 
macro-scale geometric features intrude upon emission at the apex. An ellipsoid has a sharper apex than a hyperboloid, 
but not so sharp as the hemispherical boss, creating the expectation that its area factor must lie between these cases. 
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30.7.2 Ellipsoidal emitter 

For a conical emitter in a background field F 0 (like the hemispherical boss or the constant line charge density model), 
such that the ellipsoid is defined by r\ = r\ 0 , and asymptotically, the potential is that of a constant background field, or 


lim V(rj, a) = —F 0 Z = —F 0 L cosh rj cos a 

For Poisson's equation V 2 V = 0, use separation of variables and let V(ri,v) = F 0 L U(rf)W(p) for which 

I ( 1* (sinh r\ = (sina —)) 

U \ smh ri dr/ \ dq ) J FF l sin a dv V dv ) f 

From the asymptotic behavior, W = cos a, so that 

-— ( -J— -f- (sina cosa) } = 2 = n(n + 1) 

cos a L stn a da V dv ) ) 

or n = 1. As a result, U(r\) is expected to be of the form 


(30.82) 


(30.83) 


(30.84) 


U(r\) = - cosh >7 + (cosh > 7 ) 


(30.85) 


where Q t (x) is defined in Eq. (A2.22) but for x > 1, or 


, , cosh n , 

Qj (cosh 17 ) = — ^ l n 


cosh ri + 1 
cosh 17 — 1 


(30.86) 


and Ay is determined by the boundary condition that 
This amounts to 

U(r ]) = - cosh »7 


on the surface of the ellipsoid (17 = ri 0 ), the potential vanishes. 


r cosht 7 0 Ch (cosh 17 ) 1 
[ cosh >7 Qifcosh^)] 


(30.87) 


and so, just as with the methods of Section 30.2 (compare Eq. (30.9)) 


V(ri, a) = —F a L cosh r/ 0 cos a 


cosh ri 
cosh n 0 


Q^coshi?) \ 
Qy (cosh rj 0 )) 


(30.88) 


This is more than coincidence: a sphere is an ellipsoid for which the major and minor axes are equal. Therefore, 
the ellipsoid result should morph into the hemispherical result under the right conditions: when 17 becomes large, then 
cosh »7 « sinh >7 « e’ 1 /2, so that z = r cosa and p = rsina if r = Le n /2. Thus, consider the large >7 limit of Eq. (30.88) using 
Q^x) « l/3x 2 , so that 


V(t 7 » 1, a) = —F 0 L cosh 17 cos a + F 0 L cos a 

= f,rcos»[l-(2)’] 


cosh 3 r\ 0 
cosh 2 17 


(30.89) 


if a = L cosh r) 0 . As a result, the hemispherical model arises from the large >7 limit of the prolate spheroidal (n= 1 ellip¬ 
soidal) model. 

Flush with the success of having treated the hyperboloid, the ellipsoid evaluations are analogous. The field along the 
surface is found to be, again using Eq. (A1.74), 


sin 2 a + sinh 2 > 7 ,, 


Q,(cosh r] 0 )sinh 2 r] g 


(30.90) 

(30.91) 


The apex radius can be found as before, but consider another approach. The apex of the emitter is at z 0 s Lcoshx?,,. 
From the definitions of z and p in Eqs (30.66) and (30.67), it follows that along the surface defined by z = z s (p) where 


L 2 s\nh 2 r} 0 L 2 cosh 2 rj 0 


(30.92) 
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Solving for zfp) and demanding that it correspond to the parabolic relation z s (p) « z 0 - (p 2 /a) in the small p limit, it is 
found 


sinh 2 ^ 
cosh n 0 


z 0 tanh 2 r] 0 


(30.93) 


As a final exercise, consider a characterization of the apex field F tip in terms of the dimensions of the ellipsoid itself. 
Were it a circle, its base radius and height would be the same, but for an ellipsoid the radius of the major radius (or height 
of the ellipsoid) to the minor radius (or radius of the base), which shall be called R, is given by 


L cosh >7 cosO 

-— = coth r\ 

L sinh r\ sin - 


(30.94) 


Now consider the needle-like (R-* oo) and sphere-like (R —> 1) limits. Expressing F tip in terms of R results ii 
15F 0 


F ttp (R -> D = 
F tip {R » 1) = 


7-2 R 2 

2R 2 — 3 = 2z„ -3 g 

2(ln(2i?) - 1) 0 a(ln(4z 0 /a) - 2) 1 


(30.95) 

(30.96) 


Whereas the R -*■ 1 result reproduces the hemispherical result of F tip /F 0 = 3, the R -* oo result of /? « 2z 0 /a ln(4 z 0 /a) 
when a<zcz 0 mimics the hyperbolic relation of Eq. (30.76) but with z 0 taking the place of D, a result that might have 
been intuited if one observes that the distance over which the equipotentials take to be free of the bending caused by 
the protrusion is on the order of the protrusion's height, as in Figure 30.13. Under such conditions, the first reasonably 
flat equipotential functions much like a virtual anode and is easy to view as such. 

Finding the area factor proceeds as before. For the ellipsoid, the area element is 


dA = 2 np dv = 2 kL 2 sin f) sinh rp\J sin 2 a + sinh 2 i / 0 dv 


(30.97) 


which, compared to the hyperboloid dA of Eq. (30.78), differs only by the exchange of dr) with dv and the specification 
of which angular term is the fixed one. Thus, the ellipsoid in a background field g(F) factor is 20 


— J J[F{n,v 0 ). 


] °r 


[l + (l+x) 2 sinh \] 2 

» / (1 +xy- 1 e~ ix dxK , 1 

Jo b — v+1 


where (1 +x ) 2 = (sinh 2 u + sin 2 j/„)/(cos 2 wsinh 2 i/ ( ,) and, as before, b = 6 0 O 3 /, 2 /F ftp . As anticipated, the ellipsoid g e (F ) is 
between the hemispherical g{F) and the hyperbolic g h (F). 

The hemispherical, hyperbolic, and ellipsoidal models therefore reveal that the parameter on which tip current most 
depends is the apex radius due to the field enhancement factor being inversely proportional to it. But conditions fur¬ 
ther away from the apex, namely, the shape of the emitter, make themselves known through a logarithmic factor in 
the denominator of the f) factor, and in a small change to the notional emission area factor g(F). A comparison of the 
hemispherical, hyperbolic, and ellipsoidal g{F) factors is shown in Figure 30.17. 


30.8 A hybrid analytic-numerical model 

One ring to rule them all, one ring to find them, 

One ring to bring them all and in the darkness bind them. 

- J.R.R. Tolkien 21 


20 The upper limit corresponds to t> = n/2, but since the dummy variable x has a factor of cos v in its denominator, its upper limit is oo. 
21 J.R.R. Tolkien, The Fellowship of the Ring, The Lord of the Rings. Boston: Houghton Mifflin, 1987. 
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Figure 30.17 Comparison of the notional emission area factors g(P) for the hemispherical (blue triangles, Eq. (30.19)), ellisoidal (green diamonds, 

Eq. (30.98)) and hyperbolic (red circles, Eq. (30.81) for a cone angle of 15°) emitters. The work function is <6 = 4.5 eV, v = 0.77281, and b = B 0 ® 3/2 /F. 


Even with field enhancements of f> « 500, to get to the ~ 5 GV/m (F = 5 eV/nm) fields required for field emission 
from refractory metal and semiconductor field emission arrays one still must start with background fields on the order 
of 50 MV/m, and that is difficult. By way of comparison, dc photoinjectors demand very high fields on the surface of a 
photocathode with values comparable to 10 MV/m. Alternately, the accelerating grids above the surface of thermionic 
dispenser cathodes produce surface fields less than that: an accelerating grid 1 cm from the surface of a cathode producing 
2.334 A/cm 2 current under space-charge limited conditions requires a grid potential of 10 kV, implying a field of 1 MV/m. 
Macroscopic fields or distant anodes are not the way to achieve the requisite high fields, as they bring with them the 
problem of high potentials. 22 

What is needed is to bring the effects of the anode closer to the emitter tip. This can be done by excising a hole in the 
sheet constituting the anode and bringing it down to the plane of the emitter: electrons shot through such a hole will have 
to be collected by another conducting surface further away, and that surface becomes the true anode. In such a way, the 
potential of the conducting sheet with the hole in it can be held in the hundreds of volts and still give rise to apex fields 
that cause field emission. Because such structures can be switched quickly, they can be used to turn on (generate sufficient 
fields to cause emission) or off (reduced in potential such that sufficient fields do not exist to cause emission) the emitter, 
and so they are called gates. In their physical manifestation, therefore, field emitters sit in small wells with their pointy ends 
approximately coplanar with the gate plane and centered in the gate holes, just as shown in Figure 13.1. In the spirit of 
the point and line charge models of Sections 30.3 and 30.6, the simplest conceivable representation of a gated field emitter 
is therefore a charged sphere in a background field F 0 surrounded by a ring of charge representing the gate, such a ring 
ruling whether or not emission occurs, as it were [474]. A floating sphere, gated or not, is far from a good representation 
of a field emitter plus extraction grid, but what it does is point the way to an analytic model parameterized by numerical 
simulations. 


30.8.1 Analytic: the saturn model 

In memory of the Golden Age, when (Saturn) reigned in Italy, the great feast of the Saturnalia was held every year during the winter. 
The idea of it was that the Golden Age returned to the earth during the days it lasted. No war could he then declared; slaves and masters 
ate at the same table; executions were postponed; it was a season for giving presents; it kept alive in men's minds the idea of equality, of 
a time when all were on the same level. 

- Edith Hamilton 23 


22 To get a feel, the huge MIT Van de Graaff generator, operational around 1933 but now housed in Boston's Museum of Science, was capable 
of reaching a potential of 5 x 10 6 V, so that a cathode would have to be placed 10 cm from it to create a field of 50 MV/m, under vacuum 
conditions. 

23 Edith HavmkonMythology. Boston: Little, Brown and Company, 1942, p. 45. Although the Romans sought to equate their gods with the 
Grecian deities by identifying Jupiter with Zeus and Saturn with Chronos (Zeu's father), that was imperfect: Saturn was an agriculture deity; 
Chronos ate his children. 
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Figure 30.18 The tip is represented by a sphere of radius a s and the gate by a ring of charge of 
radius a r . t is normal to the plane of the ring and in the direction of F 0 . The distance between the 
center of the sphere and the center of the ring is z . The gate potential is defined at V(z , a ) = V . 
The observation point is a distance r from the center of the sphere, and a distance r 1 from the 
center of the ring. The polar angle 9 is defined by z ■ r = r cos 9, and the angle a by z ■ ? = r' cos a. 
For purposes of illustration, the sphere representing the tip is grotesquely enlarged. 


If the simplest emitter shape is a sphere, then the simplest gated emitter is a sphere with a ring about it: as in 
Figure 30.18, let the sphere be of radius a s and the ring of charge of radius a r , with r measuring the distance between an 
observation point r and the center of the sphere, and r' the distance between the observation point and the gate ring, 
with the z axis normal to the plane of the gate ring. Such a Saturn model is a simple and convenient representation of a 
gated field emitter [475, 476], but unlike the planet that lent its name, the plane of the gate ring need not pass through 
the center of the sphere, that is, the origin of the sphere and the origin of the ring need not coincide. Rather, the top of 
the planet and the ring plane are, as it were, on the same level. 

Define the distance between the ring and sphere centers as z g such that r - r' = z g z, where z is in the direction of both 
the background field F 0 = F 0 z and the normal to the plane of the ring. Let 9 be the angle r makes with z, and likewise let 
a be the angle f' makes with z. The gate potential V g is not defined at the ring location, but rather a small distance from 
it, comparable to, say, half the gate film thickness (on the order of a pm). If a r is the radius of the ring representing the 
gate, then V g = V(z g , a g ), where V(z, p) is the total potential energy. 

The potential energy V(z, p) is then the sum of three contributions: 24 that due to the background field V 0 = —F 0 z, that 
due to the ring of charge V r , and that due to the sphere V s , or 


V 0 = -F 0 z= -F 0 r cos 9 

f= _Sl r^L_ 

r 2na r J 0 \f-?\ 
V s = £ A l r l P,(cos9) 


(30.99) 

(30.100) 

(30.101) 


where dl is a differential length along the gate ring and r g =z g + a 2 (mind the subscripts) and where if n r = q r jq with q r 
being the total charge on the gate ring, then Q g = 4Qn r = X/4ne 0 in SI units, with X = q r /2na r being a charge per unit 
length. Usage of Q g is easier, though. 

Consider the gate ring first. If $ is the azimuthal angle, then dl = a r d<p is the differential element, and so [197, 475] 


V r {r,9) 


=^r 

2 x J 0 




. 2 % K(P) 


(30.102) 


(30.103) 


24 Observe that the co-mingling of spherical and cylindrical coordinates is done depending on which is most convenient at the time: (r, 9) with 
a are the spherical coordinates, and (z, p) with a and a r are the cylindrical ones. 
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where p = \jdpa r jy, y = \{p + a T ) 2 + (z - z g ) 2 \ 1 ^ 2 , and the function K(m) is a complete elliptical integral of the first kind [60] 
defined by 

K(m) = f — (30.104) 

Jo y/l - m 2 sm 2 4> 

Take note of the limits of the integral: apart from a factor of 2 because the integrand is symmetrical, in going from 
Eq. (30.102) to (30.103) a half-angle formula of the form cos(2</>) = 1 - 2sin 2 $ has been used. Because elliptical integrals 
can be expanded in a series involving Legendre polynomials, V r can be rewritten as [99] 


2 K(p) 
ji y 




^ P ; ( cos a)Pfi cos 0) 


(30.105) 


where cos a = z g /r g = z g /-Jz 2 + a 2 . The value of Q g is found by demanding that the potential be V g at a distance t = a r — a g 
from the gate ring (near the gate ring, the equipotential lines, to a good approximation, are toroidal): 


(DKKlP-ftM 1) 


where r' = Jz 2 + r 2 . Assuming that a s «: a., then to leading order. 




[ nr 


(30.106) 


(30.107) 


where the second form is for when the apex of the emitter is approximately coplanar with the gate (z g is small) in addition 
to small apex radius a s . Finally, the charge on the sphere is given by A 0 , or Q s = Q g {a s /r g ), a result that is reminiscent of 
a charge outside a conducting sphere [99]. 

For the determination of F tip = ji g V g , concentrate on the surface of the sphere, for which V g +V r + V s = 0. This require¬ 
ment specifies the A, to be 


A, = 


y-a'fi 1 P ,(cos a) - P 0 a] § l t 


(30.108) 


and so the field along the surface F(6) is 


F(0) = —3F cos6 -- Y(2/+l)[ - 

a s^a i-r\ V 


Pf cos a)Pj( cos 6) 


(30.109) 


which suggests that to leading order, F tip « Q g /(a s r g ). Ferreting out higher order terms requires an approximation to K(p) 
in Eq. (30.107). To leading order, [477] 

Km * i ta (l^) + (1 - m2) [i to (l^) - j] <30 ' UO » 

for which the first term dominates as m -* 1. When z g « 0, it follows that y = a g + a r and p = - s jAa g a r / (a g + a r ). Letting 
t = a r - a , then [475] 


(30.111) 


There are similarities and differences with the other analytic forms for the prolate spheroidal models of Eqs (30.76) and 
(30.96), where in the logarithm term, a g has usurped the role of D and z 0 and t the role of a s . This suggests that in a "hybrid" 
formula to treat physical gated conical emitters, the logarithm term might morph into something like ln(k a g /a s ), where 
finding the term k = k(a s , a g ) requires more than the analytical models brought forth so far to treat emitters (ref. [476] 
contains a discussion of such log-terms, in addition to an unusual model of a gated field emitter: the "bowling pin" 
model). Finding k requires numerical input. 
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30.8.2 Numerical: boundary element methods 

The boundary element method has many attractive features which make it especially well suited to numerical solution of these potential 
field problems 25 when compared to other techniques.. .such as finite differences or finite elements... Other features include high accuracy, 
the ability to handle an infinite volume without inversion, easier incorporation of meshes which vary dramatically in step size, simpler 
mesh generation, and change of design with partial rather than complete remeshing. 

- Robert L. Hartman, William A. Mackie, and Paul R. Davis [478] 

The actual characterization of field emitter performance in an array in order judge their utility for devices which seek to 
use them [166, 173, 464, 466, 479-483] must take into account the actual physical geometry of the emitter and where it 
sits in relation to the gates or extraction grids that are used to generate the fields at the apex of the emitters. Although the 
simple models are intuitively appealing, the locations and potentials of close-proximity electrodes shifts the dependence 
of emitted current to the known and adjustable parameters of gate radii, cone angles, gate and anode potentials, and 
base-to-gate thicknesses of dielectrics to understand the spread of the beam [484], Finite difference and finite element 
analyses, often the workhorses of such simulations, are profoundly undercut by the orders of magnitude differences in size 
between the emission sites and the micron-scale geometry of the gates and tips. A methodology that avoids fine discretiza¬ 
tion where it is not really needed and concentrates it where it is (right near the emission site) for rotationally symmetric 26 
field emission sources is the boundary element method [475, 478, 485-488]. The boundary element method is perfectly 
capable of calculating the emission current by itself without recourse to the analytic models of previous sections, but the 
development of a semi-analytic model (in which the behavior of the field enhancement term, in particular the k(a s , a g ) 
term suggested after Eq. (30.111), is inferred) is useful: numerical solutions provide answers, but analytic ones provide 
insight. 

If points, lines, and rings can replace the influence of the gate and model the conical emitter, then a ribbon to model 
the gate might be better, and going further, if a ribbon of charge can model the edge of a conducting surface then ribbons 
of charge can model every conducting surface: once the boundary conditions are matched by the placement of charge, then the 
solution V(z, p) in the interior is a matter of numerically solving Poisson's equation and the problem is finished (unless 
space charge comes into play [485]). The trick, as it were, is then knowing how to assign the charge on every ribbon that 
represents a conducting surface. 

The boundary element technique discretizes the boundary into sections. In a rotationally symmetric model, these 
sections are annular ribbons that can be taken quite small in the vicinity of edges or corners or - most impor¬ 
tantly - nanoscale protrusions where field emission occurs. Where the potential does not vary markedly, then the width 
of the annular ribbons can become quite large. The factors that determine the size of the annular ribbons are tip radius, 
tip height, gate hole radius, base-gate distance, gate-anode distance, gate thickness, and shape of the emitter, such as 
those suggested in Figure 30.19. Unlike finite element or finite difference techniques, the boundary element method 
does not require gridding or discretization in the interior region away from the conducting surfaces. 




i 

T 


5 


Figure 30.19 Parameters important for the simulation of a field emitter using boundary element methods: a s = apex radius, a g = gate radius, 
d t3 = base-to-gate separation, 0 C = cone angle, = gate thickness. 


25 The authors refer to emission from areas vastly smaller than the size of the simulation region. 

26 In principle, the method can be generalized to three dimensions, but in practice the size of the matrices required for inversion explodes and 
become far more difficult for inversion. 


www.ebook3000.com 











Chapter 30 


The starting point is rewriting Eq. (30.7) for general number density distributions p e (r') to more specific ribbons with 
sheet charge density o-(r'), or 

V(r) = 4(2 [ dQ. (30.112) 

Jn\f-r'\ 


where dQ. is the differential surface element and a is the number of charges per unit area. Represent the boundaries as 
ribbons of charge, each with a constant surface charge density [489] (a faster approach than assuming the charge density 
varies linearly across the ribbon, which, although more accurate, is less important if the ribbons are sufficiently thin). 
This causes Eq. (30.112) to become 


V(r) = 4Q Yj 



(30.113) 


for which the Saturn model has been good preparation. The various components are defined by 


= Zj+l-Zj 

(30J 

114) 

’ - p j+l - pj 



p=^f- 

r 

(30J 

115) 

y = {(p + p') 2 + (z-z') 2 } 1/2 

(30.] 

116) 


the potential is specified at the middle of each ribbon, or 


V(z,p)^V } rnV(z m , 2 ,p j+m ) 


(30.117) 


where Zj +x / 2 = (Zj + Zj +i )/2 and p ;+1 / 2 = (p ; + p j+1 )/2. The values of V } are known: they are simply given by the potentials 
of the various conducting surfaces (emitter cone, gate plane, or anode plane). What is unknown are the values of the 
charge densities Cj = <r(Zj +1 / 2 i Pj+i/f)- But observe that with the introduction of a matrix defined by [197, 487] 


[Ufj 




(30.118) 


where p t and Zj are buried in the definition of p and y, then the boundary element method amounts to a solution of the 
matrix equation 

V = M -a -»• a = y 1 • V (30.119) 


The inversion of the matrix equation therefore solves for the charge densities, and once those are known the fields along 
the ribbons are quickly deduced by 

t) = —Oj = HmQoj (30.120) 

There is only one small wrinkle: Eq. (30.118) is easy to numerically calculate as long as (z i+ i /2 , p i+l / 2 ) does not lie in the 
center of one of the ribbons but that happens every time i = j, and when it does, the integral is hard to evaluate because 
K(p) contains a logarithmic singularity of the form (1/2) ln( 16/(1 - p 2 )) as p -* 1. Fortunately, the singular portion can 
be integrated to give 


■ m? { l96f2 - (sWl [ 3 ln (J=) + '] + I9V } 


(30.121) 
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where for this equation 





z 7 — ?i+ 1/2 + s ip' ~ Pi+1/2 ) 

y/4P , Pi+ 1/2 
Y 

r = iip' + a + 1/2 ) 2 + (z 7 - z i+1/2 ) 2 ] 

With the singular part of the integral removed, the remainder integral can be readily evaluated using numerical integra¬ 
tion means such as those of Section Al.2.3. 


30.8.3 Semi-analytic model 

These two efforts always go together, as they mutually act and react on each other; they are integral parts of one and the same view, and 
we have only to ascertain what effect is produced when one or the other has the predominance. 

- Carl von Clausewitz 27 

Numerical and analytic techniques both have a role to play, but they need not be kept distinct, and indeed, joining 
their virtues provides a model with advantages over either. The boundary element method provides a full solution to the 
problem of field emission from a gated emitter by evaluating the field on each annular ribbon specifying the conducting 
surfaces, and in particular those ribbons constituting the apex of the field emitter. There is an artistry to the selection of 
the ribbon widths: they should be very narrow in regions of high curvature (such as near the apex of the emitter), but 
can be rather wide in regions of low curvature (such as along the shank of the ellipsoid, or the flat regions of the gate 
and the anode). Not many annular ribbons are required, so that the inversion of Eq. (30.119) is computationally rapid. 

Still, having to recalculate the ribbon charges whenever some small change is made is time-consuming, and doing 
so does not allow for fast evaluations over a range of gate potentials and anode fields. It would be convenient, not to 
mention enlightening, to have an analtyical model of what F tip = ft V g is. One such method to provide that relation makes 
use of an approximation to the behavior of the potential V(z,p) on axis (p = 0), with z = z- a s measured from the apex 
of the emitter (and z from the center of the sphere representing the apex). For small z, V(z, 0) « F tip z, whereas for large z, 
V(z, 0) « F 0 z, where F 0 is the field between gate and anode. If D » a g is the anode-gate separation then F 0 = (V a - V g )/D, 
and F 0 a s is negligible. An expression which captures this behavior is 



where the first fraction captures the small z behavior (V g » F tip z), and the term in parentheses the large z behavior 
(V g «c F 0 z). Although this "derivation," such as it is, lacks rigor, it is based on the actual variation compared to boundary 
element evaluations and therefore captures reasonably well the values of V(z, 0) and its derivatives at z = 0 and z = D. 
Moreover, the hybrid approach it enables gives a reasonable account of the behavior of a single Spindt-type gated emitter 
[490,491], Therefore, return to Eq. (30.111) and the discussion following it to investigate how they can be bent to present 
needs. 

The first step will be to replace V g in Eq. (30.111) with V(z g , 0), where z g is the height above the emitter where 
V(z g , 0) « V g . This substitution indicates 

*z.V. 

1 = -—-— (30.123) 

d s {V + FupZg ln(k —)) 


27 Carl von Clausewitz (trans. J.J. Graham), On War, new and revised edition, with introduction and notes by Colonel F.N. Maude. London: 
Kegan Paul, Trench, Tribner & Co., 1908, Chapter II: End and Means in War, https://www.gutenberg.org/catalog/. 
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From the hyperbolic model, Eq. (30.74) with D 
ing it and solving for F tip , 

F ti « 


-* z g suggests that z g = ajtan 2 v 0 , where 


^ In (ka g /a s ) 



the the cone half-angle. Insert - 
(30.124) 


The final component is the approximation for k, for which there is no analytical model. Therefore, boundary element 
simulations for several gate geometries and cone angles modeled after SRI-like and MIT-like field emitter arrays, 28 as 
shown in Figure 30.20, are used to develop an approximation to k(a s , a g ). The geometrical factors that were used in 
the boundary element method simulations are summarized in Table 30.3. These simulations gave rise to the best fit 
representation 

kfts ~ ( 86+ — ) coto„ (30.125) 




and the results of the simulations are shown in Figure 30.20. 

Modeling current from a tip is one thing, but describing the distribution of charge that is emitted from a single 
Spint-type emitter is a more demanding test of the theory. To that end, the theoretical model (a semi-analytic model 
with parameters determined from boundary element simulations and a trajectory analysis) was used to analyze the 
spread of a beam of electrons from a single Spindt-type field emitter as experimentally measured using a microfabricated 
detector system with a Faraday cup anode. The Faraday cup collected current as a function of radial distance from the 



Figure 30.20 Evaluation of k in Eq. (30.124) by boundary element simulations (symbols) for field emitter arrays having the geometries of those 
fabricated by SRI and MIT, and k{.a g ,a s ) as given by Eq. (30.125). Based on Figure 1 of ref. [466], 


Table 30.3 Values of gated field emitter parameters as used by boundary element simulations to 
determine k (Eq. (30.125)) for MIT and SRI parameters. The resulting k values are shown in 
Figure 30.20. 


Parameter 

Symbol 

Unit 

SRI-like 

MIT-like 

Base-to-gate 

% 

pm 

1.00 

0.19 

Gate thickness 


|im 

0.25 

0.05 

Cone angle 

v a 

Degrees 

15 

23 

Gate radii 

a g 

|im 

0.2, 0.25 

0.08, 0.09, 0.10, 0.12, 0.16 

Tip radii 

3 s 

nm 

4, 5, 6, 7 

4, 5, 6 7 


28 The physical arrays were developed as part of programs to develop inductive output amplifiers (IOAs) operating at 10 GHz, specifically the 
ARPA/NRL klystrode and the NRL twystrode (see ref. [492] for a description). Images of the arrays and their characteristics are given in ref. 
[480]. A 100 W cold cathode TWT was demonstrated by Whaley etal. [483] in 2009. 
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pW 

Figure 30.21 Comparison of experimental measurements of emission from a single Spindt-type emitter (SRI geometry) using a microfabricated detector 
to a theoretical model plus boundary element simulations. Numbers refer to the gate-anode separation in microns. Based on Figure 8 of ref. [490]. 


axis of symmetry: the microdetector was capable of moving in steps of 3-4 nm [490, 493] and was therefore capable of 
resolving the spread in the beam. Although the theoretical model used in the analysis presumed a constant k value, the 
agreement with the experimental characterization of the Gaussian-like spread of the beam was quite good, as seen in 
Figure 30.21. The varying k(a s ,a g ) model was subsequently developed and successfully used to investigate space charge 
effects from an array of emitters [491], 


30.9 Shielding 

To be in company, even with the best, is soon wearisome and dissipating. 

- Henry David Thoreau 29 

Take two ellipsoids like the "0.2" case shown in Figure 30.13 and start bringing them together as in Figure 30.22. 
As they approach each other, the equipotential lines between them will start deforming in that the equipotential lines 
between the ellipsoids rise. Additionally, the number of lines compressed into the small space above the emitters declines, 
that is, the field reduces. As a consequence, the presence of a nearby emitter causes the p factor of either emitter to become 
smaller. This effect, known as screening or shielding, is well-known, particularly for wire-like emitters such as carbon 
nanotubes [494-497] and carbon fibers [498]. Shielding is even more pronounced in arrays, particularly for ungated 
emitters. 

The modeling of wire-like emitters in particular suggests a parameterized form of accounting for the effects of screening 
given by 

= 1 - exp [-a(b/h)\ (30.126) 

Po 

where p a is the field enhancement for a single emitter, and p is the enhancement factor of a line of emitters of height h 
and emitter-to-emitter separation b, where a is a fit parameter suggested to be a k, 2.3172 [499, 500] as in Figure 30.23. 
Extending the analysis from a one-dimensional line of wires to two-dimensional arrays (where the emitters in the unit 
cell of an array were on the corners of either a square or an equilateral triangle) using the line charge model [501] tended 
to confirm Eq. (30.126) but generalized it to become 

— = 1 - exp [~a(b/h) c ] (30.127) 

Po 

where a and c are found by fitting (a similar formulation was found by Read and Bowling [496]). Applying Eq. (30.127) 
to the data digitally extracted from ref. [499] would give a « 2.3668 and c ta 1.176 for h = 2 pm. 


29 Henry David Thoreau, Walden and Other Writings, The Modem Library Classics. New York: Modern Library, 2000, p. 128. 
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Figure 30.23 Simulations of the ratio of fi for a line of emitters separated by a distance b to an isolated emitter fi 0 , compared to Eq. (30.126) using the 
parameters a = 2.3172 and h = 2 pm, based on Figure 4a of Bonard et al. [499]. 

The tapered line charge model of Section 30.6.2 makes for a straightforward analysis of the effect when applied to 
idealized geometries [501-504], Let an infinite array of emitters be characterized by their unit cells, or the smallest 
repeating unit of the array. Two types in particular that reflect how actual arrays are fabricated are the square unit cell 
(where an emitter is situated at each corner of a square) or a triangular unit cell (where an emitter is situated at each 
vertex of an equilateral triangle), as in the lower right-hand corners of Figure 30.24. The factor c in Eq. (30.127) differs 
depending on which unit cell is under consideration. Assuming an infinite array simplifies the calculations for field 
enhancement factors, enabling the values of a and c to be found and shown in Table 30.4 for the square and triangular 
array types. Although Eq. (30.126) and its improvement as Eq. (30.127) are based on a fitting of simulation results, there 
is a way to demonstrate the reasonableness of the parameterization of (3/ (3 0 by using arguments derived from the point 
charge model treatment of Section 30.3, without having to go through the time-consuming methods for the numerical 
approach of refs [501-505], which involves manually adjusting parameters until the apex radius acquired a desired 
value. 
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ie rings get larger, the number of wires per ring and the radius of the ring change as shown. 


Table 30.4 Values of c and a in Eq. (30.127) for the triangular and square arrays 
of tapered line charges in the configurations of of Figure 30.24. Also given is a s , 
as in the line charge model the nature of the emitter depends on it. 


Parameter 

Triangular (Al 

Square (□) 

a 

1.265 

1.45 

c 

1.09 

1.00 

h [urn] 

1500 

1500 

a s [pm] 

1.5 

1.5 


Most of the charge in the wire for the line charge model aggregates at the ends near the apex. The simplest approx¬ 
imation is to push all of the charge to the end: this constitutes a point charge model with only one point charge, plus 
its image, for each emitter site on the array. A problem, though, is that when the point charges are sufficiently close, 
the ellipsoidal shape of the equipotential lines for the point charges and their images are undermined, that is, the point 
charge model fails to account for the overall geometry, particularly near the base. Although such defects can be fixed by 
the clever placement of yet other charges, it is the qualitative behavior of shielding as emitters very far apart are brought 
closer together that matters, and for that the qualitative behavior of the simple point charge model is sufficient to reveal 
what unfolds. Therefore, let point charges and their images occupy the points on the square and triangular arrays. A 
few points are worth explicit mention. First, all of the charges are in the same plane, as are all of the image charges: as 
a result and perhaps confusingly until one thinks about it, for an infinite array there is no net force on a charge due to 
other charges in the same plane, so that a charge is only affected by the image charges. Second, although the tip-to-tip 
separation is the same for both triangular and square arrays, the triangular array has a higher packing density (number 
of tips per unit area) than the square array does, as suggested by the number of emitters on the first circle shown in 
Figure 30.24: the square array has four, but the triangular array has six. 


EXAMPLE: Let a be the number of emitters per unit area. Find a A and a n for the triangular and square arrays, 
respectively, where the tip-to-tip separation is b. 

SOLUTION: Consider the unit cells shown in Figure 30.24 and epitomized by the color unit cells in the lower 
right-hand corner. 

• In the triangular element, there is N A = 1/2 emitters: each vertex has 1/6 of an emitter, and there are 3 vertices, 
so (1/2) x (1/6) = 1/2. The area of the triangle is 

A a = |/> 2 sin(|) = (\/3 b 2 /4) = 0.43301b 2 (30.128) 

Therefore <r A = N A /A A = (2/yfi)/b 2 = 1.1547 /b 2 . 

• In the square element, there is N D = 1 emitters: each vertex has 1/4 of an emitter, and there are 4 vertices. The 
area of the square is A u = b 2 . Therefore, a n = N u /A c = (l)/(b 2 ) = l/b 2 . 
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When each tapered line charge is replaced by a point charge and its image, the potential energy for the ;'th charge is 
Vj(x,y,z) = 4Q S ^ - -0 (30.129) 

rf = ((x - xj) 2 + (y - y,) 2 + (z± L ) 2 ) 1 ' 2 (30.130) 


where Q s = qqj\6ne 0 = ( q s /q)Q and q s is the magnitude of the point charge. For an infinite array, by symmetry arguments 
the field at the apex of each emitter is solely in the z direction, and so evaluating F = -d z V is sufficient to find the field 
enhancement /?. In units for which all lengths are scaled by L, then the apexes of the emitters reside at h/L =1+5, and 
so the ratio p/p 0 is [505] 


njlSip 2 + 6 2 ) V 2 - (2 + 8)(p 2 + (2 + <5 ) 2 )- 3 / 2 } 
8- 2 - (2 + <5) 2 


(30.131) 


where pj is the radius of the /th ring on which the point charge sits, and where rij is the number of point charges on that 
ring (e.g., n ; (A) = 6 and «,(□) = 4 for j = 1 and 2). Clearly, the larger pj, the less of an influence the ;th ring has: as a 
practical matter, those rings for which pj > 3 b can be ignored. Simple geometry shows that for the first six rings. 


{(Pj/b) 2 } D = { 1,2,4,5,8,9} (30.132) 

= 11,3,4,7,7,9} (30.133) 


(notice that the j = 4 and ; = 5 of the rings for (A) have the same radius) and tij for each of those rings can be easily 
counted from Figure 30.24, making the evaluation of Eq. (30.131) straightforward. 

By way of example, consider the case for 8/h = 0.1 and 0.2 shown in Figure 30.25, with the triangular and square 
evaluation points compared to fits calculated according to Eq. (30.127) with the parameters of Table 30.5. The parame¬ 
ters tend to have different sizes than those associated with the line charge studies, and that has to do with how charge 
is distributed: coplanar charges do not give rise to a z component in force, whereas the line charge models do. More 
importantly, as the factor S/h changes, the best-fit parameters also change, underscoring the interpretation of these 
parameters as those which capture the effects of geometry and placement. Therefore, although there are differences in 
magnitude between the values of a and c between the tapered line charge and point charge models, they share qualita¬ 
tively analogous behavior 30 that has been traced to the placement and distribution of the charges making up the emitter 
surface. 



Figure 30.25 Evaluation of Eq. (30.127) for S/h = 0.1 (dark blue and red lines; filled symbols) and 8/h = 0.2 (light blue and red lines; open symbols) 
for the point charge model. The best-fit lines use the parameterization of y(x) = 1 - exp( -ax 0 ) with values given in Table 30.5. 


30 The upturn in behavior of the point charge model when b/h as 1 does not have an analog in the tapered line charge model. It coincides with 
the equipotential lines no longer resembling bumps. The evaluations for b/h < 1 is only shown for completeness, not because that behavior 
imitates the tapered line charge model (it does not): f}Q>) -* 0 as b -*■ 0 for the line charge model [501]. 
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Table 30.5 Best fit values of c and a in Eq. (30.127) for the triangular and square 
arrays of point charges in the configurations of Figure 30.24 as described by Eq. 
(30.131) and shown in Figure 30.25 for the two cases of 5/h = 0.1 and 0.2. 

Parameter 5/h A □ 


0.1 3.7195 4.0509 

0.1 0.3457 0.3133 

0.2 2.4897 2.7887 

0.2 0.4711 0.4290 



b/h 


Figure 30.26 The array current density J a normalized to its maximum value as a function of the ratio of the tip-to-tip separation b with the wire 
height h for ad hoc metal-like parameters in a triangular array (4> = 4.3 eV, h = 2 pm, a s = 7 nm, and /)„ = 1000), as evaluated using Eq. (30.134). The 
values of b m and J^ are shown in Figure 30.27. 



Figure 30.27 The values of b opt and J max , as evaluated using Eq. (30.134) for metal-like parameters (® = 4.3 eV, h = 2 pm, a s = 7 nm, and P o = 1000), as 
shown in Figure 30.26. 

The importance of the behavior of p{b)/p 0 is because it demonstrates that as b -* 0, then the field emission current 
density over the array, J array (pF 0 ), is negligible when /?//?„ <sc 1. Conversely, as b -* oo, then J array likewise declines as 1 /b 2 . 
As a result, J array maximizes for an intermediate value of b. That maximum can be found by evaluating the current density 
from an array of emitters by [501] 

J array (b, p) = o p (b)2na 2 s g\p{b)F 0 \J FN \p(b)F 0 \ (30.134) 

for an array p = □ or A, where a n = 1/b 2 and = A/sf^b 2 (see Eq. (30.128)). For a given apex field p 0 F 0 (the field an 
isolated emitter would experience at its apex), the behavior of J array can be evaluated, as in Figure 30.26. The analysis 
clearly shows that the optimal spacing depends on the background field, with lower fields favoring closer spacing and higher 
fields favoring greater separation. That the optimal b/h for intermediate field values is near unity (the tips are as far apart 
from each other as they are high) matches conventional expectations borne of anecdotal data or simulations. 
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30.10 Statistical variation 


Besides the possible ramifications of the resulting spatial nonuniformity, a large variation in the TV characteristics between tips in an 
array decreases the average current per tip that can be extracted without inducing voltage breakdown events. Thus, for a given tip packing 
density, the total current density that can be extracted from an array is not proportional to the array size, which is clearly undesirable. 

- Paul R. Schwoebel, Charles (Capp) A. Spindt, Chris E. Holland, and John A. Panitz [460] 

In the estimation of array current in Eq. (30.134), the implicit assumption is that all of the emitters are identical, but 
insofar as emission is from nanoscale structures at the tips of the emitters, it stretches credulity to assume that all such 
protrusions are alike, regardless of the similarity of the larger structures on which they sit. Given the acute dependence 
of field enhancement f) on tip radius a s over all other parameters, variation in the tip currents of individual emitters 
must likewise be expected. The question then becomes, how is such statistical variation manifested in current-voltage 
relations? Or, put another way, why does current density not scale with the number of emitters (or the emission area)? 

Take a very simple model to start the analysis. Assume that there are N emitters, and that the distribution in radii is 
such that there are n { emitters characterized by a radius a ir so that 


N=J j n i (30.135) 

^ = (30.136) 


where clearly not all «,■ are non-zero, and where l tip is given by Eq. (30.18). The average current per tip is then 


lave = ih) = 


N 4-1 


(30.137) 


If N is sufficiently large, then the summation can be approximated by an integration over tip radii. Let the minimum 
radius for which Hj is non-zero be a 0 (not to be confused with the Bohr radius), and likewise let the maximum radius 
for which nj is non-zero be a g (l + A). The array current is then 


n r» (1+A) , 

(y = — / 2 tea 2 g[F(a)]J PN [F(a )] da 


(30.138) 


A similar formulation would result if, instead of radius, the work function varied [388], but for simplicity take the only 
variation to be caused by geometry. A simple model then approximates F(a) a F tip (a 0 /a), so that F tip is the apex field on 
the sharpest emitter characterized by a 0 . The simplest model further takes v « 1 for ellipsoidal emitters (recall Table (2.3)): 
those two approximations result in a wonderful cancellation of inconvenient terms (try it), resulting in 


(1 _ e ~bts\ 

I array = N- ——— -I tip (,F tip ) = N eff I tip (30.139) 

where b = B 0 ® 3 / 2 /F ftp . Were all the emitters very uniform in apex radius, then A would be small, and if bA were small, 
then the coefficient of NI tip would be approximately unity: under such conditions arrays of emitters would scale with 
area or number: that they do not is an indication of what changes are wrought by A. In short, the factor (1 - e~ bh )/bA 
acts much like the fraction of tips that are emitting if all the tips were identical and had the smallest radius (much like the 
notional area acts like the fraction of the surface that is emitting if the current density across the surface were the same 
as the apex value; both the notional area and the fraction of tips emitting factor under-represent how many tips or how 
much of the surface is actually contributing). 


EXAMPLE: Calculate the fraction of tips emitting factor of Eq. (30.139) for the copper-like parameter of ® = 4.5 eV 
and a tip field of F tip = 6 eV/nm for a smallest tip radius of a s = 5 nm and a spread of A = 2. 

SOLUTION: Using B 0 from Table 2.3, 

i, = = [“) = 10.868 

F ti P \eV 1/2 nm/ 6 eV/nm 

or bA = 21.736, the size of which is so large that Eq. (30.139) is well approximated by N eff /N « 1 /bA = 0.076678. 
Thus, effectively 7.7% of the emitters are contributing. Because b depends on F tip , this percentage increases as the 
field increases. 
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As simple and convenient as the uniform radius model is, are there reasons for considering it representative of actual 
distributions of emitter radii? In spite of its obvious pedagogical value, sadly, the answer is "not quite": 31 microfabrication 
processes produce distributions in apex radii that are better modeled by Gaussians than a uniform distributions. However, 
there is a limit to how sharp emitters may become: they must be, at least, greater than zero. To get a distribution that has 
the semblance of a Gaussian but which forbids negative arguments is the closely related log-normal (LN) distribution, 
which better describes the radii of actual emitters [388]. Data for carbon nanotubes (Figure 10 of ref. [506]) and explicitly 
measured for silicon field emitters [507, 508] can be matched by a log-normal distribution (even when matched by a 
Gaussian, the distribution shows underlying hints of log-normal behavior, as in Figure 3 of ref. [509]). For example, the 
data of Guerrera etal. is reproduced in Figure 30.28 alongside the best log-normal fit. 

The log-normal distribution L(x) is defined by the relation in Eq. (29.1), where a now governs the spread and n is the 
median value of x (not to be confused with the chemical potential) where x now refers to apex radii. For L(x), and unlike 
the Gaussian distribution to which it is related, the mean (x) and the median n are different (recall Section 4.1 for an 
example of how median and mean differ): the median n is such that 

J\(x)dx = J L(x)dx =| (30.140) 


whereas the mean is (x) = /r exp(<r 2 /2). For a discrete set of apex radii a ( with N>i>\,fi and a are evaluated by 


Mf) = ^ 2 In («i) 

•‘-bin*. 


in terms of which the array current is given by 


n L 


N / L(a)I t Ja)da 


(30.141) 

(30.142) 


(30.143) 


Return, now, to Figure 30.28: the larger a is, then the more likely it is that a small number of very sharp emitters 
are present. Because of the sensitivity of tip current to apex field, these emitters rapidly dominate their neighbors. An 
indication of the behavior is given in Figure 30.29, calculated using the algorithms of Section A3.27: only for distributions 
where a is small does it appear that most of the emitters "turn on." A second way to see the dominance of the sharpest 
emitters is shown in Figure 30.30, where for larger a the sharpest emitters are seen to be responsible for most of the 
current. For the integrated current, for the larger a = 0.16, almost all of the current put out by the array is due to the 
half of the emitters for which a t < a 0 , and so the integrated current curve has taken a hard shift to the left by comparison 
to the tight distribution a = 0.02, for which the number and current distributions are more aligned. 



Eqs (30.141) and (30.142).(data courtesy of S. Guerrera (MIT)): se 
2524, 2012, particularly Figure 10, for a description) 


: S.A. Guerrera, L.F. Velasquez-Garcia, and A.I. Akinwande, IEEE Tram. El. Dev., 59, 


31 The "top hat" distribution may, however, be useful for modeling work function variation due to crystal face or coverage; see ref. [388]. 
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Figure 30.29 A distribution of field emitters tips in a 32 x 32 array with radii a t log-normal distributed with median ii = 6 nm and a as shown in the 
lower left of each panel. A work function of 4> = 4.5 eV is assumed. The apex field is assumed to be F (a) = F(ji)(pi/a), with F(n) = 5 eV/nm. When a is 
large, then only a few emitters dominate the emission current, as their contriDunon becomes well in excess of their neighbors. 



a i [nm] (3j - a min ) / (a max - a min ) 

Figure 30.30 Left, the distribution of N 2 tips for N = 128: circles correspond to a = 0.02, squares to <r = 0.16. "rn" and "rj" are number n, and current 
density J ; . normalized to their maximum value (i.e., for density, 7(a) = «(«)/« max and similarly for current). The number of bins {« ; .} is 64. When a is 
large enough, a small number of very sharp emitters exist, which quickly dominate the current that the array produces. Right, the normalized 
integrated current as a function of the normalized tip radius (a ; - - a min ), where the maximum and minimum values are determined by the 


A third and final way to see the impact of the sharpest emitters compared to the median radius emitters is through a 
representation of the current from the array on a Fowler-Nordheim-like plot, where \n(I array /F* ) is plotted against 1 / F tjp , 
where F tip is the field on the median emitter a 0 = p and where Eq. (30.137) is used, but with the n t being log-normal 
distributed according to Eq. (A3.24). Alongside the Fowler-Nordheim curve for all the emitters are two additional curves: 
in the first curve, only the output of the first non-zero n i set of emitters characterizing the sharpest radii is shown. In fact, 
for the simulation shown, n 1 = 1. That it approaches the array current at small field indicates that the array current is 
dominated by a very small number of sharpest emitters. As the field rises, more emitters with radii closer to the median begin to 
contribute. The second curve is therefore for an array composed solely of identical emitters with tip radii a, = a 0 = p: the 
Fowler-Nordheim characteristics of the actual array are seen to converge with this hypothetical set as the field increases, 
indicating that the mean emitters begin to dominate. 

Such a finding bodes poorly for the common practice of inferring apex radii, or even a single field enhancement factor 
p characteristic of all the emitters, from data on a Fowler-Nordheim plot. Doing so is a rather severe oversimplification, 
as can be appreciated on general grounds [431] or even careful measurements [510]. Inferring emission area and field 
enhancement from data is complex and dependent upon assumptions [511-513]. The motivation for doing so may seem 
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to be related to a desire to have an "effective emitter" with which to characterize improvements in emitters or perform 
simulations. In simulation, however, observe that all the current of the array originating from a few hot spots produces 
a very different beam than uniform emission over an array, especially if space charge and emittance effects are part of 
the investigation. 

But were one to try to do so anyway, of what must one be careful? The whole premise of "fitting to a Fowler-Nordheim 
curve" is predicated on a linear fit like y(x) = C 0 + C x x, where y(x) = ln(J/F 2 ) and x = l/F. But supposing one did wish to 
eke out an effective area and field enhancement anyway, and use the Fowler-Nordheim equation to do so. The assumed 
linearity of In (I/V 2 ) with 1/V plus the approximations F = pV/D and I = A 0 J, where p is the field enhancement, and 
A 0 is an emission area, suggest that the slope and intercept on a Fowler-Nordheim plot is enough to determine field 
enhancement and emission area. If the tips are distributed in field enhancement (e.g., by being distributed in radius), 
however, the slope and intercept are no longer unique. The causes are several. First, the underlying Fowler-Nordheim 
equation of Eq. (13.25) indicates that In (J/F 2 ~ v ) is linear in l/F rather than In (J/F 2 ). Second, the area factor g(F) is, 
to leading order, linear in field (recall Figure 30.17), so that ln(J/F 3-v ) should be used. Lastly, the changing slope of 
Figure 30.31 suggests that the effects of the distribution add their own modification. Isolating the contribution of a 



Figure 30.31 The current from the log-normal distribution of emitters of Figure 30.30 for a = 0.16, /j = 6 nm <1> = 4.5 eV, and N 2 = 128 2 = 16384 
(circles) compared to, first, the current only from the n 1 = 1 emitters with the smallest apex radius (red line) and, second, an array of N identical 
emitters all of radius a 0 = n = 6 nm. The sharpest emitters are seen to dominate at low field; the median emitters by virtue of their number dominate 
at high fields. 



the sharpest emitter 
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log-normal distribution separate from the area factor and current density is a challenge, but one way is to write 
N f L(a)I ttp \F(a)\da = NR(F)I t JF) (31 


where F = F tip a 0 /a min is the field on the sharpest emitter, and where a 0 is the median radius and a mjn the radius of the 
sharpest emitter present in the array, or 

1 =N^ mn L(a)da (30.145) 

Using Eq. (13.25) for J(F), Eq. (30.18) for the current and Eq. (30.98) for the area factor g[F(a)], then R(F) is approximated 
by 


0 = a mm fa 


Defined in this manner, as the field decreases R(F) -> 1 /N when the sharpest emitter dominates, but R(F) then increases as 
F increases and more of the blunter emitters contribute. Performing the integration numerically for <t> = 4.5 eV and a 0 = 6 
nm produces the behavior shown in Figure 30.32. Using a power law fit of the form F(F) « R 0 F P suggests that ln(J/F 3+p-v ) 
is linear in 1 /F. As a result, the slope and intercept methods very much depend on where the slopes and intercepts are 
taken if the ln(J/F 2 ) form is used as a consequence of the weak convexity that exists on a standard Fowler-Nordheim plot 
[197], As a result, when the actual distribution n t of emitters is known or can be modeled, and if I tip can be reasonably 
represented, resorting to Eq. (30.136) is better for modeling the array current. 
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Photoemitters 


. ..accidental observations have played an important role in the development of electron emitting materials in general and of photoemit¬ 
ters in particular. ..It is not surprising, therefore, that luck, combined with trial and error, has been so essential in the development of 
photoemitters. 


-Alfred H. Sommer [134] 


Among the most challenging applications of photocathodes are particle accelerators [514-516] and light sources such 
as free electron lasers (FELs) [517-519], both of which demand a great deal in terms of current density and ability to shape 
the electron pulse. For these applications in particular, the need to overcome the limitations on beam brightness imposed 
by thermionic cathodes led to the development of high-performance RF photoinjectors [520-523]. As the operational 
wavelengths of FELs in particular are made smaller and smaller with the goal of an X-ray FEL [524], five metrics of 
photocathode performance emerge as particularly important: 

• quantum efficiency governs how many electrons are emitted compared to how many photons are absorbed, with high 
quantum efficiency being desirable 

• emittance describes the tendency of an electron beam to spread out as the beam propagates, with low emittance beams 
being desirable 

• the operational lifetime of the photocathode before replacement or rejuvenation 

• the ruggedness or survivability of the photocathode, particularly in the messy environment of an RF photoinjector 

• the response time of the photocathode, which affects how well pulse shaping of the electron beam can be accomplished. 
All high quantum efficiency photocathodes involve cesium, and the highest quantum efficiency photocathodes tend 

to be semiconductors. Not surprisingly, therefore, electron transport through the bulk material of the photocathode and 
the conditions of the surface when the photo-excited electrons get there are particularly important to the understanding 
of photocathode operation. This will lead to a focus on scattering during transport, and the relationship between cesium 
coverage and low work function (a similar concern of thermionic cathodes, although there barium is used because of its 
lower evaporation rate), to understand how the processes of emission affect the metrics. Of the five metrics, quantum 
efficiency and emittance often dominate attention: generally, they increase together [181], Lifetime appears to be distinct, 
but a low quantum efficiency requires that high peak/average current applications must increase the intensity of the drive 
laser power on the photocathode, causing the cathode to heat. Ruggedness refers to how long photocathodes can endure 
in the injector environment: RF photoinjectors have worse vacuums than DC photoinjectors, and the ions produced 
erode coatings and damage crystal lattices. Response time refers to the matching of the electron pulse shape to the shape 
of the drive laser [525]: yield from photocathodes like GaAs or GaAsP has a thermalized electron component which, in the 
case of negative electron affinity (NEA), can continue to be emitted even after the laser pulse has stopped. Thermal tails 
are seen experimentally [526] and in Monte Carlo simulations of related photocathodes like Cs 2 Te, where the response 
time is lengthened (approximately 0.4 ps) compared to the fast response for metals [527], as in Figure 31.1. 

As long as most of the pulse shape of the electrons mimics the drive laser pulse shape (i.e., the pulse length is long 
compared to the response time), some simple estimates may guide the theory most useful in understanding photocathode 
operation. Return to Eq. (14.3) as a starting point. For FELs, rather generic parameters for what is desired are considered 
in the next example, although they, and the pulse repetition frequency, depend on what the FEL is designed for [528], 


EXAMPLE: Develop a relation to estimate the required drive laser power as a function of wavelength X and quan¬ 
tum efficiency ( QE ). Ignore reflectivity. 

SOLUTION: The QE is the ratio of the number of electrons out compared to the number of photons in, or 



(31.1) 


for X measured in [nm] and I x in [Watt/cm 2 ]. 
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Delay Time [fs] 


Figure 31.1 Time delay after photo-excitation for emission to occur in a Monte Carlo simulation of Cs 2 Te. Monte Carlo data digitally extracted from 
Figure 5 of ref. [527]. The simulation by Ferrini et al. fitted the yield data for a Cs 2 Te layer on a molybdenum substrate over a range of photon energies 
between 3.7 and 5.5 eV. 


EXAMPLE: For a charge per bunch of A Q = 1 nC, a pulse length of 8t = 50 ps with Af = 5 ns between pulses, and 
an emission area of A = 0.2 cm 2 , find the peak and average currents. 

SOLUTION: The peak current J e = AQ/StA is 

J =--= 100-^- 

(50 ps)(0.2 cm 2 ) cm 2 

and the average current is smaller by J ave = J e (8t/At) = 1 A/cm 2 . Using the relation between J e and I x of the previous 
example, to pull 100 A/cm 2 from the photocathode for a drive laser with a wavelength of 266 nm and a quantum 
efficiency of 0.1, the drive laser would have to provide a peak power of 4.66 kW/cm 2 of incident laser power and 
an average power of 46.6 W/cm 2 (the example is for illustration only). 


The illustrative example above can be used to develop a table of values relating what is desired in terms of current 
to what is demanded from the drive laser. A few other considerations matter. The drive laser is usually from a readily 
available laser (such as an Nd:YAG with an output wavelength of 1064 nm) that is then frequency doubled (to 532 nm), 
tripled (to 355 nm) or quadrupled (to 266 nm), an amount characterized as the harmonic and labeled n = 2,3, or 4, so that 
X = X 0 /n. Other drive lasers with other wavelengths, like 800 nm, can be used, of course, but keep the discussion simple. 
Such conversion generates waste heat in the doubling crystals, the conversion efficiency crudely characterized by 50% 
for n = 2, 30% for n = 3, and 10% for n = 4. Also, metals are quite rugged, so their lifetimes tend to be long, but good 
photocathodes like CsK 2 Sb which perform well at lower current levels can be taxed so hard that their lifetimes shorten 
to hours [529]. Similarly, semiconductor photocathodes scattering physics leads to longer response time than metal 
photocathodes, which affects pulse-shaping demands. A table of laser parameters, as in Table 31.1, current densities, and 
photocathode candidates can be constructed from such considerations. 


Table 31.1 Relation of photocurrent to drive laser. "Harmonic" is the change of the X a = 1.06 pm wavelength to a shorter wavelength 
using frequency doubling crystals such that wavelength 2 = XJn. "Efficiency" is the conversion efficiency of shortening the wavelength. 
"Temp. Resp." is the temporal response. "Ave P at cath." is the average laser power deposited on the cathode. "Ave P laser" is the average 
power of the primary 1064 nm Nd:YAG before harmonic conversion. 8760 hours is one year, but metal photocathodes last longer than that. 


Property 

Units 

K 2 CsSb 

Cs 2 Te 

GaAs 

Cu 

Mg 

Harmonic 

_ 

2 

4 

2 

4 

4 

Wavelength 

nm 

532 

266 

532 

266 

266 

Efficiency 

% 

50 

10 

50 

10 

10 

Starting QE 

% 

8 

5 

5 

1.4E-2 

6.2E-2 

Lifetime 

hours 

4 

> 100 

58 

>8760 

>8760 

Temp. Resp. 

- 

Prompt 

Prompt 

<40 

Prompt 

Prompt 

Vac. tol. 

- 

Poor 

Very good 

Poor 

Excellent 

Excellent 

Ave P at cath. 

W/cm 2 

29.13 

93.22 

46.61 

33293 

7518 

Ave P laser 

W/cm 2 

58.26 

932.20 

93.22 

332932 

75178 
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One consequence of low quantum efficiency evident from Table 31.1 is the substantial power levels deposited on the 
photocathode: power not leading to photoemission leads to heating of the cathode. Metals subjected to moderate power 
levels already get hot, an example being the tightly coiled tungsten filament in a standard 60 W incandescent lightbulb, 
heated by the passage of current through the tungsten wire. How hot can be crudely estimated, as shown in the next 
example. 


EXAMPLE: Estimate the temperature of a tungsten filament knowing that the length of the wire is 2 m and its 
diameter is 1/100 of an inch. Assume the emissivity of tungsten is e = 0.35 (it is not a perfect black body emitter). 
Assume that the length of the coiled tungsten wire is 2.5 cm. 

SOLUTION: The temperature of a hot body was treated in Section 3.3. Assume that the tungsten filament is packed 
into a cylinder half the density of a tungsten wire (the empty space being due to the coil). The radius of the cylinder 
of tungsten coil is then approximately given by 


/ 1 2.54 cm 
V100 2 


) (2 m) = —(2.5 cmjjzr 2 


or r « 0.4612 mm. The power P 0 = 60 W is radiated from this cylinder, and so based on oT 4 law of Section 3.3, 


P 0 

In rL 


(31.2) 


or T « 2541 K after using P 0 = 60 W and a sb = 5.67047 X 10“ 7 W/m 2 K 4 . 


In spite of the crudeness of the previous example, it nevertheless suggests that a metal photocathode will experience 
temperatures beyond its melting point if driven as hard as the parameters of Table 31.1 would suggest are required, given 
the vastly larger amounts of laser power (compared to 60 W) that are indicated as needed if a metal photocathode were 
employed [83, 85, 206]. Semiconductors exhibit greater quantum efficiency and therefore relax drive laser demands. 
Understanding why involves examining the transport characteristics and the nature of the emitter surface. 

Although scandate and barium dispenser cathodes give interesting performance levels when used as photocathodes 
[23, 530, 531], in general it is true, as Sommer observed [134], that all high quantum efficiency photocathodes involve 
the use of cesium, either as a coating or integrated into the stoichiometry of the bulk material. On metals, cesium 
provides a low work function coating. For semiconductors, multi-alkali antimonides and tellurides give much higher 
quantum efficiencies. Various photocathodes are compared in Figure 31.2, in which quantum efficiency is compared to 
lifetime (measured by the 1 /e parameter r or the amount of time taken for quantum efficiency to degrade by a factor 


Cs-GaAs 8 

#K 2 CsSb 5 * Cs2 T e1 
• Cs 3 Sb 4 * Cs2Te 
Cs-GaAs 3 Csl 4 


KT 4 

10° 


Peak UV QE (%) 
Peak Green QE (%) 
y = 100/x 


10 1 10 2 1 0 3 1 0 4 1C 

Time constant decay factor x [hours] 


Figure 31.2 The quantum efficiency of various photocathodes or materials (based on Figure 1 of ref. [532]). The red dashed line is used only as a guide 
to the eye and corresponds to QE x x = 100, reinforcing that quantum efficiency scales inversely with the 1/e decay factor x. The superscripts on the 
chemical formulae identify the reference: 1, Suberlucq [533]; 2, Moody et al. [534]; 3, Monaco et al. [535]; 4, Siggins et al. [536]; 5, Kong et al. [537]; 

6, Rao et al. [538]; 7, Wang etal. [539]; 8, Van Loy etal. [540]; 9, Calabrese etal. [541]. 
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of 1/e). A pattern emerges: not only is quantum efficiency improved by better transport through the bulk when scat¬ 
tering does not undercut emission, but it is also related to the surface coating and fades as that coating is damaged or 
disappears. 


31.1 Scattering consequences 


Nowhere to run ain't got nowhere to go 

- Bruce Springsteen 1 

Scattering in metals is a bit different than semiconductors as far as quantum efficiency is concerned. Some hint as to 
why is revealed by considering a collision between two equal mass particles labeled a and b ; the problem is easier working 
in the reference frame of the b particle, taking the incident a particle to be traveling in the x direction, and letting the 
plane of the collision products be in the x-y plane. Before the collision, there is no y component of momentum p, and 
after the collision must be the same: therefore, ■ p a = mv y and particle b will have ■p b = -mv y . This will ease notation 
in that the initial velocity of particle a can be labeled v a and the final velocities in the x direction can be written v' a and 
v' h . As a result, and after eliminating the mass term, which is the same for both particles, conservation of momentum and 
energy can be written as 


for which solutions are 



(31.3) 

(31.4) 


(31.5) 

(31.6) 


These equations will be treated in greater detail in Section 32.1; here, it is enough to observe that the primary electron 
shares its energy with the secondary (primed) electrons, and on average the energy is divided evenly amongst the primed 
electrons. Already that reduction in energy can be fatal to emission if the photon energy does not much exceed the work 
function. Even more than that, the initial state of the b electron has to be occupied, but the final states of the primed 
electrons must be unoccupied : if not, then the electrons have nowhere to go. 

Now the crucial difference between metals and semiconductors can be understood. The b electron is typically near 
the Fermi level p in a metal whereas the a electron has an energy typically near p + hco : sharing their energy (if shared 
evenly) would leave the primed electrons with both their energies above p in generally unoccupied states if hco > k B T, 
as dictated by the Fermi-Dirac distribution f PD (E) of Section 6.2. Compare that to semiconductors, for which the photon 
energy must exceed the band gap E g to excite an electron from the valence band into the conduction band. Were that 
excited electron to scatter with another valence band electron, both of the electrons would have to have an energy 
greater than E g or the transition is forbidden, and this does not happen until hco is sufficiently large, that is, in the words 
of Spicer and Herrera-Gomez [180], a "magic window" exists in which there is no electron-electron {e-e) scattering for 
semiconductors, as schematically suggested in Figure 31.3. Electron-phonon (e-p) scattering does not involve such a 
large energy transfer, and so the photoexcited electron keeps its energy longer and therefore has greater opportunity to 
be emitted. 

When scattering of photoexcited electrons does occur in semiconductors, therefore, it is by way of e-p scattering, 
which bleeds the energy off the primary electron far slower than e-e scattering. Photoexcited electrons will therefore 
tend to travel further, and if they do scatter they may nevertheless retain sufficient energy to surmount the barrier 
if the semiconductor is a positive electron affinity (PEA) material. For NEA materials, for which the vacuum level 
is either always below the conduction band minimum ("true" NEA) or as a result of band bending when a field is 
present, even those electrons which have thermalized with the lattice nevertheless may be emitted, as shown in 
Figure 31.4. 


^ruce Springsteen, lyrics to Born in the USA, 1984. 
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Figure 31.3 Left, e-e scattering in metals: photoexcited electron scatters with an electron at the Fermi level p, with both final states above p, so that the 
scattering is allowed. Right, e-e scattering in semiconductors: photoexcited electron must have sufficient energy so that when it scatters with a valence 
electron both final states are allowed (thick arrows); if the final states are in the band gap, the scattering is not allowed (thin dashed arrows). Thus, 
there is a "magic window" in energy where e-e scattering is forbidden, for which the quantum efficiency is larger. 


No Scattering Some Scattering Diffusion 




Figure 31.4 Three types of photocathode material characterized by differences in scattering. Left, metal-like: scattering is fatal to emission and primarily 
unscattered electrons surmount the work function barrier. Middle, semiconductor-like: a few scatterings do not impede emission, so that scattered and 
unscattered electrons can contribute. Right, drift-diffusion-like: electrons can experience many scattering events and thermalize to the bottom of the 
conduction band, yet still be emitted. 


104 1. yi 


Drift Diffusion 



Quantum Efficiency [%] 

Figure 31.5 The three emission mechanisms of Figure 31.4 characterized in terms of quantum efficiency versus response time (based on Figure 14 in 
ref. [180]). Metals are fast responders but have low quantum efficiency; conversely, drift-diffusion semiconductors like GaAs have high quantum 
efficiency but long tails in their emitted pulses. The bands show the dominant scattering mechanism: e-e for metals, e-p for semiconductors, and 
drift-diffusion for NEA photocathodes. 


Scattering, however, slows down electron transport. As a result, a difference in response time exists between the 
various photocathode material types, with metals being the fastest responders and drift-diffusion materials like GaAs 
having longer response times. The tracking of the quantum efficiency with the response time is indicated in Figure 31.5. 
Correlated with scattering by virtue of drive laser requirements, there is finally the question of laser power deposited on 
the photocathode and photocathode sensitivity to the environment. The relations are compactly shown in Figure 31.6, 
based on a survey of photocathode metrics and performance by Dowell et al. [523], where the vacuum tolerance of the 
various photocathodes (metal, antinomies, and cesiated GaAs) is shown in conjunction with their average current versus 
quantum efficiency regimes behind curves of constant current for various wavelengths. 
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Figure 31.6 Average laser power versus quantum efficiency required to produce average currents. Number labels are "wavelength [nm]/current [mA]". 
Lines are of the form I e = (qX/2xhc) QE I A . Ranges for the general cathode types, and their vacuum requirements, are also shown. Based on Figure 1 of 
ref. [523]. 


31.2 Basic theory 


Much in the letters, nothing in the praise. 

- William Shakespeare 2 

Finding the number of electrons to be emitted therefore requires knowing the probability that electrons with an energy 
E absorb a photon and get to an energy state where photoemission is allowed. The time-dependent Schrodinger's equation 
is therefore of the form 

(ihd t -H)\y/) = Q (31.7) 

where the Hamiltonian is the sum of H = H 0 + 8H, and where the solutions to 


(ihd t - U 0 )W„) = 0 (31.8) 

are known. Introducing the retarded Green's function, G(f — t'), where 

(ihd t - H 0 )G(t - t') = 8(t - t') (31.9) 

and letting 6(t - t') be the Heaviside step function (1 for positive argument, 0 for negative), then formally G is given by 
G+(f -t') = —iO(t - f) exp [i(H 0 /h)(t - t')] (31.10) 

and so m 

I V + )=\V„) + f G + (t — t')8H\iff + )dt' (31.11) 

Letting |yr) = \a)e~ m « t , where hoo a = E a and 


0 = (H-E a )\a) 

0 = {H 0 -E a ) | a 0 ) (31.12) 


and introducing a small parameter r\ to make the integrations behave properly, then Eq. (31.11) involves an integral of 
the form 


< 


expfdCff,, - E a )/h - iri]t'}dt' = 


(E a - H 0 )/h + ip 


2 Ref. [37]: Love's Labor's Lost, V.ii.40. 
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and so, letting e" 1 -*■ 1, the Lippmann-Schwinger equation 



-5H\yr) = \y/ 0 ) + G+(E a )8HW) 


(31.14) 


results. Transitions W ab from state |i p a ) to \yr b ) are given, to leading order, by Fermi's golden rule 
W a<b = 27r\( ¥b \SH\ Va )\ 2 8(E a - E b ) 


(31.15) 


Now, when SH <xp - A, where A is the radiation field of light [542, 543], then the yield, or number of electrons excited 
per photon, can be laboriously calculated once the Hamiltonian H 0 and the basis states \y/ 0 ) are specified. But even if 
they are, transport past the surface barrier constitutes the next non-trivial difficulty. Those who have even made it this 
far can be excused for experiencing a cold sense of foreboding that an equation of the simplicity of the other canonical 
emission equations has seemed to pass well beyond reach. Many-body physics is hard, even as it is powerful, but if the 
desire is for a quick computational tool, yearning for a more phenomenological treatment is inevitable. There is hope. 
It is next. 


31.3 Three-step model 

Wan waves and wet winds labour, 
Weak ships and spirits steer; 
They drive adrift, and whither 
They wot not who make thither... 


Algernon Charles Swinburne 3 


A phenomenological model of photoexcited electrons migrating in all directions and diminished in numbers due to 
scattering proved an efficacious description of the yield of photocathodes. The three-step model (TSM) [168, 179, 180, 
544, 545] provides a way past the hardship of a proper model, 4 the utility of which continues unabated to describe 
experimental quantum efficiency and emittance [181, 546]. In the TSM, the photoemission process is separated into the 
three distinct processes of absorption, transport, and escape. Models for each of them are then developed, from which 
an estimate of photoemission current can be made. The power of the method and its heuristic appeal lie in the ready 
visualization of the components, and then in the development of yet more models to treat the parameters on which those 
components depend. Compared to how the TSM is developed by its progenitors, the derivation here is more improvised 
so as to introduce concepts before undertaking the moments model in Section 31.4. Five simplifications [179] allow for 
a tractable TSM: 

• the photoexcited electrons are isotropically distributed 

• the only scattering events treated are taken to be inelastic 

• inelastic scattering is treated in the context of a mean free path that depends only on the photoexcited electron's 
energy 

• the scattering is isotropic 

• the normal energy of the photoexcited electron (the energy component into the barrier) must exceed the barrier height 
for escape, and when it does the probability of escape is unity. 

Although such approximations entail conceptual difficulties [547], they allow the evaluation of yield to be separated into 
the three distinct processes: 

1. the penetration into the bulk by photons followed by their absorption 

2. the transport of electrons back to the surface with their numbers whittled away by scatterings with other electrons and 
phonons 

3. the escape over (or through) the surface barrier into vacuum. 


3 Algemon Charles Swinburne, The Garden of Proserpine, in M.L. Rosenthal, Poetry in English: An Anthology. New York: Oxford University Press, 
1987, p. 744. 

4 Berglund and Spicer were not so blase: they spoke of "extreme difficulties..." instead of "hardship.'. 
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This demarcation means that the physics of each process needs to be developed. For laser absorption, reflection at the 
surface and the depth to which the light penetrates mean the reflectivity of the surface and the dielectric constant of 
the material require examination. For electron current, the transport to the surface and emission into vacuum means 
scattering rates and energy loss as well as emission probability require examination. The examination of emission can be 
dismissed by assuming a step-function transmission probability in lieu of Chapter 18. 

The fraction of photons absorbed is 1 minus the fraction reflected, or (1 - R ), where Rim) is the reflectivity as a function 
of frequency (or wavelength). Of those photons absorbed, only electrons with a starting energy E> p-hco stand a chance 
of absorbing them, as the energy levels to which they aspire are otherwise filled as per the Fermi-Dirac distribution, for 
which the zero temperature limit is a good approximation. Quantum efficiency is then a ratio of the current of electrons 
that can pass the barrier with the current of all excited electrons. Restrictions on the ability to surmount the barrier 
introduces the escape cone: define 9 m as the maximum angle of a photoexcited electron for which the "normal energy" 
exceeds the barrier height (E + hco)cos 2 9 m = p + <t> or 


(31.16) 


Not all electrons make it to the surface, though. The ones that scatter can lose enough energy that they no longer are 
able to surmount the surface barrier, and so are lost. For metals, a single collision between two electrons can be enough 
for interfere with emission, and so th e fatal approximation is often made, 5 whereby any collision thwarts emission [58, 85 
179]. If electrons are photoexcited at a depth x into the bulk material and are then traveling at an angle 9 with respect 
to the surface normal, then they must travel a distance x/cos 6, as in Figure 31.7. A fraction fx(E, cos 9) will therefore 
survive the journey to the surface. To evaluate that fraction, the number of electrons that are generated at a depth x must 
be known. Let the optical penetration depth be <5(w). The probability of photoexcitation at a depth x is then proportional 
to exp(-x/<5). The probability an excited electron will scatter and therefore be lost decreases exponentially with distance, 
with the length scale governed by the mean free path l (distance between scattering events on average) according to 
exp(-r//) where r is the distance traveled (Section 32.2.1). With x = rcasO, 


f A (cos 9, p(E)) = 


fo°° ex P (“ 


/ 0 °° exp (-X(5) dx 


cos 9 

cos 9 + p(E) 


(31.17) 


where 


p(E)= S -^ = m8(co) 
1(E) hk(E)r(E) 


(31.18) 


is the ratio between the laser penetration depth 8(co) and the mean free path 1(E) = Tikr/m as a function of the total 
electron energy E, and where x is the distance the electron must travel to freedom. 


4(cos0, E) 



Metal \ r(w) Semiconductor 


Figure 31.7 Components of the TSM for metals (left) and semiconductors (right): (i) absorption and reflection of light R(w\ (ii) transport and losses to 
the surface /_,(£, cos 9), and (iii) transmission into vacuum with the transmission probability D(E X ) that depends on work function <6 (for metals) or 
electron affinity E a (for semiconductors). 


5 But not always: Bergiund and Spicer [179], for example, assess the effects of electrons that have scattered but once, and a phenomenological 
model of the contribution of scattered electrons will be considered in Section 31.7. 
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EXAMPLE: Metal-like: Evaluate p(E) for an electron excited from the Fermi level p = 7 eV by a photon with 
wavelength 266 nm (Tim = 5 eV). Assume the laser penetration depth in copper is 12.6 nm. For a hot electron 
of E = p + Tim « 12 eV, assume the e-e relaxation time is 3.4 fs [85, 548], 

SOLUTION: From the energy of the electron of E = p + hm = 7 eV, therefore k(E) = sJlmE/h = 17.747 1/nm. 
Therefore 

I = mS = (5.6856 eV/c 2 ) (12.6 nm) 

P{ metal Tik(E)r (0.65821 eV-fs) (17.747 nm _I )(3.4fs) 


or p(E) = 1.804. 


EXAMPLE: Semiconductor-like: Evaluate p(E) for an electron excited from the valence band for cesium antimonide 
(Cs 3 Sb) for which [308]: mjm = 0.1176, band gap E g = 1.6 eV, and laser penetration depth <5 = 27 nm and t = 31 fs. 
SOLUTION: The energy of the electron is E = Tim - E g = 0.73053 eV. Therefore, k = 0.43788 1/nm, and so 

nm _ ms _ (0.66862 eV/c 2 ) (27 nm) 

P( )\semi (0.65821 eV-fs) (4.3788 nm- 1 ) (31 fs) 


or p(E) = 0.2021. 


Taking a moment to appreciate the differences between the Fowler-DuBridge model of Chapter 14 and the TSM is 
useful. In the modified Fowler-DuBridge model, the number of electrons that survive the odyssey to the surface, as in Eq. 
(14.6), is separated out in a manner that encapsulates the three steps of the emission process by (1 - R) -*■ absorption, 
F x -*■ transport, and P(Tim) -*• emission. Where F x acts as an overall multiplicative factor to the average probability of 
emission, the scattering factor f x depends on the energy of the scattered electron. A summation over the product of 
the energy-dependent loss factor /,(cos 9, p(E)) with the energy-dependent emission probability is therefore required if 
the emission probability is not a step function, which forebodes differences arising between the treatment of metals 
compared to semiconductors as a consequence of the emission barrier (think image charge/Schottky barrier compared 
to a triangular barrier) in interesting ways, as will be undertaken in the moments model. 


31.4 Moments model 

Knowledge, the known, and the knower 
are the three things that motivate action: 
instrument, action, and agent 
are the three components of action. 

- Bhagavad Gita 6 

The reflectivity and laser penetration depth (related to the optical factors) and the relaxation times (related to the e-e 
and e-p scattering rates) depend on the photon energy, the former directly, the latter because of how it affects the electron 
energy. Thus, the factor F x of Eq. (14.6) really should not be part of the energy integration. If the TSM is recast in the 
form of an integration over the possible energies of the emitted electrons and the language of discussion shifts over to a 
distribution function outlook, then the moments approach results: quantum efficiency QE is a ratio of currents, and that 
current is the first "moment" of the emitted distribution function as in Eq. (7.8), or 

M, (k 7 ) 

Qg={l—K(fi>)} ' z - (31.19) 

2 M 1 ( k )| fl _ 1 ^ =1 

where D(E) refers to the transmission probability and/, to the scattering factor. The denominator accounts for the current 
of all electrons, so it includes those going in the (+£) direction as well as the (—z) direction, thereby giving a factor of 2x. 
Additionally, all excited electrons are considered regardless of what direction they are headed, thereby using k rather than 


6 Stephen Mitchell, Bhagavad Gita: a new translation. New York: Harmony Books, 2000, Verse 18.18, p. 185. 
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k z in the argument of the moment function in the denominator. For metals, the energy E{k) of the electron is constrained 
by the initial energy range and the availability of final states: in the zero-temperature limit of the Fermi-Dirac distributions 
that govern both, 

f FD (E){ 1 ~f FD ( E + ft®)) -*■ - E)&(E -p + hco) (31.20) 


where 0(x) is the Heaviside step function (1 for positive argument, 0 for negative). For semiconductors, in contrast, 
the photoexcited electron must have an energy above the band gap, for which the energy is constrained by a factor 
of ®{hco + E- E g ). For metals then, absorption, transport, and emission combine to describe the current of all electrons 
making their way to the surface such that their normal energy component exceeds the barrier height and is evaluated 
by 

k 2 dk fgihk cos 6/m)f l (cos0,p)27r sin OdO 

QE«(l-R)— ---5- (31.21) 

2 f * _ k 2 dk f 0 1 (hk/m)2jcsinOdO 

V k F- k l 


where the lower integration limit in the denominator is zero if hco > p (a situation outside present interest), and where 
Ti 2 k 2 F /2m = p, h 2 k 2 n /2m = p- (hco - <p), h 2 k 2 /2m = hco, and the equation is approximate because scattered electrons are 
neglected. Relations involving 0 m (k) in Eq. (31.16) that will prove useful are 


sin 2 0 m (k) = 
sin 2 0 m (k p ) = 


E(k) — p + hco — d> 
E(k) + hco 
hco — 4> 
p + hco 


(31.22) 

(31.23) 


where the second, evaluated for energy at the Fermi level E{k p ) = p, will be particularly useful, as it is often a small 
parameter for metals. 


31.4.1 Metals 


In the case of all things which have several parts and in which the totality is not, as it were, a mere heap, hut the whole is something 
besides the parts... 

- Aristotle 7 


Because the small 0 region dominates the integrals, the approximation f x {cosd,p) « cos 0/(1 + p) is reasonable and 
convenient given the size of p for metals: with a partially filled conduction band, many final states for a photoexcited 
electron to scatter into, and the sheer number of roaming electrons that a metal contains with which the photoexcited 
electron may scatter, the mean free path is small (Section 32.3.3). The approximation results in 

QE «- 2(1 ~ R \ -— / ^ k 3 [ 1 - cos 3 0 m (k))j dk (31.24) 

Xl + p)kt{2k 2 p -k 2 01 )J km 

Making the substitution cos 0 m {k) = k 0 /\Jk 2 + k 2 results in an integral that can be evaluated analytically to give 


2(1 -R) 


3(1 + p)k 2 a (2k 2 p - k 2 a ) 


k* P - 4 + 3 + 2 k 2 w ) ■ 


4 k 3 0 (k 2 P + 2 k 2 w ) 
yjk 2 p + k 2 m 

where h 2 k 2 J2m = p + d>. When {hco - cj))/{p + tjo) is small, it is possible to show 8 


(1 — R) pjhco-cl)) 2 
(1 + p) 4hco(p + hao)(2p - hco) 


(31.25) 


(31.26) 


(31.27) 


7 Aristotle, Metaphysica Book VIII (trans. W.D. Ross), Great Books of the Western World, ed. Robert Maynard Hutchins, Vol. 8, Aristotle I. Chicago: 
Encyclopaedia Britannica, 1952, p. 569. 

8 Showing it using Eq. (31.26) takes work. See the discussion around Eqs (31.99) to (31.103) for an alternate way to the same destination. 
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Such an equation should give an approximate account of the quantum efficiency of metals up to an overall multiplicative 
factor C taking into account multiple crystal faces, adsorbates on the surface, effective mass variations, and other factors 
which may cause only a fraction of the surface to emit. 


EXAMPLE: Find the factor C given that QE = 2.049 x 10 -4 at hco = 5 eV based on Figure 3.6 in Section 3.3 and 
using the following parameters and assumptions: p = 7 eV, ® = 4.615 eV, and F = 50 eV/pm. Take the reflectivity 
R = 0.4, which is typical for copper at a photon energy of 5.0 eV and treat it as constant (even though it is not). Take 
the relaxation time in copper to be t = 3.4 fs (even though it is energy dependent), and take the laser penetration 
depth to be 8 = 12.6 nm. 

SOLUTION: First, k = \/2m(p + cjo)/h = 17.257 [1/nm], Second, the penetration depth to mean free path ratio p = 
mS/hkr = 1.8549. As a result of the field, 4> = ® QF = 4.3468 eV. The ratio between the experimental QE and 
Eq. (31.27) is therefore C = 0.70504. 


Using Eq. (31.27), the data of Figure 3.6 can be re-examined using the value of C = 0.67383 determined from the previ¬ 
ous example. The comparison is shown in Figure 31.8, along with the Fowler-DuBridge hypothesis that QE(X) oc (hco - cjo) 2 
(dashed gray line). The better agreement over the Fowler-DuBridge relation is evident, even though the relaxation time is 
a function of the photoexcited electron energy t (E + hco) and the reflectivity is a function of frequency R(a>) and therefore 
both are affected by hco. 

As a final observation, the moments-based approach cleaves to the k-based approach of calculating current density, 
whereas an account based on Spicer'sTSM [179] works with energy, where E = h 2 k 2 /2m is often assumed (the application 
of the TSM to the present copper data analysis is encapsulated very well in Dowell et al. [58]; the energy dependence of 
t(E) will be revisited in Sections 32.3.3 and 32.3.4, the former for e-e scattering, as in Figure 32.15, being particularly 
important for metals). As a result, although the equations for QE for each approach differ in appearance, they are kindred 
methods. 


31.4.2 Semiconductors 


In desperation I asked Fermi whether he was not impressed by the agreement between our calculated numbers and his measured numbers. 

He replied, "How many arbitrary parameters did you use for your calculations?" I thought for a moment about our cut-off procedures 
and said, “Four." He said, “I remember my friend Johnny von Neumann used to say, with four parameters I can fit an elephant, and 
with five 1 can make him wiggle his trunk." 

- Freeman Dyson [549] 

Semiconductors add complication to the analysis. There are all the considerations of Chapter 22, but in addition, first, 
the barrier at the surface has an increased triangular shape that affects the transmission probability (D(E) -*■ D^(E)), 


KT 3 

10 -4 

a 

KT 3 

icr 6 


Figure 31.8 Experimental data for the quantum efficiency data of copper (courtesy of David Dowell (SLAC); see also ref. [58]) compared to Eq. (31.27) 
scaled by C = 0.70504. The dashed gray line corresponds to QE = QE 0 [(ha> - 4>)/(hm 0 - <b)] 2 where QE 0 is obtained by setting tuo -»<p in Eq. (31.27) 
except for the (hco - 4>) 2 factor. 
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Figure 31.9 The quantum efficiency of Cs 3 Sb as measured by Spicer [550] (red dots and line) and by Taft and Phillip [551] (blue squares and line). The 
fit parameters were based on data digitally extracted from Figure 1 of ref. [308]. 


and, second, for the "magic window" of semiconductors, scattering effects are not nearly so fatal as for metals. Most 
importantly, the behavior of QE as a function of photon energy hco takes on a different appearance than the quadratic 
behavior exhibited by metals in Eq. (31.27), as shown by the measurements of Spicer [550] and Taft and Philipp [551] 
in Figure 31.9 compared to the fit suggested by Spicer to be 9 


QE(ha>) «- — - (31.28) 

1 +--- 

(hco - v g y 

where V g should be the sum of the band gap plus the electron affinity E g + E a , and where Spicer found v « 3/2 (not to be 
confused with the v of field emission). When hoo is rather close to V g , then QE oc (hco - V g ) v , rather reminiscent of the QE 
of metals. Here, the parameters B and g are being treated as adjustable, and V g and v are perilously close to being treated 
likewise. Why these parameters are what they are takes some inquiry to constrain just how far they may be adjusted, as 
well as to what they likely refer. 

Return, therefore, to the integral from which QE is calculated, but to avoid having to discuss the value of m n , the 
substitution k = yj2m n E/h transforms the integral to one over energy. For semiconductors then. 


QE = (l-R(co)) 


//* EdE xdxD A [Ex 2 ]f k (x, p(E )) 

2 f 0 Em EdE f 0 l dx 


(31.29) 


where the upper limit of the energy integration is E m = hco- E g and the lower limit of the x = cos0 integration is x m = 
\JEJE. Additionally, E is the energy of the photoexcited electron, x = cos 0, and although some electrons can tunnel 
through the triangular barrier, the contribution of electrons below the barrier height (characterized by the electron 
affinity E a ) is negligible. Observe another subtlety: for the metal treatment in Eq. (31.24), k{E) referred to the electron 
prior to the absorption of hco, whereas now E in Eq. (31.29) refers to the energy after absorption, a convenience due to 
energy being measured with respect to the bottom of the conduction band. Either convention is acceptable as long as the 
arguments of p(E), t(E), and D(E) take into account the photon. The energy of the photoexcited electron is constrained 
by the difference between the photon energy hco and the band gap E g on the upper limit of integration, and the barrier 
height E a on the lower limit. 

Suppose that the transmission probability D^ffEx 2 ) could be treated as a step function, as in &(Ex 2 - E a ), as for metals 
in Section 31.4.1. The integration over the scattering term f x in the case of the metal treatment made use of the approx¬ 
imation f A « cos(9/(1 + p) in Eq. (31.29), but how good is that approximation? From either Eq. (31.21) or Eq. (31.29), 
the approximation affects the evaluation of 


F x (x m ,p) = 


f*h 


\(x,p(E))dx 


(31.30) 


J Although this equation bears scant similarity to Eq. (6) of Spicer [550], the two equations are in fact identical for the approximations Spicer i 
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where x m = \JeJE for semiconductors or cos 6 m for metals, and the notation of F i is meant to evoke fond memories of 
the modified Fowler-DuBridge equation of Eq. (14.5). The approximation of f x used in Section 31.4.1 for metals results 

,3L3,) 

However, the integration is, in fact, analytic, and gives [308] 

%,?)= l^-x m )a+x m -2p)-p 2 \n(^^j (31.32) 

In the limit that x m -* 1, and in terms of the small quantity e = (1 - x m )/(\ + p), both approach the same limiting value 
(the approximate equation behaves as F x « e - (1 + p)e 2 , whereas the exact equation behaves as F x » e - (p + (l/2))e 2 
for small e) and therefore the approximate form of F x is adequate. However, when the wavelength becomes small, the 
differences between the approximate and analytic solutions become more pronounced: in the short wavelength limit 
x m -*■ 0, the approximate solution approaches 1 /[ 3(1 + p)\ whereas the analytic solution approaches 


F x (x m = 0,p)= -(l-2p)-p 2 ln 


(31.33) 


or (1/2) — p when p <cl and l/3p when p » 1. Thus, for the metal-like values where p is large, the approximation is 
adequate, but for semiconductor parameters where p is small, a noticeable difference accrues, as in Figure 31.10. 

There are two problems with using F x (p) and settling for a modified Fowler-DuBridge equation. The first is that, as 
a function of energy, F x {p) is not very friendly and makes for a more difficult integration in Eq. (31.29) so that an 
analytical solution comparable to that of Eq. (31.27) for metals is going to give way to numerical evaluations. The second 
problem is that the transmission probability for a triangular barrier is not a step function, although the fields associated with 
photoemission are often such that Eq. (18.46) is a useful limiting approximation, for which 


4 [E(E-E a )] 1 ' 2 
D A^> K (£1/2 + (£-£ a )l/2)2 


(31.34) 


so that the modified Fowler-DuBridge model of Eq. (14.5) is at best only qualitatively correct for a triangular barrier. 

The impact of D^JfE) can be accounted for by using constant but reasonable values for R(co) (set to 0.2, although it 
varies between 0.1 and 0.35 depending on frequency) and p (as labeled) with the band gap E g and the electron affinity 
E a modeled after Cs 3 Sb parameters [308]. The consequences are shown in Figure 31.11, showing reasonable agreement, 
and so V g and v can be circumscribed by considering the small (s 2 = (hco -E g - E a )/E a <sc 1) behavior. 

To develop an analytic equation with the simplicity of Spicer's Eq. (31.28) is frustrated by the complexity of the inte¬ 
grand, but given that the integrand vanishes at the lower bounds of both integration regions, but rises quickly near the 
lower bound (E « E a ) only to become progressively more linear near the upper bound (E « E m ) as s increases, means that 



X [nm] 


Figure 31.10 Behavior of F i as a function of wavelength 2 = 2 ichc/{E + E ) and x m = y/EJE for various values of p, as defined by Bq. (31.33). For 
metal-like (large p) conditions, the approximate form holds, but for semiconductor-like (small p), the approximation degrades for smaller wavelengths. 
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Photon Energy [eV] 


Figure 31.11 A comparison of Eq. (31.29) using constant values of K(®) = 0.2 and the constant values of p as indicated for the assumed values of 
E g = 1.6 eV, E a = 0.4 eV (fashioned after Cs 3 Sb parameters in ref. [308]). In addition, the data of Figure 31.9 for Taft and Philipp [551] and Spicer [550] 
are shown. 


the integration overall is bounded from below by the trapezoidal approximation of Eq. (A1.9) and from above by twice 
that approximation (put another way, for a given E and in terms of s = (x - x m )/(l - x m ) with x m = yfEjE, the integrand 
for the x integration lies above the line y = s and below the line y = 1 , and the area under the second curve is simply 
twice the first). Using the trapezoidal approximation, therefore, 

f 1 , x Ve-tM 

/ xD(Ex 2 ) —— dx «- y —^-D(E) (31.35) 

X + P 2(1 +p)Ve 

and so using the trapezoidal approximation on the resulting E integration, 

QE « - 1 ~ R (E m - E a )(y/E^ - y/EJD(E m ) (31.36) 

2(1 + p)E]i 2 

or, in terms of s 2 = (hco — E. — E a )/E a , the lower limit is 


QE > 


_2(1 — R)s 5 _ 

(1 +p)(l +s 2 )(l + Vl + s 2 ) ( s + Vl + s 2 ) 


(31.37) 


and the upper limit is twice that, as shown in Figure 31.12 In both cases, the dependence on s in the numerator shows 
that V g = E g + E a as intuited, but that v starts off as 5/2 and approaches unity as s becomes large: the behavior in the 
intermediate regime therefore hovers about v « 3/2, as Spicer argued (corresponding to the s 3 line of Figure 31.12). 



Figure 31.12 A comparison of quantum efficiency numerically evaluated using Eq. (31.29) compared to the lower bound estimate of Eq. (31.37) and 
twice the lower bound for parameters fashioned after Cs 3 Sb (E =1.6 eV, E a = 0.4 eV). Gray dashed lines correspond to QE ex s 5 and s 3 , the later in 
keeping with Spicer's finding. 
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Two of the adjustable parameters (v ~ 3/2 and V g = E a + E.) have now been locked down, but to be usefully predictive 
equations such as Eq. (31.29) (as well as Eq. (31.27) for that matter) have to circumscribe the other two parameters 
R and p. Both require knowing the optical properties of the material, and p requires knowing the scattering rates of 
electrons with other electrons and with the bulk lattice. That will take some work. The optical properties will be considered 
forthwith. The treatment of scattering is deferred to Chapter 32, in particular the two important cases of electron-electron 
scattering in Section 32.3.3 and electron-phonon (acoustic) in Section 32.3.4 after Monte Carlo methods motivate their 
treatment. 


31.5 Reflectivity and penetration factors 


Obviously then it would be better to assume a finite number of principles. They should, in fact, be as few as possible, consistently with 
proving what has to be proved. 

- Aristotle 10 

Models are needed for reflectivity R{oo) and the laser penetration depth <5(w) that appears in p = 8(w)/l(E) of/,(cos 0, E) 
in Eq. (31.18) (the relaxation time t(£?) that is also a part of p is deferred to Section 32.3). These factors govern how 
materials and light interact, and can be briefly considered, recognizing that extended treatments [52, 53, 85, 552] and 
supplementary accounts [55, 80, 553, 554-555] are available. Maxwell's equations for the electromagnetic fields concern 
the vector relationships between the electric £, magnetic H fields, and current J with regards to their spatial and temporal 
behavior. Of the four that are normally experienced on first encounters [99, 100, 556, 557], the two that matter here 
are 11 


V X £ + p 0 d t H = 0 (31.38) 

VxH - e 0 d t £ = J (31.39) 


where e 0 = 0.0552635 q 2 /eV-nm is the permittivity of free space, and p 0 = l/e 0 c 2 = 0.000201335 eV fs 2 /nm q 2 is the 
permeability of free space in the [nefq] units of Tables 2.1 and 2.2. Consider, then, the quantity Vx(Vxf) = V(V • £) - 
V 2 £ and let there be no free charges loitering about so that V • £ = 0 from Poisson's equation. Then the two equations 
for £ and H entail 12 


V 2 - 4-d 2 £ = - 


(31.40) 


where Eq. (3.8) has been used to relate current to electric field by reintroducing conductivity a. For electromagnetic 
waves characterized by £ = £ 0 exp(/r • f - loot ), where k is the propagation constant (k is used so as not to confuse matters 
with k when it is taken to be a damping constant) and m is the frequency, then 


,.= ( 1 + ,^)' /> ( 2 ) 


(31.41) 


The coefficient of co/c is termed the (complex) index of refraction, and to emphasize that it is in fact a complex quantity, a 
small hat as in h usually attends the symbol for it, so that 


. \6xQa V 12 

1 ®<7 2 / 


(31.42) 


where n is the usual index of refraction, and k is called the damping constant. The presence of w indicates that both n and k 
depend on the frequency of incident light. The term a will fade from view for a while, but returns when the distribution 
function approach to finding dielectric parameters is undertaken. 

Let £ = £x so that the magnitude of the force on an electron is F = q£ and that unpleasant notational complication with 
k ■ f can be replaced by kx. When the electron is accelerated over a distance l by a force F, it acquires an energy A F = Fl. If 


10 Aristotle, On The Heavens, Book III (trans. J.L. Stocks), Great Books of the Western World, ed. Robert Maynard Hutchins, Vol. 8, Aristotle I. Chicago: 
Encyclopaedia Britannica, 1952, p. 394. 

11 The electric field notation £ reinforces that light, not electrons, is under discussion. 

12 The notation for permittivity and permeability are being jettisoned, using e 0 = q 2 /\6nQ to enable easy computations using familiar terms 
with transparent units. 
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/ = vdt where the electron velocity is v, then the rate of change of the energy (a power) is dE/dt = Fv. Transferring those 
ideas to a number density p, then the power density transferring over to the electron gas from the wave, or, equivalently, 
the power that is being extracted from the wave, goes as pFv = J£ = a£ 2 (remember that, apart from sign, J = qvp). The 
intensity of light (measured in W/cm 2 ) goes as \£\ 2 , and its empirical decline as light passes through thin films of material 
serves to characterize the decay parameters as a function of wavelength. 

Because of the imaginary part of h, the electric field is decaying as the wave propagates into the bulk material. If y{x) 
is an exponentially decaying function, then the length scale of the exponential decay is -[d x ln(y(x ))] _1 (e.g., trying that 
on y{x) =y 0 e~ x/a returns a, as it should). The length scale 8(o>) governs the exponential decay of the power density or 
intensity into the bulk material as a function of distance x. Intensity is proportional to |£(x )| 2 so that 

<5(®) = {-d ln[|£(z )| 2 ]} -1 = -f- (31.43) 

2 kco 

Experimentally, to measure the absorption coefficient is to measure the transmission through varying thicknesses of 
material, in which case T goes as [558] 

T = (1 - R) 2 e~ d/s /( 1 - R 2 e~ 2d/S ) (31.44) 


where R is the reflectivity. In the thick (d » S) limit, the attenuation found above in Eq. (31.43) is obtained. 

So what of reflection, then? If the electric field is £ = x£ 0 exp (ixx - icot), then the associated magnetic field 
H = yH 0 exp(;Vx - icot) from Eq. (31.38), where H 0 = {ac/oo)£ 0 . These incident waves are crossing a boundary with 
transmitted and reflected waves, making for a problem extraordinarily reminiscent of the tunneling problems of Section 
18.1. In fact, if £ with the subscripts i, r and t refer to the incident, reflected, and transmitted waves, respectively, then 
the matching of the waves and their first derivatives at the boundary x = 0 results in a matrix equation of the form 
(where x i = x r because both are in the same medium, for example vacuum, and k t is the complex parameter in the bulk 
material) 




where the little hats on k (i.e., k) denote a complex quantity, and where the top row of the matrix equation refers to 
matching the waves, and the bottom to matching their derivatives, at x = 0. But after experiencing the evaluation of 
transmission probabilities using the transfer matrix approach of Section 19.1, the present task is straightforward. Solving 
for £ r 0 in terms of £' 0 results in 

E r o = K ±^iE i 0 (31.46) 


Defining the reflectivity R as the ratio of the absolute square of the magnitudes of the reflected with the incident wave 
results in 


R(w) = 


I 


|n-l | 2 _ (n - l ) 2 - k 2 
\h+l\ 2 ~ (n+l) 2 -k 2 


(31.47) 


Values of n and k can be empirically found and tabulated, as in refs [55] or [309], but simple models shed light as to 
what causes them to take the forms they do. Familiarity with the Drude model of conductivity in Section 3.2 and with 
the dielectric constant in Section 22.6 will help. 


31.6 Lorentz-Drude model of the dielectric constant 

A theory based on thefrequency-dependent dielectric constant in the random phase approximation ... predicts and the experiments confirm 
essentially free electron-like behavior before the onset of d-band excitations and a plasma frequency modified from that of free electrons 
due to oscillator strength coupling between valence and d bands and d-band screening effects. 

- Herbert R. Philipp and Henry Ehrenreich [553] 

The classical theory of absorption and dispersion is due mainly to Lorentz and Drude. The Lorentz model is applicable to insulators... The 
Drude model is applicable to free-electron metals... Both the Lorentz and Drude models are largely ad hoc, but still useful as starting points 
and for developing a feeling for optical properties. We shall see that many features of these classical models have quantum mechanical 
counterparts which are easily understood as generalizations of their classical analogs. 


- Frederick Wooten [552] 
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Figure 31.13 The application of £ pulls the distribution of negative charge (red fuzzy circle) away from the center of positive charge (small blue circle) 
in atoms or molecules. The dipoles that form give rise to the polarization vector P, which is proportional to £. 


Conductors shield out external fields quite effectively, as roaming electrons in them pile up near the surface so as 
to completely block external fields from penetrating into the interior (or, for that matter, aggregating around charged 
inclusions as in Section 22.6). For semiconductors or insulators, however, the electrons may not have the same freedom: 
the atoms from which they would have otherwise escaped retain a hold on them. In a simple visualization, such atoms 
in an electric field thereby become polarized whereby the center of positive charge is off-set from the center of negative 
charge and so a dipole is formed, as in Figure 31.13. The dipole moment p is proportional to the electric field £. Many 
such little dipoles conspire to produce a macroscopic P, and so for static conditions 13 

P = e oX £ (31.48) 

When dielectrics are present, therefore. Maxwell's equations are retained in form by introducing D to take over for £, 
where (in conventional notation) 

D = e 0 £ + P = e 0 (\+x)£ = e£ (31.49) 

for which V • D = q 2 p with p as the number density of free (not bound) charge. When conditions are not static, then the 
separation of charge that makes the little dipoles and gives rise to P(f) is a process that lags behind £, and so / takes over 
the role of a response function whereby 

P(t) = e 0 J (31.50) 

where x(t) is a real function. Using the convolution theorem (as done in going from Eq. (22.33) to Eq. (22.35)) gives 

Pia>) = £ Q x(ofr£(ofr (31.51) 

where the hat notation on x indicates that it is complex. The relationship between the complex index of refraction and 
susceptibility is then 

n(®) 2 = 1 + x(m) (31.52) 

Finding x(m) is involved but two kinds of arguments, the first treating the electron as a damped harmonic oscillator 
and the second using a distribution function approach (a quasi-classical Boltzmann transport equation), are inordinately 
useful. Such simple models are often used to understand the frequency dependence of absorption and reflection for 
metals, but also illuminate the same dependence in semiconductors [553]. The latter, though, require a bit more caution 
(especially for good photoemitters like GaAs) because the carrier density of electrons is less, the semiconductors have a 
band gap, and the relaxation time is no longer constant as for low frequencies but becomes frequency dependent when 
the photon energies are sufficiently large (around infrared). Frequency dependence in turn changes the dependence of 
the absorption coefficient from X 2 as in the Drude model to X 3 or X 4 in a quantum mechanical theory [558-560], The 
aim, however, is not to derive that dependence, but rather to show why the reflectivity R(co) and the penetration depth 
8{co) are frequency dependent in the first place by finding out their relation to x(°>) via Eq. (31.52). 


13 There is more to the microscopic to macroscopic transition than just this [99, 556], but the description is adequate for simple models. Still, if 
the dipole is proportional to the field, as in p = a£, then the macroscopic polarization P = Na£, where aifo) is the polarizability for a one-electron 
atom. For those who crave more, see Wootan [552]. 
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31.6.1 Harmonic model 

Non-violent resistance to evil does m 


n absence of any resistance whatsoever but it means not to resist evil with evil but with good. 

- Mahatma Gandhi 14 


Resistance is a fundamental feature of bound particles, manifested as a dampening to forced motion. Consider a simple 
model first. If the position of a bound electron to the dipoles of Figure 31.13 with a mass m oscillates in response to the 
time-varying electric field and experiences dampening while doing so, then its acceleration can be related to the forces 
acting on it by [561] 

—F = mdfr(t) + — d t f(t) + m(o 2 0 r(t) (31.53) 

where the left-hand side is the force F = q£ due to the electric field, while the factors on the right-hand side are the 
acceleration, the dissipation (or dampening) term, and a restoring force, respectively. Substituting the Fourier transform 
of r(0 into Eq. (31.53) results in 

r(®)= £(® 2 -® 2 -/®) (31.54) 

where the relaxation time t is a constant [311] (an assumption of the Drude model). Such an equation is for only one 
bound electron, but there are many of them doing much the same thing. If the number of electrons per unit volume is 
p 0 , then all the little dipoles acting in concert result in 


(® 2 - ® 2 - if) 


where the plasma frequency ® 2 = 1 6nQp 0 /m = q 2 p o /e 0 m has been introduced. Lastly, Eq. (31.55) suggests that the static 
dielectric constant e(0) is given by 

e(0) = e 0 (l + jr(0» = £ 0 ^1 + ^ = K 0 e 0 (31.56) 

where K 0 e 0 is the static dielectric constant. Going further, and introducing the high frequency limit £„ = K^e 0 , then 

— (31-57) 


EXAMPLE: Find the wavelength associated with the plasma frequency a 
assuming m is the rest mass of the electron. 

SOLUTION: From Eq. (6.24), p = h 2 k 2 /2m and p 0 = k 2 /(3x 2 ), and observing that X p = 2jcc/oo v along with R y = 
2 Q/a 0 = ahc/2a 0 , then 

or 2 p = 115.13 nm, a wavelength associated with ultraviolet light. 


p for copper-like parameters (p = 1 eV), 


= 2175.6 


EXAMPLE: Approximate the static dielectric constant for a semiconductor with p 0 = 5 x 10 22 cm -3 assuming one 
electron per atom and ®„ corresponding to an optical wavelength of 500 nm. Assume m n = m. Contrast to the static 
dielectric constant of silicon K s = 11.9. 

SOLUTION: Numerical evaluation of ® p = \f\6nQpJm = 12.615 rad/fs. For 2= 500 nm, then co 0 = 2jic/ X = 
h.KsTb rad/fs. Consequently, 


£(0) _ / 12.615 y 

£ 0 + V 3.7673/ 


= 12.212 


or about the same as silicon. 


4 As quoted in N. Iyer Raghavan, The Moral and Political Thought of Mahatma Gandhi. New York: Oxford University Press, 1973, p. 274. 
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For metals, there is no restoring force (the electrons roam freely) so that co a = 0, taking the Lorentz model to the Drude 
model. When the dielectric function can be separated into bound (Lorentz) and free (Drude) contributions, the optical 
properties can be related to the band structure and the obtained from reflectance data using Kramers-Kronig relations 
[552, 553, 561], Starting from (n + ik) 2 = 1 + % and then equating the real and imaginary parts of the left- and right-hand 
sides, Drude theory predicts 


®(1 + ® 2 * 2 ) £o 

where i = e r + ie i . With a bit of effort, n and k can be expressed in terms of e r and e ( -, or 

from which n(co ) and k(co) are readily isolated, and R{&) evaluated in turn. 


(31.58) 

(31.59) 


(31.60) 

(31.61) 


31.6.2 Quasi-dassical Boltzmann model 


So distribution should undo excess, 

And each man have enough. 

- William Shakespeare 15 

Now that the harmonic model has presaged the impact of resistance (inverse conductivity <r) on the optical parameters, 
undertake a harder but more insightful reconsideration using a distribution approach, familiar in the transition from 
Eqs (7.16) to (7.18), along with the perturbation methods in the treatment of Eq. (21.61). That is, let the distribution 
function / be the sum of a distribution/, (such as the Fermi-Dirac distribution) that exists in the absence of fields plus 
a modification/, that accounts for the turn-on of the field, or / = / 0 + 2/, (where here X is taken from 0 to 1 as the field 
is turned on, and is not wavelength). For the equilibrium case, Eq. (7.18) becomes (where hk resumes its meaning as a 
momentum rather than A: as a damping term) 

a /o + ^/ 0 = 0 (31.62) 

as usual, but it is the next equation of order X that matters, and is 

a/i + ^/ 1 -|d/ 0 = -i/ 1 (31.63) 

The difference between / and f 0 can be expressed in a Taylor-like expansion 

f-f^<p(x,k,t)(-dj 0 ) (31.64) 


where cp accounts for the variations with x and k. If the force F has spatial and temporal variation, 
arises from light, then 

F(x, k, t ) = q£ 0 expO'jrx - ioot) 

It is to be expected, then, that 


cp{x, k, t ) = cp 0 (k) exp(firx - ioot) 


will if the £ field 
(31.65) 


but if that is the case, and using drf = d k E d F f, then Eq. (7.18) becomes 


[(-iffl + iK ^) ?,(*) - J 4^] (-<U) = -\cp 0 {k){-dj 0 ) 


(31.67) 


15 Ref. [37]: King Lear, IV.i.70-71. 
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Upon making the substitutions d k E = h 2 k/m (the parabolic approximation) and k = h(co/c) (from Eq. (31.41)) then from 
equating the coefficients of (-d F f 0 ) on the left- and right-hand sides. 


(31.68) 


where it has been assumed that the velocity Tik/m is much smaller than the speed of light so that Tik/mc 1. 

A parenthetical comment following Eq. (31.42) suggested conductivity a would resurface, and that time has come. 
The DC conductivity a of Section 3.2.1 is changed by the presence of the time-varying field. With an approximation for 
/i now in hand from Eq. (31.64), then 




(31.69) 


where the zero-temperature limit of the supply function f(k) (Section 11.1) resulting in the 0 function, is convenient to 
take. The integration over k results in 


o-(w) = 


3 n 2 m 


(1 — icor) 1 = o-(O) 


1 + ion 
1 + co 2 t 2 


(31.70) 


where Eqs (3.9) and (6.24) were used to define the DC conductivity er(0), and the r-dependent fraction rewritten to make 
the separation into real and imaginary parts easier. The Drude model gives for metals, and in terms of the conductivity. 


where the equivalence 


is invoked (harkening back to Eq. (3.9)). 

The small cot limit for R(co) gives the Hagen-Rubens relation: from Eqs (31.58) and (31.59) it follows that as cot 
vanishes, then e r (w) becomes small and e ; (®) becomes large. Letting e r -> 0 in Eqs. (31.60) and (31.61), then subsequently 
neglecting terms of order \j2e 0 and e 0 /\/e ; (w) compared to £,■(»), it is shown that 


R{m 



(31.73) 


This suggests that when cot m (® p r) 2 /8, then the reflectivity becomes small: at high enough frequencies (into the UV), 
metals become increasingly transparent. 

The best metal on which to test the prediction of R(co) using Eq. (31.47), with n(co) and k(co) given by Eqs (31.60) and 
(31.61) using Eq. (31.71), is aluminum as it most closely matches conditions behind the free electron theory in the Drude 
model [561, 562], The data can be compared to tabulated values ([309] or [563]) for aluminum if an estimate of t is 
available. 


EXAMPLE: Find an estimate for the relaxation time t required for predicting the reflectivity R(co) for aluminum 
using hco p = 14.98 eV [564] and o-(O) = 3.5 X 10 5 [Ohm-cmT 1 [563], 

SOLUTION: From Eq. (31.72), and knowing that q 2 Ohm cm = 1602.2 eV fs nm. 


s>p(1602.2 eV fs nm) 


Usage of t = 7.6318 fs in Eq. (31.71) results in the Theory 1 line of Figure 31.14, which shows that below hco p the 
reflectivity is high. Since co p for aluminum corresponds to k p = 83 nm, aluminum (and metals in general) are mirror-like to 
wavelengths in the visible: they reflect most of the light incident upon them. The relaxation time, though, was evaluated 
using the DC conductivity <r(0), which begs the question of what if t were evaluated for frequencies closer to co p by some 
other means? That invites the next example. 
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Figure 31.14 Reflectivity of aluminum data [309] compared to Eq. (31.47) via Eqs (31.58) and (31.59) with i evaluated using <r(0) (Theory 
eV/ft) (Theory 2). 


R(14 


EXAMPLE: Using hco p = 14.98 eV as before, but now letting R(w) = 0.8486 for hco = 14 eV [309, 563], find an 
alternate value of r. 

SOLUTION: Using a search algorithm much like that of Section A3.7.1, it is found that a value of t = 1.4967 results 
in the given value of R = 0.8486 for m = 14 eW/h = 22.759 rad/fs. 


Usage of t = 1.4967 fs in Eq. (31.71) results in the Theory 2 line of Figure 31.14. There is improvement in the inter¬ 
mediate values of ®, although 7?(0) is under-predicted, even though for m small (hco < 1 eV), the prediction is a bit off. 
Clearly, although the one-parameter (t) Drude theory behind Eq. (31.71) is surprisingly good, the differences in t as 
calculated using the two different approaches suggests strongly that there is more to the physics involved. It takes a 
quantum mechanical approach to ferret out the right behavior by effectively adding more Lorentzians to e. 

31.6.3 Quantum model 

Sean: If I asked you about war you could refer me to a bevy of fictional and nonfictional material, but you've never been in one... There's 
nothing you can tell me that I can’t read somewhere else. 

- Good Will Hunting 16 

Knowledge of how the additional Lorentzians terms to be added to the Drude equations come about is different than 
knowing how to use those Lorentzians for developing a better model of the reflectivity Rico) and decay parameter 8(co). 
The availability of good and detailed treatments [52, 552, 553, 561] allows the present discussion to focus on how the 
Lorentzians can be used to obtain reflectivity and decay relations, without becoming mired in arcana. 17 Nevertheless, a 
demonstration of why the Lorentzian form is plausible instills confidence when using it. 

An oscillating electric field £(t), assumed to be in the x direction, is represented by a force term of the form F 0 cos cot. 
If the initial state of the electron in an atom is designated \(/> 0 ) then as a result of the perturbation by an electric field the 
perturbed wave function takes the form 

M0> = c 0 \<p g )e- ia ‘‘ + £ CjWtye-W (31.74) 

i+o 

where the energy of the jth level is characterized by hcoj and where the q are assumed to be small except for j = o for 
which c 0 is approximately unity, 18 hence its explicit isolation in Eq. (31.74). By the orthogonality of the basis states. 


16 Matt Damon and Ben Affleck, original screenplay of Good Will Hunting, 1997. 

17 Examples of such subtleties are what are the different behaviors of the free versus bound electrons and what is the value of the electric field 
the harmonic oscillators are oscillating in? Regarding the latter, it is often taken to be that which is at the nucleus of the atom, but the electric 
field in a material varies locally in and between atoms, and the question becomes rather complex. 

ls The initial state is sometimes labeled by m and sometimes i, but avoiding confusion with electron mass m and i = \/-T has virtue. 
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Schrodinger's equation becomes 


ih d t cft ) = {cj>j\F(t)x\c}) 0 )e i{w i ®» )( 


(31.75) 


where the | <pj) are solutions to the Hamiltonian without the F(t)x contribution. It is convenient to introduce the notation 


0 = J cos cot (T^dt 

F f 1 — 1 _ J(oij 0 -O))t \ 


(31.76) 


where the explicit time dependence of F(t) remains in the integrand. Recall that the dielectric constant is associated with 
the polarization of the atoms (Eq. (31.55) and its discussion), and that the little dipoles go as (qx). With the eft) in hand 
so that the wave function is given by Eq. (31.74), and the assumption that all of the eft) save; = o are small, the little 
dipole can be approximated to leading order (i.e., neglecting quadratic terms in Cjft, using the orthogonality of 1 4>j), and 
taking a time average, so as to neglect those terms that oscillate at a higher frequency than the driving force, to give 19 


(y/(t)\qx\ys(t)) « £ \x jo \ 2 (—^ cos(wi) 


\x j0 \ 2 = K4>j\x\4>o)\ 2 


(31.77) 

(31.78) 


The dipole moment is therefore oscillating with the same frequency as the electric field, and so the coefficient of cos cot 
is what contributes to the definition of the dielectric function. Using (co jo - coy 1 + ( oo jo + coy 1 = 2® J0 /(w? - co 2 ) and intro¬ 
ducing an oscillator strength factor 

fj = (ff)HM 2 < 31 - 79 ) 

then the atomic polarizability (recall the discussion surrounding Eq. (31.48)) is proportional to (q 2 - co 2 ), 

which, when all the little contributions of the polarizations are summed up, gives the quantum extension to Eq. (31.55). 
However, there is one complication: no mechanism has been included to account for the transition from one state to 
another, which causes the initial state to decay as exp(-t/r 0 ) and the state | <pj) to which the electron transitions to grow 
as expit/ft). That mechanism may be inserted by replacing [552, 553] 

hoo in -> hoo in + — (31.80) 

1 1 2t- 


Following the addition of how the imaginary part ratchets through the calculation requires care, but in the end Eq. (31.78) 
becomes 


(y/(t)\qx\y/(t)) 



2Wj 0 e mt 'l 

ol-^-Kt/ft) 


(31.81) 


where (co^ft » 1 is assumed so as to neglect factors of t. 2 in the denominator. The term 
as 

- co 2 ) cos cot ( cOj 0 /Tj)co sin cot 

* ^ “ ® 2 ) 2 + (t/'Cj) 2 + (® 2 - ® 2 ) 2 + ( t/Tj) 2 


in parentheses can be rewritten 
(31.82) 


The first term oscillates with the phase of the electric field, and the second out of phase. The out-of-phase part leads to 
absorption and transition, and gives rise to the imaginary part of the dielectric function. To simplify matters, drop the o 
subscript on Wj g -* coj, and then let the oscillator strength factor /J be defined by 


h=~ 


(31.83) 


J Thej = 


is constant, and therefore taken out of consideration for the time-dependent analysis. 
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Lastly, using the relation ( a - ib)/(a 2 + b 2 ) = l/(a + ib) for general a and b results in the quantum version of Eq. (31.55) 
of 


X(fi>) = X 


CO 2 - CO 2 + i(.00/Tj) 


(31.84) 


Such is the synopsis of the Lorentz-Drude model of the dielectric function, but to know how to use it is required. The 
form of the complex dielectric e is separated into intraband (a free-electron-like or Drude part for metals) and interband (a 
bound-electron-like or Lorentz part for insulators) components that look like [564] 


e(oo) = i f (w) + e b (w) 

fotf 


e*(®) = X 


iT 0 ) 

H 


(fi>j - a 2 + icoTf) 


(31.85) 

(31.86) 

(31.87) 


where T) = 1 /r 20 For a number of metals, typically the upper limit of the summation n need only be 4 or 5. The values of 
fj, oo j, and Tj for; ranging from 0 to 5 (where w 0 -* co p for simplicity in the table) is given in Table 31.2. With i(w) in hand, 
the calculation of n and k via Eqs (31.60) and (31.61) is straightforward, from whence R(w) and S(oo) can be obtained 
from Eqs (31.47) and (31.43). For the values of Table 31.2, the results of such a calculation are shown in Figure 31.15. 

How good the model is can be assessed by comparison to copper data 21 to measured values of n(m), k{co), and R(co) as 
shown in Figure 31.16, the data being chosen from refs. [55] and [309]. Clearly, the Lorentz-Drude model with the table 
values performs reasonably well in reconstructing the measured data. Apart from the thrill of knowing how to approx¬ 
imately calculate the optical parameters, the shown variation makes it clear that relying on a single value of reflectivity 
or laser penetration depth in the approximation of quantum efficiency (or emittance, for that matter) as a function of 
wavelength is not desirable: the variation in the optical region of the spectrum is generally too great, a comment that also 
applies to absorption (which goes as 1 - R) for the alkali-anitmony compounds and other semiconductors that constitute 
the favored higher quantum efficiency semiconductors [168, 550], 


Table 31.2 Lorentz-Drude model parameters: the oscillator strength factor fj, the restoring energy hoo:, the plasma 
frequency ® p , and the scattering factor rj for various metals of interest to photocathodes. Observe that fj is dimensionless, 
but ho)j and KTj have dimensions of [eV], Data compiled from Tables 1 and 2 of ref. [564], 


Term Ag Au Cu Al Ni W 


0.845 0.760 

0.048 0.053 

9.01 9.03 

0.065 0.024 

3.886 0.241 

0.816 0.415 

0.124 0.010 

0.452 0.345 

4.481 0.830 

0.011 0.071 

0.065 0.870 

8.185 2.969 

0.840 0.601 

0.916 2.494 

9.083 4.304 

5.646 4.384 

2.419 2.214 

20.29 13.32 


0.575 0.523 

0.030 0.047 

10.83 14.98 

0.061 0.227 

0.378 0.333 

0.291 0.162 

0.104 0.050 

I. 056 0.312 

2.957 1.544 

0.723 0.166 

3.213 1.351 

5.300 1.808 

0.638 0.030 

4.305 3.382 

II. 18 3.473 

0.0 0.0 

0.0 0.0 

0.0 0.0 


0.096 0.206 

0.048 0.064 

15.92 13.22 

0.100 0.054 

4.511 0.530 

0.174 1.004 

0.135 0.166 

1.334 1.281 

0.582 1.917 

0.106 0.706 

2.178 3.332 

1.597 3.580 

0.729 2.590 

6.292 5.836 

6.089 7.498 

0.0 0.0 

0.0 0.0 

0.0 0.0 


20 Whether one uses h = n — ik or h = n + ik will determine whether one has found h or ft*. However, since one will deal with magnitudes, the 
distinction is not worth agonizing over. Both conventions are used in the literature. Beware. 

21 See Rakic etal. [564] for comparisons to a modification to the model that performs better. 
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Photon Energy [eV] Photon Energy [eV] 


Figure 31.15 The reflectivity R{a>) (Eq. (31.47)) and the laser penetration depth 6(a>) (Eq. (31.43)) calculated using the Lorentz-Drude form of e(®) (Eqs 
(31.86) and (31.87)) and using the parameters of Table 31.2 for typical metals. 



Photon Energy [eV] 


Figure 31.16 Values of n, k, and R obtained from refs. [55] and [309] for copper compared to the Lorentz-Drude model of Eqs (31.86) and (31.87) 
applied to Eqs (31.60), (31.61), and (31.47) for copper. Gray vertical lines show common laser wavelengths. 


31.7 Scattering contributions 

From too much love of living, 

From hope and fear set free, 

We thank with brief thanksgiving 
Whatever gods may be 
That no life lives for ever; 

That dead men rise up never; 

That even the weariest river 
Winds somewhere safe to sea. 

- Algernon Charles Swinburne 22 

To account for the "fatal" removal of electrons by scattering, revisit the equation for quantum efficiency from metals 
in Eq. (31.21), rendered more useful by fixing k m and k 0 to be 

k m =j^yJ 2 m[p — (hco — $)] (31.88) 

k 0 = - hoo) (31.89) 


22 Algernon Charles Swinburne, The Garden of Proserpine, in M.L. Rosenthal, Poetry in English: An Anthology. New York: Oxford University Press, 
1987, p. 744. 
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Then the QE for metals can be recast as 

/** k 2 dk f l.. [kxfi (x, p)] dx 

QE = (1 — R )—— . - 

k 2 dk f\ [k\dx 


(31.90) 


The square brackets are intentional: it is convenient to introduce an operator-like notation that denotes the integrals. 
Therefore, define two operators 



k 2 dk / | 

■ ■ ]dx 

(31.91) 

£>„(■■■)= / 

r k 2 dk f [■ ■ 

■]dx 

(31.92) 


where the o subscript denotes that the lower limits of the integrals in 0 are different (essentially corresponding to the 
absence of the impact of <j>) and where <p(k) = cos 0 m (k). As a consequence. 


QE = (1-R(a>)) 


0[kx 2 /fx + p )] 

o 0 m 


(31.93) 


The denominator b g considers the photoexcited electrons going in all directions, whereas the numerator 0 considers only 
those within the emission cone towards the surface. The similarity of this notation to (k z ), for which h(k z ) denotes the 
average forward momentum of the emitted electrons, is worth noting: 


b\kx 2 /(x + p)\ 
OJl] 


(31.94) 


EXAMPLE: Find the relation between QE and k ave = (k z ), and show that its variation is 

small for 2 > 266 nm and 

copper-like parameters. 

SOLUTION: Using Eqs (31.93), (31.94), and (30.26), consider 



o 0 m _ 3 ( J 4- k t\_ 3 

f S 4 (k 0 /k F ) ~\ 


r k F O g [ 1] A k F ykl-k]J 4 < 

{ S,(k 0 /k F ) } 


where p = k 0 /k F = y/l - (hoo/g). In the long wavelength limit (2 -> c 

»), and S„(l) 

= n so that r -> 1. For 2 = 

266 nm and g = 7 eV, p = 0.578. Therefore, the ratio of the two limits is r(266)/r(oo) = 

1.101, or a 10% variation 

over the range of interest. 




31.7.1 Dowell-Schmerge relation 

It is clearly desirable to maximize the QE and minimize the thermal emittance but due to their interdependence this is difficult. 

- David H. Dowell and John F. Schmerge [181] 

For unscattered electrons, a relationship between quantum efficiency and emittance can be developed, so that depar¬ 
tures from it serve as an indication of the contributions of scattered photoexcited electrons. The formulation of Eq. (31.94) 
is useful in relating quantum efficiency to emittance e nra!S (see also Section 33.3 and Eq. (33.40)) by placing them on 
comparable footing to draw out similarities between them [565], given that emittance is often represented as 

— 

where h 2 (k])/2m is the mean transverse energy (MTE) for uniformly emitting rotationally symmetric areas of radius p c . 
Because QE <x (k z ) (more or less) and e n rms <x yj(k 2 L ), they should bear a relationship to each other that will have impact on 
the general desire to maximize quantum efficiency but minimize emittance, something that is thwarted by the tendency 
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of both to increase with ( hco - cp). Showing that can be accomplished without necessarily having an analytic form for 
each by general arguments about how 0 (and 0 0 ) behave. 

The integrals behind b 0 [k n ] for n = (0,1) (or any integer for that matter) means that it can be evaluated to give 

O 0 {k n ) = ^y(^ +1 “ K +l ) (31.96) 

Similarly, the angular integration in 0\kx 2 /(x + p)\ is analytic, and gives rise to a function G{s) for s -*■ cp{k) = cos 0 m {k) = 
k 0 /\Jk 2 + k 2 , so that 

G{s)=±{l-s){l-2p + s)+p 2 \n(^±Z^ (31.97) 

f k kp k 2 G\cp{k)\dk 

QE = 2(1 - R) m ^ ^ - (31.98) 

leaving the k integration in QE to other means. Proceed now in three steps for the determination of the dependence of 
QE on e njms : ( i) develop insofar as practical a reasonable approximation to Eq. (31.98) on the assumption that hco - cp is a 
small quantity, (it) infer the same dependence using more general means, and (in) apply the techniques of the second step 
to find the dependence of e n rms on hco - cp as well. Having a power law relation for both QE and e n rms in terms of {hco - cp) 
means that there is a power law relation between them, the determination of which results in the Dowell-Schmerge 
relation. 

For the first step, observe that when the argument of G{s ) is close to unity (s « 1), then 


G(s)« 


1 +P 


(2p + 1)(1 — s) 2 

2{p + 1)2 


(31.99) 


where the first term was used in much of Section 31.4.1. Writing the first term as G(s) « (1 —s 2 )/[{ 1 +p)( 1 + s)| for 
s -* <p(k ) shows that G(cp(k)) is approximately linear in (k 2 - k^), which vanishes at k=k m (the energy of the photon is 
insufficient to cause photoemission even when the electron is directed to the surface): this allows for the approximation 


GWM * C,\cp{k,.)\ 



to be used in Eq. (31.98), and which results in 


QEx(l-R) 


2 k% - 3k 2 m k$ + k 6 m 
6 {k* P - J4){k 2 F - k 2 ) 


G[<p{k F )\ 


Using the first term of Eq. (31.99) and the relation of Eq. (31.23) gives rise to the approximation 


G[cp(k f )] « 


k 2 P -k m 

(l+p)(2k 2 P -k 2 ) 


Finally, taking k 2 -k^ n = (2m/h 2 )(hoj - cp) to be small, then 


2k 6 F -l>k 2 m k* p + k 6 m 
\ 1 +p) (>4 - 0(2k 2 P - k 2 ) 

/ 1 -R \ p(hco - cp) 2 

\ 1 + p ) 4hw(p + hco){2p — hw) 


(31.100) 


(31.101) 


(31.102) 


(31.103) 


to leading order in {hco-cp), as in Eq. (31.27). The desired relation, therefore, is that QE~(hco — cp) 2 (albeit other 
Tito-dependent factors attend it). 

For the second step, reproduce Eq. (31.103) using less exact but more intuitive means, as the same means shall be 
employed to consider the dependence of £ nrms on the small factor {hco - cp). The integrand of the numerator of Eq. (31.98) 
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vanishes at the lower limit, enabling an approximation that makes use of Eq. (A1.9) to give 23 
f kr , k 3 

/ k 3 G[<p(k)]rfk « ^G[cp(k F )](k F -k n ) (31.104) 

making use of, first, ( k F - k m ) = (k 2 - k 2 )/(k F + k m ) » (k 2 - k 2 ,)/(2k y ) and, second, using Eq. (31.102) shows that for 
k 2 - k 2 = (2 m/h 2 )(hco - <j>) small, then 


QE ex (k 2 - k 2 m f = (^\hco-cj)) 2 


(31.105) 


as the first, and longer, approach showed. 

Finally, for the third and last step, apply the methods of the second step to the consideration of emittance. In terms 
of the MTE, emittance e n rms , should also depend on (hco - r/>), so that a correlation between emittance and quantum 
efficiency can be formulated. There are a few points to pay attention to. First, h 2 k 2 L /2m = (E + ho))sln 2 0 and, second, the 
averaging (■ ■ ■) is calculated with respect to emitted electrons, and so the denominator must also include a factor of /-. 
That is, 


(k i> = 


0[(k 2 + k 2 )(l - x 2 )x/(x + p)] 
0[x/(x + p)\ 


(31.106) 


where 6 (not 0 0 ) appears in the denominator. Now applying the same trapazoidal-like rule on both the numerator and 
the denominator, as done for quantum efficiency previously, shows that 


/fJX (k 2 + k 2 )(l - cp(k P ) 2 ) 
{ x> * 2 k F 


(31.107) 


but (1 - cp(k F ) 2 ) = (k 2 - k^)/(2k 2 - k 2 ) = (hco - cj>)/(hco + p), and so from Eq. (31.95) it follows 


e n,rms « ~ (31.108) 

The combined results of Eqs (31.105) and (31.108) taken together therefore gives rise to the Dowell-Schmerge relation 
[181, 565] expressing the relationship between quantum efficiency and emittance when the fatal approximation holds 

QE cx e 4 n rms (31.109) 

Compare its predictions to measurements of quantum efficiency and e as a function of photon energy by Vecchione etal. 
[566] on oxidized antimony, incorporated in Figure 31.17 and recast in a manner to bring out the behavior as a function 
of (hco — (j>) n for n = 2 for QE and n = 1 /2 for e n ms in Figure 31.18 using fixed values of R = 1/2 and p = 1. Taking ratios for 
constant R eliminate its influence, and the variation of the ratio with p is not great. It is seen that the simple theory gives 
a fair account of the behavior for small (hco - $) in Figure 31.19, and therefore supports the Dowell-Schmerge relation of 
Eq. (31.109), but it is also seen that departures occur, which may be due to departures from free-electron-like behavior, 
the wavelength dependence of both R(co ) and p(a>) = 8{a>)/l(E), the contribution of scattered electrons, temperature effects 
(particularly for small hco - cj)) or even some combination of all of the above. Nevertheless, the presumed constancy of R 
and p gives a reference point from which to judge the influence of the scattered electron contribution when it matters. 

Why the focus on how QE and e vary with (hco — </>)? Insofar as Eq. (31.109) holds, it leads to Dowell and Schmerge's 
rueful observation introducing the present section: the best that can be obtained is to maximize the quantum efficiency 
insofar as the emittance levels can be endured. This motivates returning to the derivations (particularly Eq. (31.93) for 
QE and Eqs (31.95) and (31.106) for e n rms ) and first extending the result so that (hco - cj)) need not be that small, and 
then assessing the impact of scattered electrons (which tend to have lower energy) to the relation. 


23 Knowing that the integrand f(x) vanishes at the lower limit and also behaves approximately like a power law, or f(x) ~ (x - a) a , means one 
can do better: in place of the trapezoidal rule, a better approximation is f*f(x)dx m (a + 1 T l f(b)(a -b) = (a- &)“ +1 /(« + 1): the important part 
here is not the coefficient l/(a + 1) (which does not contribute to ratios) but rather f(b)(b - a). 
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Figure 31.17 The data of Vecchione [566] (data digitally extracted) as a ratio with the value at a photon energy of 4.5 eV (an arbitrary reference value) 
alongside guide curves of the form y(E) = y(.E 0 )[(E - 4>)/(E 0 - </>)]" with n = 2 ( QE ) and n= 1/2 (a) as suggested by theory. 


Figure 31.18 The data of Figure 31.17 
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31.7.2 Scattered electron contribution 


0 Joy! that in our embers 
Is something that doth live, 

That nature yet remembers 
What was so fugitive! 

- William Wordsworth 24 

Including scattering is not easy. For metals, the colliding electrons tend to share what they have on average, so that 
if the primary electron has an energy of p + hoo and the target an energy of p, then both have (p + tioo/2) on average 
after the collision. That can be a rather substantial change. The situation with semiconductors is a bit better, as changes 
in energy are associated with phonons, for which the energy losses are not as large (A E is on the order of 0.1 eV), but 
many more scattering events are allowed before so much energy has been lost that emission is no longer possible, and 
that is a difficult problem often requiring Monte Carlo (Chapter 32). But keeping the discussion just to metals is enough 
to get the point. Let f x (E) be the number of electrons that do not scatter before emission, and f x be the number that do, 
so that all the electrons are accounted for by the equation 

1=/a+7a (31.110) 

A little reflection reveals this analysis may be iterated, as the scattered electrons follow a similar behavior, and so, using 
the notation f n = f x {E n ) 

1 = /0+/0V1+/1 \f 2 +U--(fn +/„)•••)]! (31.111) 


= f 0 +?ofl +/ 0 / 1/2 + ■ ■ • ( Ylfj )fn 


(31.112) 


with the first term corresponding to no scatterings, the second to one scattering, and so on until n scatterings, where n 
corresponds to the number of scatterings required before the electron energy has dropped below the barrier height. 

Two approximations are useful. First, the photoexcited electron (the primary) colliding with an electron at the Fermi 
level results in two electrons with, on average, an energy of p + Tico/2, but also on average only one travels in the direction 
of the surface. Second, observe that 


cos 6 + p cos 9 + p 




(31.113) 


and so f n « p n /( 1 + p n ). The second approximation means that the generalization to 0\ A \ for some argument A becomes 



(31.114) 


where the index j on 0\A\j means that the energy of the electron before scattering, previously taken to be p + hco if it 
is excited from the Fermi level, is now evaluated at Ej. For electron-electron collisions, mathematical induction shows 
that Ej may be approximated by Ej x p + hoc/2 J after the;'th scattering but before the (j + l)th, whereas when the energy 
is removed in small units (as with electron-phonon collisions in semiconductors, where A is a small fraction of an 
electronvolt), then Ej x hoo — E g —jA. For metals, where scattering aggressively removes excess energy beyond p, the 
first scattering term dominates and so 

Q[A] „ OlAlo + P, 0|A|, 

6 [ b ] ~ b[E\ 0 + p y b[B\ 

with an analogous equation for O a [A] in the evaluation of QE and e. Because the approximation affects the 0 and 0 0 
terms differently by affecting the limits of their integrals in Eqs (31.91) and (31.92) differently, deviations from the 
Dowell-Schmerge relation between quantum efficiency and emittance arise when the scattered electron contribution 
is noticeable. An evaluation of the effect, including the dependence of pifij) on the pre-collision electron's photoexcited 
energy and the dependence of the penetration depth 8{co) on frequency (but still neglecting the variation of the reflectivity 
Rfoo)), can show that departure [565], The results for cesiated copper are shown in Figure 31.20. 


24 William Wordsworth, Ode: Intimations oflmortality, in M.L. Rosenthal, Poetry in English: An Anthology. New York: Oxford University Press, 1987, 
p. 521. 
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Figure 31.20 Theoretical quantum efficiency for cesiated copper (4> = 1.6 eV) without (QE 0 and £ 0 ) and with (QE and e) scattered electrons as per 
Eq. (31.115). Based on Figure 1 of ref. [565]. 



Figure 31.21 The points of Figure 31.20 displayed for QE as a function of e, both of which depend on E/E 0 , where E is the photon energy ha>, based on 
Figure 2 of K.L. Jensen, J. Appl. Phys., 113, 056101, 2013. Also shown are fits with a = 0.6029 and b = 2.67. For small x the Dowell-Schmerge relation 
is good (the ratio is proportional to x 4 ); as x increases, the power law shifts so that for large x it is closer to x 4-4 « x 4 / 3 . 

The modifications to the Dowell-Schmerge relation of Eq. (31.109) are intuited by observing that as the photon energy 
ha (denoted E in the figures) increases, the power law relation appears to change that relates QE to e n rms as E transitions 
from below E g to above. In terms of x = E/E 0 , a function which can do this is x 4 /(l + ax b ), where a and b are to be 
determined, although a better form is x*/(a + ax b ), where a = 1 - a, as that function has the virtue of being equal to unity 
at x = 1 and therefore is in keeping with the ratios approach to the figures. The range over which the Dowell-Schmerge 
relation holds, and where it departs, is then visible in Figure 31.21. The question of how the scattered electrons contribute 
to the current and explain the relation between Figures 31.4 and 31.5 then arises. 

Scattered electrons lose energy, thereby delaying their arrival time to the surface as they fall behind the electrons that 
do not scatter (or scatter a fewer number of times). Let the delay of their arrival to the surface be characterized by a 
parameter t, so that a sharp pulse of incident light results in electrons dribbling out with an exponentially decreasing 
tail. If the laser intensity has a shape I x (t), then the emitted current should then follow 25 

J e if) « V / h® e ~*~* hds = ^QE CUM (31.116) 

where the L[- ■ ■] operator is introduced. Pulse profiles of I A (t) for which a simple analytic solution results are flat-top and 
sinusoidal, for which a Fourier representation of I A (t) is 

I x (f) = 1 4 + ^Z ; cos(ffl/ + ^)j 0(t)Q(T — t) (31.117) 

where I 0 , Ij and 4>j (a phase factor) are constants, 0(x) is the Heaviside step function, and coj = 2nj/T. 


’The formulation here follows an approach suggested by J.W. Lewellen (LANL) (unpublished). 
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EXAMPLE: Find L[I x (f)\ assuming only I 0 is non-zero (the flat-top). 

SOLUTION: For t < T, the integral can be rewritten as 


I 0 C[ 1] = -<T ,/r f e' /r ds = /„(] - e~ t/T ) 

T JO 

(31.118) 

For t > T, it is instead 


I a C[l\ = -e~ t/r f e’/'ds = I 0 (e T/r - l)e~ t/z 

T Jo 

(31.119) 


EXAMPLE: Find L[I x (t)] assuming all the coefficients save one I n is non-zero and all <4 = 0 (a sinusoidal profile). 
SOLUTION: Using cosx = (e‘ x + e~ K )/2 with x = oo n t, the integration gives for t < T 


J„£[cos co n t] = 


/„(sin co n t + a> n T(e ,/r - cos oo n t)) 


1 + (co n r) 2 

For t > T, then co n T = 2 nn simplifies the sin and cos terms at the integration limits, giving 

.(1 _ e T /‘) e -‘/* 


I„£[cosco n t] = 


1 + (co n r) 2 


Although flat-top pulses (with some jitter or shape to them) are possible, laser pulses are commonly Gaussian in shape, 
for which the I n are the Fourier transform coefficients (where e mt = cos cot + i sin cot can be used because only the cosine 
coefficients survive): 

ly(Gaussian) oc J e^^^e’^dt = yGrt,exp\-(ojjt 0 /2) 2 \ (31.122) 

The understanding of the effects of delayed emission are revealed simply by taking all but one of the I„ as non-zero, a 
useful stratagem as it also provides insight on the effects of laser jitter (random fluctuations in the laser pulse shape) that 
often accompany pulses and are exaggerated by using frequency doubling crystals to reduce the wavelength of the drive 
laser [281]. Consider, therefore, a flat-top pulse (non-zero I 0 ) with a single ripple (non-zero I n for j = n and zero for all 
j ± n, or Ij = I„Sj n ), for example let n = 8 to provide a reasonably rapid ripple, and so 

4(f) = 4 + 4 cos(16wf/T) (31.123) 

The current J e (t) = ( q/Tico ) QE L[4(f)| then adopts the shape shown in Figure 31.22 for various values of t/T: not only 
does a longer delay time t result in a softening of the laser jitter, it also elongates the pulse by increasing the emission 



t/T 


Figure 31.22 Current density as calculated using Eqs (31.116) and (31.117) with I 0 = 2I n = 100 W/cm 2 , n = 8, and* = t/T. The other parameters are 
X = 266 nm, QE = 0.06, and T = 10 ps. The lines are labeled by the values of t/T. 
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Time [ps] 



Time [ps] 


Figure 31.23 Left, photoemission from NEA bulk gallium arsenide: symbols are measured, line is simulated using a diffusion model with D = 200 cm 2 /s. 
Based on Figure 3 of P. Hartmann, etai, J. Appl. Phys. 86, 2245 (1999) (Ref. [567]). Right, measured intensity as a function of time for different GaAsP 
photocathodes in terms of quantum efficiency using a 520 nm laser pulse of 1 ps root mean square duration: larger QE values are associated with the 
presence of a more significant emission tail. Based on Figure 9 of I.V. Bazarov, etal., Phys. Rev. ST Accel. Beams 11, 040702 (2008) (Ref. [526]). All data 
digitally extracted. 


tail. The presence of e~‘/ T in both the (t < T) and (f > T) regimes means that the lag of the pulse rise is of the same shape 
as the tail of the pulse decline. 

Intuitively, t will depend on how far electrons are excited into the bulk of the material, making it correlate with the 
laser penetration depth. Therefore, an expectation that metals will be fast responders (short laser penetration depths 
coupled with the fatal effects of scattering) compared to semiconductors (deep laser penetration depths and not-so-fatal 
effects of scattering) is in fact borne out: the quantum efficiency of metals is typically below that of semiconductors. The 
contribution of the tails (which contain scattered and, in the case of NEA photocathodes [23, 516, 523, 526], thermal- 
ized electrons) manifests itself in both theoretical/simulation studies and measurements [526, 567], as reproduced in 
Figure 31.23 for NEA and high quantum efficiency semiconductor photocathodes. 

31.7.3 Shell and sphere models 

And tomorrow we might not he together 
I'm no prophet and I don't know nature's ways 

- Carley Simon 26 

The long tails in Figure 31.23 contain electrons that have undergone scattering events and thereby fall behind due to a 
loss of energy and the randomization of their direction through scattering, compared to the heads for which scattering has 
not occurred at all. For NEA photocathodes, even electrons that have lost most of their excess energy are still emitted. 
The nature of such drift-diffusion electrons differs from the single-scattering event model of above. Prophesying how 
they part ways is undertaken in the algorithm of Section A3.38 for comparing the behavior of drift-diffusion (scattered 
or "sphere") electrons to the ballistic (unscattered or "shell") contribution using a simple mode that randomizes the 
direction and reduces the energy of the electrons after every scattering event [568], The scattering events are taken 
to occur for t = m with n an integer, and so a random walk is being modeled, with results as shown in Figure 31.24. 
Although all blue (unscattered) particles have traveled the same distance from the origin, the projection makes it appear 
they are inside the blue sphere when they are not, in contrast to the red (scattered) drift-diffusion electrons for which 
the red circle is the root mean square (rms) radius. 

Because of the random walk nature of the scattered, or sphere, electrons, the rms radius (r^) 2 = (J^r 2 )/N is expected 
to increase as \ft [51], and so it does as shown in Figure 31.25. Consequently, scattering causes electrons to dawdle in 
reaching the surface (bottom of figure) and thereby be emitted with lower energy and at a later time than the unscattered 
electrons. In a semiconductor under "magic window" conditions, the energy loss would not be as extreme, and the 


’Carley Simon, lyrics to Anticipation, 1971. 
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Faint red and blue lines are spheres with the rc 


m squared radius. The surface is at the bottom of the figure. 



Figure 31.25 The rms radii of the sphere (or scattered) electrons in Figure 31.24. The symbols represent r( f)/r{t max ) and the line yjt/t max for = 20t, 
ratios being taken to eliminate common factors. By comparison, the shell electrons would have a radius of v 0 t and therefore show a linear relation. 


expansion would be affected by effective mass variations and band bending, but an analogous evolution occurs, and the 
resulting behavior helps explain the difference between the initial spike and long tails apparent in Figure 31.23. 

When both the shell and the sphere get to the surface, the contribution of each to the emitted current has a different 
behavior. A manner to estimate it is to evaluate the amount of charge Q(t) = Q shett (t ) + Q sphere (t) passing into vacuum for 
each process, and from that find the time derivative 7(0 = dQ/dt. For the shell electrons, this is rather easy, as it amounts 
to finding the portion of a shell past z = 8, or [568] 

(WO =T~ [ d( p [ m sin Ode = a *, (31.124) 

4# J o Jo 2(7 + 1 0 ) 

where cos0,„ = R(0)/R(t), 8 = v 0 t 0 is the depth at which the electrons were photoexcited, Q s is the total charge on the 
shell, 27 and R(t) = v 0 (t + t 0 ), with c 2 = 2(p + hw)/m as the initial charge and t 0 = 8/v 0 the amount of time taken for the 
shell to first make contact with the surface. 

Calculating the contribution from the sphere is a bit more challenging. A diffusively expanding sphere has much in 
common with the point source models and treatment of the diffusion equation in Section 29.2.1. Compared to the 
one-dimensional solution to Eq. (29.10), three dimensions requires the product of three such factors, and so the distri¬ 
bution of charge expands like 

^ exp (-w,) (31 - 125 > 


27 As luck would have it, "shell" and "sphere" both begin with "s”, so let the sphere charge Q d be designated with a d subscript to indicate 
"diffusion", leaving the s subscript to indicate "shell". 
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and the integral ove 
analogous equation 


all space gives / p{t)dV = Q d , as it must. If the velocity of the diffusing electrons is 
) Eq. (31.124) becomes 


/“ e^' m dz _ i 


= ^d 


IZ e ~ z2/,lDtdz 2 


-m 


v d , then the 


(31.126) 


A few convenient tricks are hidden in obtaining Eq. (31.126): (z) r 2 = x 2 +y 2 + z 2 , so that the integrations over x and 
y coordinates in both numerator and denominator factor out, (zz) by forming a ratio of integrals for the purposes of 
calculation, all the other factors in the coefficient of p(t) likewise factor out as well, (iii) the origin of the integration has 
been quietly relocated to where the photoexcited electrons are created, and (zv) t d = 8/\[AD is introduced and Erf(x) is 
the error function of Section A2.4. Taking the derivative to find the current results in 


(31.127) 


whereas the maximum of the shell electrons occurs right away at (f = 0), the maximum of the diffusive electrons occurs 
at a later time where d t [t~ 3 / 2 e~ td ^] = 0, or at t = 2t d /3. The sphere contribution to current therefore trails the shell contri¬ 
bution. Such is the quintessential element of one diffusion site: a string of such sites then represents the contribution of a 
distribution of photoexcited electrons. As pedagogically useful as a shell/sphere distinction is, it represents a difficult way 
to keep track of the total contribution to emitted current. A Monte Carlo approach is simpler, and illuminates nature's 
ways by mimicking unfathomable random processes. 


31.7.4 A scattering coefficient to quantum efficiency 

the black horse of the future, 
comes to earth but has no tongue. 

- Joan Baez 28 

The Monte Carlo methods behind Figures 31.24 and 31.25 are rather minimalist: after scattering, only the direction is 
chosen randomly. Predicting the future is a great deal more difficult than that. Monte Carlo can and does account for much 
more, such as the energy-dependence of the scattering rates, the random time between collisions, different scattering 
processes weighted according to material properties, a varying number of electrons that are created as a function of depth, 
the consequences of final state occupation probabilities on the directions the scattered electrons can take, the possibility 
that the photoexcited electrons impart enough energy to the scattered electron from the Fermi level that one or more 
of them may still contribute to the photoemission current, and quite a few more complications associated with electron 
transport [108], Not all photoexcited electrons and the electrons they excite (the "secondaries") need to be followed, 
though: once an electron loses enough energy through scattering that it can no longer, if favorably directed, surmount 
the surface barrier then there is no need to monitor its future evolution (unless needed for band bending and space-charge 
effects) and it can be dropped from a simulation. Evaluating the ratio R of only the unscattered electrons with all the 
electrons that surmount the surface barrier thereby gives a measure of how QE can be modified to include scattering. That 
is, use Monte Carlo to estimate QE = R(ha)QE 0 , where QE 0 is the quantum efficiency due to only unscattered electrons 
(the fatal approximation), as in Eq. (31.19). The Monte Carlo methods to do so will treat transport and scattering, but a 
description of the methodology is deferred to Section 32.1. 

Two configurations will be contrasted: the first considers a work function <t> that is comparable to the photon energy 
ha, characteristic of bare metals under UV illumination; the second considers a work function brought low through the 
addition of a cesium surface layer (Section 31.8) so that the photoexcited electron can undergo one to several collisions 
before its chances of escape vanish. Photoexcited electrons and electrons they excite through scattering are followed until 
such time that the energetic electrons either loose too much energy to be emitted or pass beyond the surface barrier. 
Electrons striking the surface barrier with a normal energy that is below the barrier height are reflected even if the total 
energy exceeds the barrier height: in the language of the escape cone in Eq. (31.16), electrons with total energy E > 


5 Joan Baez, lyrics to Three Horses, 1971. 
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H + tj) are reflected if Ecos 2 9 < ft + <p, and such electrons are followed so long as E > /a + ( P- When all of the photoexcited 
electrons are either lost through an erosion of their kinetic energy through scattering or emitted into vacuum, then a 
ratio of the number of electrons emitted with the number of unscattered electrons emitted is made and defines the factor 
R(hco). Letting N„ refer to the number of electrons emitted after undergoing n scattering events, with N 0 referring to 
unscattered electrons, then 

R =W 0 'L N j =1 + 'Z R j (31.128) 

where Rj is the ratio of the number of electrons that scatter j times prior to emission compared to the number emitted 
that do not scatter at all. A few expectations immediately arise. For example, when the photon energy is small, then 
only X[ should be non-zero, and it likely scales linearly with ( hco — </>). When the photon energy increases, then at some 
point the secondary electrons may be able to share enough of their energy so that what they scatter with can also be 
emitted. Therefore, while R may be linear in ( hco - <p) for small values, as ( tico - </>) increases higher powers should become 
manifest. 

A Monte Carlo simulation counts the number N of electrons that have been emitted and keeps track of the number of 
times they scatter prior to emission [568]. Contrast two cases, the first (Cu) where the photon energy hco = 4.661 eV (A = 
266 nm) is close to the work function ® = 4.5 eV, and the second (Cs-Cu) where the difference is greater because 0=1.6 
eV. A field of 10 MV/m at the surface causes a further reduction of s/4QF =0.12 eV. For Cu, of some 10 6 photoexcited 
electrons launched, only 1035 are emitted, of which 953 are unscattered and 82 have scattered prior to emission. Compare 
this to Cs-Cu, for which 39670 electrons are emitted, with 17811 unscattered and 21859 scattered. Therefore, while the 
fatal approximation is good for simple Cu, for Cs-Cu about half the emitted electrons had undergone some scattering. 
The contribution of scattered electrons is summarized in Figure 31.26, and based on the study of ref. [568]. 

A dimensionless term which captures the presumably linear behavior of N with small (hco — <p) yet which should 
also capture the large photon energy behavior of N increasing rapidly is x = (hco - <p)/hco. When the ratio of total to 
unscattered electrons (that is R = N/N 0 ) is shown as a function of x for the cases of Cu and Cs-Cu, then the relations are 
as in Figure 31.26. In fact, evaluations over a range of wavelengths (with fixed work function) or work function (with 
fixed wavelength) demonstrate that a parametric fitting defined by 

R(x) « C 0 + C y x + C v x v (31.129) 

performs well, as in Figure 31.27. Although the values of the C n and v are determined by simulation, they surprisingly 
show a weak (but not negligible) indifference to the bulk material. Once found, f?(x) may be appended to QE 0 of the fatal 
approximation to account for the scattered electron contribution via QE = R QE 0 , as demonstrated after the nature of the 
surface coating is explored (Section 31.8). 



Figure 31.26 The percentage of emitted electrons as a function of the number of scatterings they endured before emission for the cases of copper (Cu: 
<S = 4.5 eV) and cesiated copper (Cs-Cu: ® = 1.6 eV). The photon energy was ftco = 4.661 eV (2 = 266 nm) and a field of 10 eV/pm caused a Schottky 
barrier lowering factor of y/4QF = 0.12 eV. The red and blue lines to guide the eye are from the y(x) equations alongside them. 
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(hf -()>)/ hf 


Figure 31.27 Comparison of Eq. (31.129) to Monte Carlo simulations (based on Figures 17 and 21 of ref. [568]) numerically evaluating R(x) with 
x = (tico -</>)/ha> for copper and tungsten surfaces with partial coatings of cesium, which reduces the effective work function. Symbols labeled by 
wavelengths (in [nm]) vary <6 for the fixed wavelength shown; those labeled by energies (in [eV]) vary photon energy for the fixed work function 


31.8 Low work function coatings 

When a refractory metal is partially covered (6 < 1) by a metallic film, the surface is heterogeneous, the heterogeneity being only 
quasi-static. The effective work function, which from the experimental standpoint represents the entire surface, is an appropriate average 
over the work function of sites covered by the film and others that are uncovered. Due to cooperative phenomena between sites of different 
types, each site cannot be considered independently. Thus, the task of deriving the appropriate averaging procedure is beyond detailed 
analysis. 

Elias P. Gyftopoulos and Jules D. Levine [394] 

In the crude capacitor model of Section 21.5, a submonolayer coverage of cesium atoms gave rise to a nanoscale 
capacitor that caused a work function reduction. The magnitude of that reduction, in Eq. (21.56), was related to the 
degree to which charge separation occurred between the cesium atoms of the surface layer (which are loose with their 
outermost electrons) and the tungsten atoms of the bulk (which attract those electrons) as measured by their respec¬ 
tive electronegativities (Figure 21.9). Smoluchowski [85, 364] used such concepts to give an account of work function 
variation amongst the different crystal faces exposed on a metal surface, illustrated in Figure 31.29. Tabulations of work 
functions, as in Figure 31.28, often obscure the variation with crystal face: the arrangement of the last layer of atoms 
and the distribution of charge on them makes a difference, as shown in Figure 31.30, where "differences" amount to 
fractions of an electron volt. 

The capacitor model is simple and trivially calculated, but phenomenological. In utter contrast, density functional the¬ 
ory exploits a theorem by Hohenberg and Kohn that a system of interacting electrons in the ground state can be described 
by a potential determined from the electron charge density [104]. The Kohn-Sham equations can then give one-particle 
approximations that determine the properties of the system at a fundamental level; its extension to non-uniform elec¬ 
tron gases makes use of a local density approximation, which Lang and Kohn used to great effect in describing the image 



Figure 31.28 Typical work Junction associated with the elements. For crystals, typically the (100) or (110) face is shown (see Figure 31.30). The ir 
shows the ranges associated with the color scheme. Compiled from ref. [309] and other sources. 
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Figure 31.29 Relation of charge redistribution in the atoms on a surface that gives rise to a contribution to the work function. The simple capacitor 
model is in the inset. The triangular figure to the upper right is a representation suggested by Smoluchowski [364], 



charge potential [85, 340, 344, 348] (Chapter 27). Properties such as effective masses, optical constants, and material 
parameters are additional benefits [569], but such first principle calculations are difficult and time-consuming. When 
the need is for something more accurate than the simple models, but less demanding than density functional theory 
(DFT) or molecular dynamics simulations (e.g., the needs of beam optics codes for modeling electron beams launched 
into photoinjectors and particle accelerators), something of a more intermediate nature is preferable for predicting 0(0) 
from metal and semiconductor surfaces as a function of coverage 0 by low work function coatings [84, 308]. Of a number 
of models that may satisfy such purposes [135], a pedagogically useful one is due to Gyftopoulos and Levine [394, 409], 
which improves upon an earlier formulation due to Topping [570], 

Consider, therefore, a group of measurements of various coatings on the surface of various metals versus coverage, as 
in Figure 31.31. Schmidt and Gomer [571] observe 

The 4> vs h curves of... refractory substrates are generally quite similar: the work function decreases linearly at first, then more slowly, 
eventually passes through a minimum, and finally approaches the work function of the adsorbate at concentrations corresponding to two 
to three atomic layers. 

Schmidt and Gomer actually observe a great deal more - they consider the Fowler-Nordheim equation and how it is 
used to determine work function values as in the experimental field emission investigations of Swanson and Strayer 
[572] for cesium on various metals (the images of the crystal faces as the coverage changes are particularly edifying), 
that is, the relations shown in Figure 31.31 all dip through a minimum and asymptotically approach the bulk work 
function of the coating material. It is that behavior that Gyftopoulos-Levine (GL) theory captures by identifying various 
relevant parameters and their effects, although a few of those parameters are somewhat squishy, and not all questions of 
interpretation are resolved with the same clairvoyance or rigor as computational materials physics methods. A modified 
version (MGL) of that theory, based largely on refs. [85 135, 206, 394] and using more contemporary parameters, was 
motivated by the performance of dispenser photocathodes [220, 281, 383-386, 530, 573] and used to understand the 
relationship between quantum efficiency and coverage [206, 381, 382, 532, 534], 
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0 2 4 6 

Coverage [atoms / nm 2 ] 


Figure 31.31 Work function of coated tungsten for different coatings as a function of coverage. Gray fines denote bulk values for (bottom to top) Cs, Ba, 
K, and Sr (based on Figure 22 of L. Schmidt, R. Gomer, Journal of Chemical Physics 42, 3573 (1965) with the permission of American Institute of Physics). 


That relation can be understood as follows. A coating atom such as cesium lets go of its electron to atoms such as 
tungsten, but not completely. How much the electron is "let go" is governed by the electronegativity differences between 
the atoms (Section 21.5), but also on how close neighboring cesium atoms are: neighbors that are too close get in the 
way. Gyftopoulos and Levine therefore suggested that the work function should be formed of two parts, the first part 
W(9) governed by the electronegativity differences and the second part d{6) by a dipole effect. That is, for a coverage 6 
representing the fraction of a monolayer, 

0(0) = W(0) + d(0) (31.130) 

Mulligan proposed that electronegativity is the mean of the ionization potential and the electron affinity of an atom [102]. 
The GL hypothesis therefore conforms to intuition in that in the absence of a coating, the work function and electroneg¬ 
ativity behave similarly as one moves through the Periodic Table, as in Figure 31.32. A difficulty is that electronegativity 
is in Pauling units [PU] and work function is in [eV]. In Pauling units, the electronegativity of fluorine (100.45 kJ/mole 
= 10.411 eV) is 3.98 PU. A rough formula relating O to W constructed by Gordy and Thomas [574] was suggested as 

<D[eV] = 2.27MPU] + 0.341 eV (31.131) 

the performance of which is shown in Figure 31.33. The intercept factor of 0.341 eV is argued by Gyftopoulos and Levine 
to be the energy required to overcome image charge forces, and asserted to be the same for all metals. As a result, if a 
relationship between electronegativity and coverage, or W{9), can be found, then ®(0) can be found. 

A polynomial relationship would be useful, but the determination of the coefficients requires boundary conditions 
and knowledge of the behavior. Two boundary conditions are readily apparent: when there are no coverage atoms then 
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Figure 31.32 The electronegativity W in [PU] and the work function <6 in units of [eV] for various elements as a function of; 


tic number. 
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Figure 31.33 The points of Figure 31.32 compared to the equation of Gordy and Thomas in Eq. (31.131). The color squares correspond to copper (Cu), 
tungsten (W), and cesium (Cs). 


the work function ®(0) should be that of the bulk material or 0(0) = ® u „ and conversely, when there is a monolayer 
coverage, the monolayer work function should be obtained or ®(1) = ® c . Moreover, the addition of a few coverage atoms 
in the former case, or the removal of a few in the latter case, should not make a difference, which is another way of 
saying the first derivatives W'( 0) and W'( 1) vanish. Therefore, four equations (two for W(0) and two for W'(0) at 6 = 0 
and 1) can be used to determine the coefficients of the polynomial. To summarize, they are 29 


W( 0) = <& w \W'(Q) = 0 
W(l) = O^W'Q) = 0 


(31.132) 


With four equations, the simplest polynomial is a cubic for which W(0) = £ 3 =0 ■ The four equations leads to a matrix 

relation of the form 


1 0 0 0 C 0 <5 W 

1 1 1 1 q ® c 

0 1 0 0 C 2 0 

0 1 2 3 C 3 0 


(31.133) 


with a solution of 


C 0 

C 1 0 

C 2 3® f - 3®^ 

C 3 2®^ - 2® c 


(31.134) 


A bit of algebra shows that this means 


W(0) = ® c + (®„; — ® c )iT(0) (31.135) 

H(0) = (1 + 20)(1 -0) 2 (31.136) 


with H( 0) = 1 and H( 1) = H'( 1) = H\ 0) = 0 as required. 

Evaluating the dipole term d{0) remains, and it can be obtained from the electronegativity barrier. 30 A dipole represents 
a charge separation, and so the dipole term between two atoms should reflect how strongly each atom tugs on the 


29 The / and m subscripts of ref. [394] are c and w here, respectively, to suggest "cesium" and "tungsten"; if generalized, they could mean 
"coating" and "tuhat is underneath" instead. 

30 In Section 21.5, the electronegativity was designated B p , but the necessity to discriminate between subscript terms suggests replacing it with 
something else. Ref. [394] uses x(0), but here X(8) will be used. 
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Figure 31.34 Configuration of the coating (green) and substrate (gray) atoms to show the hypotenuse R = r c + r w and the angle 0. Left, top view; right, 
perspective view with the locations of the coating (green) and substrate (gray) shown. The orange arrow indicates the direction of M cw . 


outermost electrons, that is, it should be proportional to the difference in the electronegativity X c and X w of those atoms. 
Gyftopoulos and Levine argued that a similar conclusion holds when one atom rests upon several atoms. Representing 
those atoms as hard spheres with a radius r specified by their covalent radius eases the calculation a bit: considering Figure 
29.18 suggests that the vertical component of each dipole interaction contributes to the total (the horizontal components 
cancel) and so for cesium on tungsten, with the distance between adjacent cesium and tungsten atoms being designated 
R= r c + r w and M cw being the dipole of that pair, the sum of the four dipole components is 

M 0 = 4M CW cos ft (31.137) 


where (3 is the angle R makes with the normal to the surface, as in Figure 31.34, and where 

M cw = YX7^ W ~~ (31.138) 

where 2.27 is from Eq. (31.131) and k/e 0 = 43.256 nm 2 is deduced from electronegativities and molecular dipole 
moments. 31 Let the length (k/ 2.27e 0 ) 1/2 = (7.0485 nQk/q 2 ) l/2 =r 0 = 0.43653 nm, and so 


M{0) = cos P(Q> W — ®, )1T(0) (31.139) 

4nQ 

where the dependence of M{6) on coverage is reasonably expected to go as H{0) (no coverage should result in no dipole, 
full coverage in the full monolayer dipole, and the addition of a few, or removal of a few, atoms at each extreme does 
not change matters). The triangle of Figure 29.18 has a base of 2r w and a side of r w + r c , but that is just a special case 
when the bulk atoms are touching. If they are pushed apart a small bit, then r w would have to take that into account. 
Therefore, let w/{2r w ) 2 be the number of atoms per unit area. A generalization to /? is then 


sin /? = — ( -r- 


(31.140) 


which finishes the consideration of a dipole by itself. 

Now consider the effect of dipoles on each other: they will give rise to fields that act to "depolarize" their neighbors. 
The "effective" dipole moment M e {6) is therefore the difference between the dipole moment M{6) and the depolarizing 
field £{6), but the depolarizing field is proportional to M e (6) by 


[A 


[A 


(31.141) 


where the new dimensionless factor / is the number of coating atoms per unit cell at monolayer coverage 32 and the 
coverage 6 appears when there is not a monolayer on the surface. The effective dipole M e (0) is therefore the difference 


31 Gyftopoulos and Levine specify k = 3.83 x 10 30 Coulomb-m/V, which is the same as k/e 0 = 43.256 nm 2 here, but without the befuddling 
exponents. 

32 Calling / by the letter c would have been confusing with the velocity of light. Practicality always requires concessions. 
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between the dipole M(0) with the product of the polarizability factor 4 ne 0 nr] (more about which in a moment) and the 
depolarizing field £{6), or 

M e (0) = M(9) - 4jte 0 nr^£(6) = M(0) - ^-r c 3 £(0) (31.142) 

from which M{6) can be found. Several things were slid into that equation: first, there is the covalent radius of the coating 
atoms r c and, second, there is a new factor n. An atom such as cesium in the alkali metals column of the Periodic Table 
has only one outer electron so n = 1. The alkali earth metals in column two, however, have two atoms, but because they 
interfere with each other (or "shield" each other) from the atom's nucleus, n is not quite 2 but closer to 1.65 for them. 
The product of the effective dipole moment M e {6), the surface density of coating atoms f9/(2r c ) 2 , and a factor of l/e 0 
then specify d{6), or 

dm = -M e m /°j = -M t m( f 4) (31.143) 

4 e 0 r 2 c q 2 \ r 2 ) 


The final piece is the evaluation of cos p based on the geometrical relationships of Figure 31.34. If the 
square of the pyramid is l 2 = w(2r w ) 2 , then with sin 2 /? = l 2 /2R 2 , it follows 


cos p = 


-ml 


1/2 


Now all the pieces are in place. Putting them together, and defining 


of the base 


(31.144) 




(31.145) 


while remembering that r c is the covalent radius of the coating atom, r w of the substrate atom, R=r w + r c , r 0 = 0.43653 
nm is from the discussion following Eq. (31.138),/ and w are dimensionless number of atoms per unit cell factors, and n 
takes on the value of 1 for coatings from the first column of the Periodic Table (like cesium), but 1.65 for coatings from 
the second columm (like barium), then the work function O as a function of coverage 6 (with 0 = 1 being a monolayer) is 


0(61) = ® c — (O c - OJH(9) [ 1 - G(0)] (31.146) 

A moment's thought reveals that just as differing crystal faces alter the value of the work function even though the 
surfaces are composed of atoms of the same material, they will also mess with the value of / and w because different 
faces position the surface atoms at different relative locations, and the coating atoms are affected by the placement of 
the atoms under them: as a result, the values / and w take on are not independent. Distinguish between three kinds of 
crystal faces: the [100] face, the [110] face, and let all others be designated as [S] for "bumpy". Introduce a factor N 0 that 
depends on crystal face, defined such that 


iV,/100] = 1; [110] = 2; N 0 [B] = 3 (31.147) 

The ratios f/i/N~ 0 and w/^JW 0 show approximate consistency from one crystal face to another. The relationship between 
/ and w is then dictated by 


/ 


= 4(Cs on W, Mo, Ta, ...) 
= 2(Ba on Sr, Th, W, ...) 


(31.148) 


For generic parameters, the variation of 0(0) is shown in Figure 31.35. 

Putting Eq. (31.146) into the service of comparing theoretical 0(0) values to experimental measurements for metal 
surfaces with a submonolayer coverage of coatings encounters difficulties in wresting/ from a hard sphere model (on typ¬ 
ical and especially sintered surfaces, multiple crystal faces are present), mediating between 6 and what is experimentally 
measured, and even the possibility of contaminants that change the work function (e.g., oxygen on Cs-W or Ba-W tends 
to lower the work function [575, 577]). For example, consider the comparison of various experiments in the literature 
for Cs and Ba on tungsten shown in Figure 31.36 where the coverage is usually (but not always) given in fractions of a 
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Coverage 9 

Figure 31.35 Work function 4> as a function of coverage 0 for Cs-on-W-like parameters (the default parameters of the Gyftopoulos-Levine algorithm of 
Section A3.29: r w = 0.130 nm, r c = 0.253 nm, n = 1.00, R fw = 4.0, 4> c = 1.6 eV, = 4.55 eV. 


Figure 31.36 Work function 4> 
of ref. [418]. For Ba: Longo is from Figure 2 of ref. [576], Haas is fr 
extracted from sources. 



lor is from Figure 14 
n Figure 1 of ref. [577], Schmidt is from Figure 6 of ref. [578]. Data digitally 


monolayer or expressed in terms of deposition time that has to be converted to an equivalent coverage factor. Although 
the data on the Cs-W side tends to be in reasonable agreement with each other, the data for Ba-W on the right shows 
variation between the data of Haas et al. and that of Longo et al. or Schmidt (which tend to be in agreement). Clearly, 
then, the possibility exists that reported data give coverage up to an overall scale factor, which represents a parameter 
that must be constrained or determined. 

If both/ and the scale factor (call it scale) that connects 0 to the empirical coverage are treated as adjustable parameters, 
then a constraint must be imposed to find either / or scale from the data. Consider then the optimistic assumption that 
the experimentally reported data is accurate so that scale = 1. How might / then be determined? One possible approach 
is to minimize the rms error. That is, let the empirical data be characterized by N pairs of (6j, Oj) for; € {1,2, ■ ■ -N}. Then 
a variation over / such that A(/) defined by the rms error term 

A(/) 2 = jf Z [ °; - O(0 7 )]2 (31 ■ 149) 

is minimized is sufficient to identify a unique/, where ®((/) is determined using Eq. (31.146). 33 The results of just such 
a calculation are shown in Figure 31.37, for which the agreement is reasonable, but moreover, the data for Haas alone 


33 0ne could have just as easily gone the other way by specifying a fixed/ and using Eq. (31.149) to identify scale, but that is another story. 
Which one is held fixed depends on the confidence held in the models from which it is constructed. 
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Figure 31.37 The data of Figure 31.36 compared to Eq. (31.146) with/ determined such that 0 = scale x coverage. For Wang, Taylor, Longo and Schmidt, 
/is determined such that scale = 1 (Cs:/ = 1.5336; Ba: / = 0.800). For Haas, the value/ = 0.800 and scale is found by minimizing the least squares 
difference of Eq. (31.149) between theoretical and experimental values of ®(0) (scale = 0.56). 


is brought into rather good agreement with the other data through the determination of a scale factor while otherwise 
using the / value that was determined for the Longo and Schmidt data lines. Even better agreement can be achieved by 
adjusting the value of scale away from unity, but at that point the effort is more curve fitting than illuminating. What is 
clear is that GL theory does a respectable job in capturing the variation witnessed for a submonolayer coating atop a bulk 
metal surface with few adjustable (and constrained) parameters. 


31.9 Quantum efficiency of a cesiated surface 

Everything in this world is a matter of calculation. Advance then with caution, the balance in your hand. Put into one scale the pleasures 
which any object may offer; but put fairly into the other the pains which are to follow, and see which preponderates. 

- Thomas Jefferson 34 

The GL model for ®(0) can be combined with the moments model for QE to predict the quantum efficiency QE(co) of a 
metal surface with a varying submonolayer coverage of cesium. Those predictions may then be compared to experimental 
measurements for cesium deposition onto sintered tungsten surfaces [84, 206, 381, 382, 579 580] much like those of 
the dispenser cathodes shown in Figure 29.1. In such measurements, cesium deposition occurs simultaneously with QE 
measurements at various laser wavelengths as the coverage is increased. 

The evaluation of QE for different wavelengths as a function of coverage requires an integration over energy and angle 
in Eq. (31.21), and generally numerical integration is required. In performing the computation, several dependent factors 
must be kept in mind or evaluated. They are: 

• the escape probability (for cesiated metals, approximating the transmission probability by a step function is adequate) 

• the optical parameters n and k from which to find the reflectivity R{co) in Eq. (31.47) and laser penetration depth <5(w) 
in Eq. (31.43) (for tungsten, a spline fit to measured data [309] is useful) 

• the material constants, bulk and monolayer work functions used in the GL model in Eqs (31.145) and (31.146) [84, 
206, 568) 

• an approximation to the scattered electron contribution (e.g., Eq. (31.129)) 

• the relaxation time dependence on photoexcited electron energy (Section 32.3). 

A number of these parameters are temperature dependent: if laser intensities are sufficiently low, then a constant 
temperature approximation is sufficient, but high laser intensities over prolonged times or fast repetition rates cause 
temperature excursions. If the laser intensity is very high (even if short in duration), a decoupling between the electron 
and the lattice temperature can occur [83, 85, 206, 581, 582]. For the low laser intensities used in the experiments to 
which the photoemission model will be measured against [381, 382], the temperature can be treated as constant. 


34 Quoted in Jonathan Haidt, The Righteous Mind: Why Good People Are Divided By Politics and Religion. New York: Pantheon Books, 2012, p. 35. 
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Figure 31.38 Numerical evaluation of QE(w) without the scattering factor R(x) (Fatal) and with (Scatt), compared to measured UMD data for cesium 
evaporated onto a polycrystalline tungsten surface (based on Figure 22 of ref. [568]experimental data courtesy of E. Montgomery (UMD)). All theory 
lines are calibrated to match the relative peak location and height of the 2 = 532 nm line. 



The behavior of the relaxation time that is part of the mean free path between collisions 1(E) = hk(E)r(E)/m and is a 
part of the scattering term p(E) of Eq. (31.18) is more difficult. The treatment of relaxation times will be undertaken in 
Section 32.3; for now, in metals, the relaxation time r(E) drops rapidly as the energy of the electron increases over the 
Fermi energy, and is primarily due to electron-electron collisions. 

The predictions from such a model can be compared directly with experiments, for example those undertaken at the 
University of Maryland in which QE was monitored as cesium was deposited onto a sintered tungsten surface, where 
rapid and as close to simultaneous measurements from the same spot on the cathode surface necessitate innovative 
approaches [579, 580]: cesium is mobile and can evaporate quickly, necessitating that QE measurements at different 
wavelengths be rapid. To make such measurements, a photocathode with a heater coil for measurements at different 
temperatures (or even gradually increasing temperatures) was used. Direct cesium deposition was performed by using 
an external source and measuring the coverage by monitoring the mass deposited (e.g., Cs, Sb, K) on the surface using a 
quartz crystal balance. The configuration enabled the surface to be heated or argon ion cleaned to remove contaminants. 
The rapid sequential measurement of QE was enabled by using five low-power diode lasers fixed to a rapidly moving 
stage, so that the QE measurements for various wavelengths were taken from the same spot in close to the same time. A 
field of 5-30 kV/m was sufficient to prevent space-charge effects. Ion cleaning between runs insured an atomically clean 
surface, although it roughened the surface as well. 

A comparison to the University of Maryland (UMD) data to both the fatal approximation (excluding R(x)) or including 
scattering (using R(x), as in Eq. (31.129)), is shown in Figure 31.38 after the theory is normalized to the X = 532 nm ref¬ 
erence line [568], Because of the polycrystalline nature of the sintered tungsten surface, the possibility of contaminants, 
and other experimental considerations, photoemission cannot be presumed to occur uniformly across the surface, and 
the normalization reflects that uncertainty. Regardless, once one of the lines is normalized, the theory curves for the fatal 
approximation account reasonably well for the observed variation in quantum efficiency as a function of wavelength, 
but when the scattering contribution signified by R(x) is included, the shapes of each line in addition to the relative 
differences between them is much improved. As a result, the GL model for work function <t>(0) applied to the moments 
model for quantum efficiency matches the photoemission measurements very well, enough to supply emission models 
needed by the beam optics codes [583] described in Chapter 33. Hidden in the calculation, however, is the evaluation of 
the relaxation time r(E), the treatment of which must wait until Section 32.3. 







CHAPTER 32 

Secondary emission cathodes 


It appears here ... that calculations based on the free electron model of Sommerfeld can lead to a considerably improved understanding 
of secondary electron emission from metals, so that this phenomenon should have its place along with the many others which have been 
illuminated by this simple picture. 

- Eugene M. Baroody [35] 

Those who utilize high brightness photoinjectors often rue that quantum efficiency and photocathode ruggedness do 
not correlate (Table 31.1), or that what eases demands on the photocathode often increases demands on the drive laser 
(and vice versa). Recognizing that the measured high secondary yields from diamond in reflection mode (over lOOx) 
observed by Yater and Shih [207, 584] (as shown in Figure 32.1), Ben-Zvi etal. [143] envisioned using a thin diamond 
layer as a secondary electron source to provide copious electrons while protecting the photocathode and easing demands 
on the drive laser. Studies to investigate the feasibility have been increasingly reported [23, 144, 209, 322, 324, 585-588], 
Here the physics is explored. 

Consider the secondary yield from diamond in both reflection and transmission mode, the differences of which are 
suggested in Figure 32.2. Whether secondaries are measured on the same side or on the opposite side from which the 
primary beam is incident affects what is emitted. In transmission mode, the electrons have further to migrate through a 
diamond sample, and so higher primary beam energies are often required. When the secondaries are collected and shown 
as a function of energy, the behavior shown in Figure 32.3 is characteristic. Though perhaps not readily apparent, the 
lines for transmission mode would for the most part lie atop one another if each were normalized by its maximum value, 
with only the 10 keV and 8.5 keV lines slightly shifted. The greatest difference between the reflection and transmission 
lines is the presence of electrons in the higher energy tail for the reflection case, but an absence of such a tail in the 
transmission case. If one is coming fresh from Section 31.7 this may be no surprise: the secondaries are borne close to 
the surface in the reflection case, and a fraction of the secondaries can make it to the surface and be emitted without 
having thermalized; in the case of the transmission measurements, the diamond layer - even at 8.3 pm - is too thick for 
electrons to pass through without having their excess energy whittled away due to scattering. That scattering has such 
an effect first motivates a means to treat it, and second has implications for the kinds of beams and the response time 
associated with a cathode made by using a diamond thin film as a secondary emission source. 

32.1 Diamond amplifier concept 


...a diamond amplifier functions as follows. Primary electrons with energy of a few keV enter a high purity single-crystal diamond 
through a metal coating. These electrons excite electron-hole pairs, the number of which typically is about 2 orders of magnitude more 
than the number of primary electrons, depending on the latter's energy. A fraction of the electrons is lost to recombination near the 
diamond-metal interface and by diffusion into the metal surface. 

- Erdong Wang, Ilan Ben-Zvi, Trivini Rao etal. [589] 

I saw a highway of diamonds with nobody on it. 

- Bob Dylan 1 

Of the various configurations envisioned for a cathode producing bunches of electrons by exploiting the high secondary 
yield of a hydrogenated diamond surface [209, 322, 585, 586, 588, 589], take a configuration based on refs [144, 322] 
and illustrated in Figure 32.4 as representative. In it, a high-energy primary beam is incident on the back side of a 


^ob Dylan, lyrics to A Hard Rain's A-Gonna Pall, 1963. 


Introduction to the Physics of Electron Emission, First Edition. Kevin L. Jensen. 
© 2018 John Wiley & Sons Ltd. Published 2018 by John Wiley & Sons Ltd. 
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Figure 32.1 Secondary yield from the C100 and Cl 11 faces of hydrogen-terminated and cesium-coated diamond crystals. The high yields indicate that 
low-energy electrons transport efficiently through diamond regardless of crystal face. The yields have not reached their maximum value for the range 
of primary electron energies shown. Based on Figures 2 and 5 of ref. [207], data courtesy of J.E. Yater (NRL). 



Reflection Transmission 


Figure 32.2 Left, in reflection mode, a 1 keV primary beam (red cylinder) incident from the left results in measurement of secondaries emitted to the 
left (green cone). Right, in transmission mode, a 20 keV primary beam generates secondaries that pass through the thin diamond layer and are 
collected on the opposite side. After ref. [144], 



E - E f [eV] 


Figure 32.3 The yield of secondary electrons in reflection mode (blue circles) for 1 keV primaries, and in transmission mode (red squares and lines 
without symbols) for various primary energies. The single-crystal chemical vapor deposition (CVD) diamond thickness was 8.3 ftm. Ec is the conduction 
band minimum. Figure based on ref. [144], data courtesy of J. Yater (NRL). 


thin diamond flake much like that shown in Figure 15.1; the arrangement of the measurement system is schematically 
illustrated in Figure 32.5. A potential is maintained across the flake, and it serves to pull the secondaries to the vacuum 
side, whereupon the hydrogen-terminated (and therefore negative electron affinity, NEA) diamond surface allows their 
passage into vacuum. Two questions arise: first, for the simple representation, what should be the primary energy and, 
second, how thick should the flake be? The first question is answered by Baroody's equation of Eq. (15.8): choose E 0 = E m 
to maximize the yield 8(E 0 ). The second question depends on how bunches of electrons migrate through the diamond 
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Figure 32.4 The operation and layout of a thin film diamond current amplifier. A high-energy beam of primaries is directed to the back side, generating 
secondaries. Contacts on the diamond flake have a potential difference which pulls the secondary electrons through the thin flake whereupon a 
hydrogen-terminated NEA diamond surface allows for their escape. Based on Figure 1 of ref. [322] and Figure 1 of ref. [144], 



Figure 32.5 Schematic of the UHV analysis chamber used in making the reflection and transmission measurements of secondary emission in thin 
diamond flakes. Based on Figure 2 of ref. [590]. 


flake, and therefore depends on what scattering does to that diffusing cluster: why that might matter depends on the 
uses advocated for a diamond amplifier. 

The diamond amplifier was conceived to augment a photocathode for a free electron laser (FEL) [143], 
High-performance and X-ray FELs make strong demands on beam quality and emittance [591,592] that are chal¬ 
lenging to photocathodes in terms of quantum efficiency, emittance, and lifetime. Existing candidates, as in Table 31.1, 
can entail unsavory requirements on the drive lasers as well. Therefore, augmenting the "effective" quantum efficiency 
by enhancing the yield without demanding higher laser intensity eases demands on both photocathode and drive laser, 
while offering protection to the photocathode from the harsh injector environment as an added benefit. 

Other capabilities also matter. An ability to pulse-shape very short bunches entails rapid rise and fall times associated 
with emission, particularly if a flat (or "top hat") profile is desired for emittance reduction [593, 594], For example, 
bunches for the European X-ray FEL are characterized by a trapezoidal pulse 20 ps long with a 4 ps rise/fall time [595], and 
similar or shorter pulse length requirements exist for other FELs. Although metal and some semiconductor photocathodes 
have fast response times by comparison, a pulse of electrons forced to diffuse through a thick layer of diamond with an 
NEA surface might face challenges [209], and so finding the relationship between rise/fall time and diamond thickness 
matters, particularly if the field across the layer is affected by residual doping levels [322], 

The investigation to be undertaken, therefore, is two-fold: first, the physics of how the secondaries are generated by 
the primary and how deep into the diamond they are made and, second, how those generated secondaries migrate while 
being accelerated by field and retarded by scattering through the diamond, and how that migration affects how rapidly 
such pulses rise and fall. A third consideration, namely, how emittance may be affected by the transport of the secondaries 
through the film, is deferred to Section 33.3.7. 
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32.1.1 Bethe approximation 


Four other Oysters followed them, 

And yet another four; 

And thick and fast they came at last, 

And more, and more, and more- 
All hopping through the frothy waves, 

And scrambling to the shore. 

- Lewis Carroll 2 

The development of Baroody's equation (Eq. (15.8)) used approximations for Eq. (15.1) that relied on a model due 
to Bethe concerning how high-energy primary electrons shed their energy. A high-energy (primary) electron has more 
than enough energy to knock core electrons (secondaries) from atoms in the bulk material into the conduction band, 
where they can roam until such time as they find the surface and are emitted or lost to recombination. Bethe developed a 
model describing how that shedding of energy occurred (also called a stopping power (SP) formula) [141, 209, 596, 597], 
so that the number of secondaries per primary could be estimated. Although good for high primary electron energies 
(higher than several hundred electronvolts), at lower energies it becomes less reliable [598] but nevertheless useful. The 
energy loss per unit length is given by 3 


-SW! 


(32.1) 


where Z= 2 j , BEj = (2Q/a g )(Z/j) 2 , Z = 6, and N = p/A = (3.512 g/cm 3 )/(12.0107 g/mole) = 0.29241 mole/cm 3 for car¬ 
bon (of which diamond is composed). If the sum is expanded and the terms collected, an easier form results and is 


dE 

dx 


192 nNQ 2 
E 


In 


/ 2 4 ' 3 Ea 0 \ 


(32.2) 


EXAMPLE: At what minimum energy does Eq. (32.2) become unphysical (positive energy gain per unit length)? 
Assume carbon parameters (Z = 6). 

SOLUTION: Find E such that the logarithm argument equals 1, or 


E 


QZ 2 

2 4/3 «„ 


2 4/3 (0.3[eV nm]6 2 ) 
0.052918[nm] 


97.19 eV 


When E drops below « 97 eV, dE/dx changes sign, an unrealistic consequence of the inadequacy of the approximations 
when the energy transfer is small [596], The logarithmic sign change is removed, yet the large energy limit retained, by 
including a factor of 1 in the logarithm, or 


dE 

dx 




(32.3) 


The inserted factor of unity in Eq. (32.3) need not have been 1: when it is instead a parameter (say d 2 ) and the factor 
2 A ’^Ea 0 /QZ 2 is augmented by a factor 1 /d lt with the requirement that the values of d l and d 2 must be fitted in conformity 
with data, then rather good agreement with experiment can be obtained [599], In the interests of simplicity, though, 
setting d^ = d 2 = 1 (Eq. (32.3)) will be used. Re-evaluation of the range R(E g ) from Eq. (15.3) now gives 


R(E 0 ) = 





\92ttNQ 2 J 0 In [l +a(E 0 )x] 


(32.4) 


2 Lewis Carroll, Alice in Wonderland and Through the Looking Glass. New York: Grosset & Dunlap, 1946, Chapter IV: Tweedledum and Tweedledee. 

3 The present treatment uses a notation dependent on Q instead of a ^ as in ref. [209], but the formulae resulting are equivalent. Also, though 

not treating the Bethe model perse, Young [211] successfully compared the electron range energy in A1 2 Q 3 to the SP formula. 
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Figure 32.6 Comparison of Eq. (32.4) to the power-law representation of Eq. (15.5) for n = 0.4 (blue) and n = 3/4 (red), where R(3 keV) = 113.64 nm. 
A best fit over the range shown (Figure 1 of ref. [209]) finds J[nm] = 18.949(E[eV]) 1679 . 


where a(E 0 ) = 2‘ i ^E 0 a g /QZ 2 = E 0 /(97.19 eV) for carbon. This is a different sort of relation than that implied by Eq. (15.5), 
but as Figure 32.6 demonstrates, a power-law representation is not bad, with n in R(E g ) = A 0 E" suggested to be between 
1.4 and 1.75. For intermediate values of E 0 , R(E 0 )\nm\ « 18.949(£i 0 [eV|) 1 679 (e.g., the power-law approximation of 
ref. [209]). Compare this to the treatment of Kanaya and Okayama [600], who find R(E 0 )\nm\ » 19.218(7i 0 [eVl) 1 - 666 , 
which is quite close. For the purposes of what follows, 4 it is therefore convenient to let n = 5/3 and nA 0 = 18.949 nm 
keV- 5 / 3 . 


Pulling a power-law fit out of Eq. (32.4) seems to be extracting an inferior representation from a more accurate and 
perfectly fine numerical model. Why do it? The power-law form is simply too convenient to pass up because it allows 
for a friendlier method of calculating 8(E 0 ). Used in Eq. (15.1), the power-law form gives 


dx nR(E 0 ) [ R(E 0 )\ 


(compare Eq. (15.4)) in terms of which 


S(E 0 )=- 


BE 


[yR{E 0 )]V» 

F„(y) = J ds 


> e -rWdF n [ YR (E 0 )] 


(32.5) 


(32.6) 

(32.7) 


in contrast to Eq. (15.6). 


EXAMPLE: Find the steps necessary to obtain Young's approximation [141, 601] discussed in Chapter 15 from Eq. 
(32.6). Let n = 5/3. 

SOLUTION: Make the substitution s = y (1 - x) in F n (y) of Eq. (32.7) and then use the exponential approximation 
( l-*) 2 / 5 «< T 2 */ 5 

The integration is now analytic, and is 

S (E.) = -( - — -— ^ {1 - e -r«(B.)+(2/5)| (32.8) 

eV °’ n\ Y R(E 0 )-(2/5) S 

If (n - Y)/n = 2/5 is small compared to yR and B' = B/n, then Young's approximation is obtained. 


4 There are differences between the present treatment and ref. [209], on which it is based, such as n here is n + f there, and Eq. (15.5) here 
calls out an n in the denominator, but there n is absorbed into A g with changes to y as well as a result. 
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Might it not be easier to use the simpler exponential approximation 8 e (E 0 ) of Eq. (32.8) to compare to measurements 
rather than enduring the numerical integrals demanded by F n (y)7 Bluntly, "No", but why is instructive. In the limit that 
there are no losses during transport (y -*■ 0) and the probability of emission is unity (B -* 1/A E, or p -* 1), then the yield 
8(E g ) should simply be just the number of secondaries created. Either by being simple and solving Eq. (15.1) directly with 
y = 0 or by being fancy by using F n (y ) » y l/n and taking the limit y -*■ 0, 

lim 8{E 0 )=^ (32.9) 

r-»o 0 A E 

when p= 1, which just makes sense: each secondary takes an energy of about A E to be created, so the number of 
secondaries is just the number of times that can be done by a primary of initial energy E 0 . But that is not the y -* 0 limit 
of 8 e (E 0 ), which is instead (for n = 5/3) 

^ - 1) = °- 7377 ^ (32.10) 

A temptation might then be to use the easier form of 8 e (E 0 ), but appending a coefficient of 1.356 to its overall magnitude 
so that the number of generated secondaries is brought back into line would result in E m and 8 m being too large as a 
consequence. Further machinations beyond that make the approximation become of comparable difficulty to the actual 
numerical evaluation, and so evaluating F n (y ) numerically from the start is just better. 

Compare now the 8(E 0 ) of Eq. (32.6) to, for example, the H/C(lll) shown in Figure 32.7, one of the sets of data 
from which high secondary yields from diamond were demonstrated and which helped motivate the diamond amplifier 
concept. To make the comparison ( i ) evaluate A E using the relation A E = 2.5E g [209, 602], which is compatible with the 
data of Yater and Shih [207], taking the band gap of diamond to be E g = 5.5 eV, then A E = 13.75 eV, (if) from Section 
15.1, B = p/AE, and (iii) the parameters p and y are all that is left for which to match data to S(E 0 ). 

Finding p and y has some art to it. Actual data is noisy, but taking a polynomial fit of the form 8 a (E 0 ) « C l E 0 - C 2 E 2 0 
smooths out the effects of noise. Fits give C x [l/keV] = 43.588 and C 2 [l/keV 2 ] = 1.8028. Algorithms such as bisection 
(Section A3.7.1) can then be used to findy by requiring <5(2 keV)/<5(l keV) = 1.9137, which occurs for y = 1.7295 [1/pm], 
after which <5(1 keV) = 41.785 gives p = 0.58641, entailing B = 42.648 keV -1 . With these values, the theory line of 
Figure 32.7 is constructed, from which 8 m = 187 for E m = 8.6 keV. 


32.1.2 Penetration range and initial conditions 

At high primary energies, the high-velocity electrons have a relatively short time to interact with the lattice electrons, and the internal 
yield per unit length is low. As the primary electrons lose energy, the interaction time increases and so does the yield. The combined effect 
is that as the primary-electron energy increases, the internal secondary electrons originate deeper beneath the surface. 

- Arnold Shih, Joan Yater, Charles Hor, and Richard Abrams [584] 



crystal face of 
yield occurs at 
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In preparation for following the evolution of secondaries as they migrate through the bulk material, the initial position 
of the secondaries and their initial direction must be obtained to specify the staring conditions in a Monte Carlo analysis. 
The direction is relatively straightforward: the secondaries can be assumed to point perpendicularly to the direction of 
the incident primary beam, although randomly distributed about the azimuthal angle [157, 596], 

But how many electrons are deposited as a function of distance into the bulk material? If one assumes that a secondary 
is generated every time the primary loses A E in energy, then the positions where this occurs can be obtained by inte¬ 
grating, or if the positions occur at Xj, then x ;+1 « Xj + AE(dx/dE). But there is an easier way. Recall that R(E 0 ) is how far 
a primary beam of energy E 0 penetrates into bulk. A primary beam of energy E 0 — A E would not penetrate quite as far, 
but the conception can be inverted: 

Xj = R(E 0 ) - R(Ej) (32.11) 


with Ej = E 0 -jAE. The number of secondaries (and that would be one) created between x J+1 and Xj is then 

± N(x) = -J-( E] ~ E] - 1 \ = 1 electron 

dx ' ’ AE \Xj--Xj_y ) R(Ej) - R{E hl ) 


(32.12) 


Although Eq. (32.4) should be used for R{E), a quick appreciation of the behavior is given by using Eq. (15.5), for which 
Figure 32.8 results and matches independent findings [584], What is evident is that the rate at which the secondaries are 
created spikes as the primary grinds to a stop. That this is a general feature of particle penetration allows for targeting 
the depth to which particles are deposited by adjusting the primary beam energy. 


EXAMPLE: Estimate how long is required for the primary electron to loose its initial energy E 0 . Assume no field 
exists across the diamond flake, and let E 0 = 220AE = 3.025 keV. Take n = 5/3. Lastly, assume m n = 0.57 m for dia¬ 
mond. 

SOLUTION: Although the velocity v(x) of the primary changes by discrete amounts after every collision which 
generates a secondary, treat c(x) as continuous. From Eq. (32.5), it follows E(x) = E 0 \ 1 - (x/R(E 0 )] i ^ 5 . Introduce 
v(x) = \]2E(x)/m n . As a result, 

dx 10 R{E a ) 

At= ax = (32.13) 

Jo v(x) lv 0 

where v 0 = \j2Ejm n . As a result of the factor (10/7), the time for the primary to come to a stop is approximately 
43% longer than the transit time of the primary across R(E 0 ) if it did not suffer events leading to secondaries. Inserting 
numbers, v g = 32.62 nm/fs, and so At« 4 fs. If a field existed across the diamond flake, then A t would be smaller 
because the primary would accelerate between collisions. 



Figure 32.8 Number of secondary electrons generated between x, and the surface (N(x)) and number generated per unit length 
dN/dx = 1 e"/(K(B J+1 ) - !?(£))), where 50% of the secondaries (110) are generated in the last 33% of the path of the primary. 
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Approximating the initial conditions of the secondary electrons is next. The time taken for the primary to stop is orders 
of magnitude smaller than a picosecond-scale pulse width, and so the secondaries are to a good approximation created 
all at once with one electron at each Xj, each of which is borne with an initial energy AE and directed randomly in a 
direction perpendicular to the primary electron's path, which bores through the bulk in a straight line. With the initial 
positions and velocities of the secondaries, their trajectories can be followed, at least until they scatter. 


32.2 Monte Carlo methods 

If you play a Game of Chance know before you begin 
If you are benevolent you will never win 

- William Blake 5 

Prior encounters with Monte Carlo, in Section 29.2.4 for choosing how atoms hop or in Section Al.2.5 for evaluating 
otherwise difficult integrals, are but small examples of how probabilistic methods can be used to solve otherwise difficult 
or intractable problems. For modeling semiconductor devices and electron transport through materials, the Monte Carlo 
method is - with no exaggeration - magnificent when time scales are so fast (comparable to the scattering times) or 
the length scales so small that drift-diffusion and hydrodynamic equations are not appropriate. Good accounts are in 
the literature [107, 596, 603-605] and in texts [81, 606]; ref. [108] is particularly useful for containing FORTRAN code 
for various algorithms. The method itself is treated in refs [63, 105, 106]. The diamond amplifier analysis benefits from 
treatments of Monte Carlo in refs [324, 586, 588, 589, 607]. Other electron sources that exhibit complex physics, such 
as the Monte Carlo modeling of transport through layers in flat cold cathode structures where space-charge effects lead 
to current self-quenching in a manner reminiscent of thermionic cathodes [150] are additionally useful. 

The unfolding of the Monte Carlo method has general features captured by the flow chart of Figure 32.9. Its essential 
attribute is the random choices made at key decision points that affect particle trajectories, or in the concise description 
by Peter J. Price [107], 



Figure 32.9 The main characteristics of a Monte Carlo program for the simulation of carrier (electron or hole or both) transport in semiconductors, 
based on a flow chart developed by Jacoboni and Reggiani [603]. Red equations suggest parameters associated with that element of the program. 


’William Blake, Blakes Selected Poems. New York: Dover Publications, 1995, p. 58. 
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The motion of a mobile electron in a solid, in the quasi-classical representation, is a sequence of trajectories governed by the external fields 
(and by any internal macroscopic structure as in a junction) alternating with scatterings determined by interaction with lattice motion 
and other microscopic deviations from crystal perfection. Both the duration of a trajectory (“path ") before termination by a scattering, and 
the electron state resulting from the scattering, are to be treated as random variables with given distributions. These random variables 
are produced in a determinate way as functions of one or more values of a standard random variable. 

as reflected in the prominent features of the flow chart of Figure 32.9 identifying where reliance on random numbers and 
the decision point associated with scattering events occur. When transport can be treated using a semiclassical Boltzmann 
transport equation, a pedagogical approach suffices, but when device structures become much smaller so that quantum 
mechanical effects (discrete energy levels and wave packet interactions with scattering centers) must be considered, 
then the semiclassical methods lose their validity and what Monte Carlo entails as a consequence becomes much more 
involved [608]. The simple models are sufficient to treat the temporal shape of emitted secondaries. 

32.2.1 Poisson statistics 

Now I'm winding up staring at an empty glass 
Ah ah ah ah ah ah ah ah. 

Cause it's so easy to say that you'll forget your past. 

- Greg Kihn and Steve Wright 6 

Electron emission, radioactive decay, line queueing, and yes, scattering, are kinds of random processes in which the 
occurrence of a previous event (emission of an electron, scattering with something) does not affect the likelihood of the 
next event, in contrast with events that are very much affected by the outcome of the prior event (having grandchildren, 
Russian roulette). That is, the processes are said to have no memory of what has come before. The noise associated with 
individual or discrete electron emission events, for example, is called "shot noise" and is of great interest for thermal 
[148, 221, 609, 610] and field emission cathodes [150, 175, 611-615], Such a process is characterized by a single time 
scale r used to parameterize the passage of time between subsequent events, a rather remarkable state of affairs. Why the 
simple assumption of processes which "forget their past" results in an exponential relation in time is worth investigating. 

So what is the question? The one posed by Hamlet is a good start: existence of a state or its absence pretty much covers 
all cases, and if the probability of occurrence is p, then Hamlet's formulation is equivalent to 1 = p + q, which is the same 
as q = 1 - p after rearranging. 7 The self-evident simplicity of the single event is rapidly complicated when more than one 
event is considered. For the harder question of a maximum of n events occurring in a measure of time, it is still the case 
that all possible outcomes are accounted for by 

1 -* +l ' y, -j s W=wf‘ rl (32 ' 14) 

where the binomial theorem in Eq. (5.16) is met once again. Observe that the sum is just the probability that an event did 
not occur at all ( k = 0) plus the probability that it occurred only once (k= 1) and so on till the last instance where every 
chance an event could have occurred for n times in fact occurred. The combinatoric factor n\/k\(n - k)\ (Section 4.4) out 
front allows for the order of the events not mattering, only that they occurred in the span of time t = n 8t. For example, 
the probability an event occurs three times for n = 4, given 1 = p 4 + q 4 + 4pq 3 + 6p 2 q 2 + 4 p^q, is then 4p 3 (l - p). 

For a given time duration, if the unit of time were, say, reduced (that is to say, n increased), then the probability p that 
an event occurring in that amount of time would have to be smaller, even as np stayed the same. This makes intuitive 
sense: a cold thermionic cathode, for example, may emit electrons so infrequently (the probability of emission being 
related to (T)) that changing the measurement of the time between events in units of seconds to milliseconds should 
not affect the measured estimate of number of electrons emitted per hour by much, for example at 350 K and O = 2 eV 
a barium dispenser cathode of area 1 cm 2 emits five electrons an hour on average. A Poisson process, then, conceptually 
takes the number of time units n -*■ oo, even as np -* 2 is held fixed. Conceivably, however, two of those aforementioned 
six electrons could be emitted within picoseconds of each other, even though the average time between emission events 
is far longer: being uncorrelated events (no memory), the duration of time to the next event could be anything. 


6 Greg Kihn and Steve Wright, lyrics to The Breakup Song (They Don't Write 'Em), 1981. 

7 Hamlet’s formulation "To be, or not to be - that is the question" (ref. [37], Hamlet m.i.56) is then rephrased as “p or not-p", as it were. 
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Now the question is: what is the probability that an event (like the emission of an electron or its scattering) has occurred 
k times in the length of time under consideration, given that the number of time units n making up that interval becomes 
very large? 8 That is, what is the large n limit of Eq. (32.14) for a fixed X? 


EXAMPLE: Assuming p = X/n, find 


» k\(n - k)\ ' 

SOLUTION: Use Stirling's approximation (Eq. (A2.4)) to find 


k\(n — k)V 

For small x, use (1 + x) n = e n ln (1+x) « e nx to show 


, k(l-k) \ 
2 n ) 


The n -* oo limit can be taken forthwith, leaving 


(32.15) 


(32.16) 


Generating random times between events and then measuring how many events occur before the time between the 
start of the clock and last event exceeds X shows just how good P k is as an approximation to the distribution, as suggested 
in Figure 32.10 using the algorithm of Section A3.30.1. The k = 0 term has a special place: it is the probability that no 
events occur in the time t = 1 unit, for which P 0 oc exp(— X). For an arbitrary number of units, this becomes P 0 (t) oc e~ u , 
or the probability that an event has not occurred decreases exponentially with time. Taking into account that integrating 
over all time must result in a probability of unity, then 


P 0 (f)dt = e xt Xdt 


(32.17) 



k 


Figure 32.10 Distribution of events for sequentially emitted particles Eq. (32.16) (red line) in three simulations or trials based on the algorithm of 
Section A3.30.1. Events randomly occur until the time separation between first and last exceeds 2. The number of events is N/i = 1250 (based on 
ref. [209]). 


8 The time unit must be small so that the time between successive events can be resolved regardless of how close they are, which accounts for 
the n-* oo limit. 
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32.2.2 Random number generation 


...Get your facts first, and.. .then you can distort 'em as much as you please. 


- Mark Twain 9 


Deep in Section A3.30.1 is the relation x ;+1 = Xj - In (r), where x } is represented by xt in the algorithm, and the random 
number 0 < r < 1 is represented by rand () in the programming language of the example. Why - In (r)? There is a reason 
for that, and it is based on how distributions of random numbers are generated. 

Let P{x) be the probability distribution such that P(x)dx is the probability that an "event" has occurred. It is use¬ 
ful to have a method to construct P(x) from the uniform distribution 7/(0,1) of random numbers 0 < r < l. 10 The 
acceptance-rejection (AR) method of von Neumann [63] then provides a means to generate random numbers distributed 
according to P(x) from a uniformly generated random number r. The algorithm is 

1. Generate two random number r, and r 2 . 

2. Compare r, to P(r 2 ). 

3. If r, < P(r 2 ) then accept r,; if not, try again. 

Although such a method works for any P(x), if P(x) has regions where it is sharply peaked and other regions where 
it is small, then employing the AR scheme can become inefficient, meaning more time than perhaps desired is spent in 
regions where P(x) is small. 

Some forms of P(x) enable a more elegant method. The examples chosen here will be when P(x) is linear and when it 
is an exponential (like P 0 (f) in Eq. (32.17)). P(x) is normalized so that 


f 


P(x)dx = 1 


(32.18) 


where L is the range of x beyond which the distribution P(x) is zero (although L can be infinite, as in the case of P 0 (f))- 
Separate the range of integration into regions that contain the same number of events. If there are n such regions, then 


(32.19) 


for j = 0,1, ...n - 1 with n large, which serves to define the Xj. Observe that if P(x) were constant (as it would be for a 
uniformly distributed random variable), then 1 /m = Ax is the width of the region. Now consider the two cases: 

• For the linear case with L- 1, then P(x) = 2x. Eq. (32.19) now entails that xj - xj_ y = l/n, or 


nr 


If n is large, then (j/ri) « r where r is the uniformly distributed random number. As a result, a random number x that 
is linearly distributed can be generated from a uniformly distributed r by x(r) = \fr. 

For the exponential case where P(x) = e~ x (A is taken as 1 and L -*■ oo), then Eq. (32.19) entails exp(-x ; _,) - exp(-x ; ) = 
l/n, or 


Xj = - In 


(32.21) 


Again, letting (j/ri) -* r identifies x(r) = - In (1 - r). However, if 0 < r < 1, then so does 0 < (1 - r) < 1, and we can just 
as well make the replacement r -*■ 1 — r as it will not change the distribution of r. Therefore 


x(r) = - In (r) (32.22) 

a result corresponding to the m = 0 (no "occurrences") case of an algorithm described by Knuth (ref. [411], p. 132) for 
the treatment of Poisson distributions. 

There are alternate methods to generate an exponential-like distributions to circumvent the computational burden of 
low probability events by grouping them into classes from which selections are made [450, 616], but the two options 
here are enough to explore. An algorithm to do so, which is used to generate Figure 32.11, is given in Section A3.30.2. 


9 Quoted in Rudyard Kipling, Prom Sea to Sea: Letters of Travel (1899). Project Gutenberg, EBook #32977, Contents of Part II, XXXVII, 2010, p. 167. 
10 Programming languages often have calls that generate a random number. In the language of MATLAB, it is rand (); in FORTRAN, it is 
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Figure 32.11 A comparison of Eq. (32.17) (expC-AAf/) with Eq. (32.22) (Knuth) and the AR algorithm. Based on Figure 2 of ref. [450]. 


32.2.3 Scattering and self-scattering 


What fates impose, that men must needs abide; 
It boots not to resist both wind and tide. 


William Shakespeare 11 


Now that a random number generation method has been developed to determine the time to the next scattering event 
A t= -t ln(r), with r being the average scattering duration, for a secondary electron based on Eq. (32.22), a means 
of determining the scattering mechanism must be chosen by a second random process. There are five or so scattering 
mechanisms that will form the core of those considered here, but choosing amongst them is not simply choosing them 
randomly. There is more art than that to the process. 

The relaxation time t has been cropping up regularly, from when Eq. (3.18) first made mention of it in introduc¬ 
ing conductivity in the Drude model of Section 3.2 right up through its appearance in the optical parameters in the 
Drude-Lorentz model of Section 31.6. How t appears in Eq. (7.18) (see also Eq. (31.63)) is of most interest here: it is a 
measure of how long a distribution / takes to "relax" back to its equilibrium distribution. Clearly, collisions are involved, 
wherein electrons scatter with their brethren, but also with lattice atoms, imperfections, and other impediments to their 
flow; it is also intuitive that each one of those processes takes their own particular toll on how departures from the 
equilibrium distribution evolve. One might, for example, write the impact of those various processes in Eq. (31.63) as 
(for simplicity, consider time independent and spatially uniform conditions) 




(32.23) 


where each scattering process T ; (k), of which there are n, has its own energy dependence. If scattering is characterized 
by an overall relaxation time t, then this line of thought suggests Matthiessen's rule of 



(32.24) 


Actually, Matthiessen's rule dealt with partial resistivities [52] where p = Y^jPy but insofar as resistivity is the inverse of 
conductivity, or pj = l/vj, and oy <x t ; -, speaking of t amounts to much the same thing. The probability that a scattering 
event characterized by t occurs is then related to the sum of the probabilities for each scattering process. 

Consider the overall probability for the occurrence of a scattering event in a short time interval St, and represent it as 
PSt. The probability that a scattering will not occur in that same time interval is 1 - PSt. If there are two time intervals, 
then the probability that an event will occur in the second interval is the product of the probability that nothing will 
happen in the first interval and something will in the second, or W 2 = (1 — PSty)PSt 2 (such language is reminiscent of 


11 Ref. [37]: The Third Part of King Henry the Sixth, IV.iii.58-59. 
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W n {k) in Eq. (4.27) of Section 4.4). That easily generalizes into the probability that something happens in the (n + l)th 
interval is (assuming all the intervals are the same 8t) 

W n+1 = (I - P8t) n = exp [n In (1 - PSt)] (32.25) 


but if 8t -*■ dt becomes infinitesimally small, then the In term may be expanded, c 


r n+1 oc exp ^ PSt^j Pdt -*■ W(t)dt = Pexp |- J Prff'J dt 

s the same for each interval, observi 
n energy changes, and so 

= P\k(t)\ exp | ^ P\k(t')\dt' | dt 


Although it was seemingly implied that P was the same for each interval, observe that the treatment does not require 
this: P can change, for example, as the electron energy changes, and so 


W(t)dt 

Were P constant, however, then W(t) -*■ W 0 (t ) could be written as 

W g {t)dt = - exp(-t/r)dt 


which would be very nice indeed: the total relaxation time t could be thought composed of individual relaxation 
time components ry, from which one chosen at random (in proportion to how much t depended upon it according 
to Eq. (32.24)) would dictate the nature of the scattering event. Nice, but not possible: a huge difference exists between 
Eqs. (32.27) and (32.28). 

A trick to bridge the gap is to introduce another, or self-scattering term r 0 which simply does not do anything: if the 
electron scatters according to it, the electron simply goes on its way unmolested. But the introduction of such a scattering 
term does bring Eq. (32.27) into Eq. (32.28), at the price of making the self-scattering t 0 long enough so that t takes on 
its maximum value (i.e., 1 /t takes on the minimum value of 2" =l (1 fr-) - mind the upper limit) which serves to define 

T n+l by 


= Z7 = r- + 27- ml " Ur) 


(32.29) 


with r n+] (k) defined to make this so. That will make for many self-scattering events that just do not do anything. The 
price, however, is worth it: the calculation of the free-flight times becomes tractable [603]. 

In practice, then, for the secondary emission simulation, the choice of which scattering event occurs is found by creating 
a vector P, the elements of which are [209] 


P k = 




(32.30) 


where the five scattering mechanisms of Section 32.3 are considered. For each electron, a random number 0 < r < 1 is 
generated, from which the electron is allowed to travel for t = -T min In (r), after which a scattering process characterized 
by r k is chosen from the five candidates, with r k determined by finding a k that satisfies P k <r < P k+] . If none is selected 
because r > P 5 , then a self-scattering event has occurred. 


32.3 Relaxation time 


Relaxation, then, is not an end; for it is taken for the sake of activity. 


- Aristotle 12 


Understanding how to calculate the initial and final states of a particle (such as an electron) undergoing a scattering 
event based on the properties of the scattering potential, be it due to other similar particles, oscillations of the lattice 


12 Aristotle, Nicomachean Ethics Book X (trans. W.D. Ross), Great Books of the Western World, ed. Robert Maynard Hutchins, Vol. 9, Aristotle II. 
Chicago: Encyclopaedia Britannica, 1952, Chapter 6, p. 431. 
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(phonons), impurities charged or otherwise, and imperfections in the crystal structure is complex [80-82, 97, 294, 617, 
618], The specific components of most use to the scattering of secondaries in diamond will be the focus [85, 209, 308], 
A potential 17 will change the dynamics of an incident electron, characterized by the unperturbed wave function 
|yr k (0) m \k)e~ la>kt by scattering it into other states. 13 As in Sections 8.2 and 21.6, the final state is expressed as a sum 
over the unperturbed states with weights c k ,. If the interaction potential is weak, then the original wave function will 
dominate or c k » c k , for k # k'. This entails 

MO) = J/ k (0 l k > e-^‘ (32.31) 

where the unperturbed wave functions are the solutions to 

H 0 |k> = E{ k) |k) = hw k |k) (32.32) 

Schrodinger's equation takes the form 

ihd t MO) = (H 0 + U) MO) (32.33) 


which, in light of Eq. (32.31), becomes 

ihd t c k , = £c k (k'| U(t) |k> exp [i (aj k , - aj k ) t] (32.34) 

k 

where k' is the final state (after scattering) and k is the initial state (prior to scattering). Letting U(t) = Ufe ±mt where the 
sign ± will come to denote the emission or absorption of phonons, then 


ihc k m = 2c k (k'|U± Ik) 


exp [i (w k , - w k + w) t] - 1 

(®k- - ®k ± ®) 


(32.35) 


Now invoke the fact that the interaction potential is "weak" so that one state dominates (the Born approximation ) in that 
c k , = <5 k k -, so that 

f exp {z~ (w k , - w k ± co) t} - 1 


c k ,(0 * <k'| u-- 


t Ik) | - 


ih (w k , - W k ± w) 


-} 


(32.36) 


The probability that the scattered electron is in the state |k') at time t is \c k ,(t)\ 2 dt: the probability that it ends up in that 
state is then the integration of |c k ,(f)| 2 dt over all time. That appears daunting, but observe that the time dependence is 
wrapped up in the term in brackets, and if the substitution Q, = (w k , - ® k ± co), then the time-dependent part of c k ,(f) 
looks like the far less threatening 

e' nt ~ 1 _ jeu/2 s in (Qf/2) 

i£lt (Q.t/2) y ' ' 

which is a rather sharply peaked function. The integration over all time therefore contains an integration that looks like 

J^(^') 2 dx = j[ (32.38) 

The integrand is sharply peaked about x= 0: knowing that the function c/>„(x) = {nxx 2 y l (sin nx) 2 is a delta sequence 
[400], or a function that in the limit of n -* oo behaves as a Dirac 8 function, then applied to the present circumstances. 


l^cof W | (k'| tJf |k)| 2 ( 8 (E(kr) - B(k) ± hco) 


as long as the time between collisions is much longer than how long the collision takes to occur [80]. The rate at which 
transitions take place, or |c k ,(f)| 2 /f = S(k, k'), is then 

(32.40) 

(32.41) 


S(k, k') = 5+fk, k') + 5“(k, k') 

S ± (k,k') = (^) |<k'l Ik)| 2 5 (B(k') -E(k) ± hco) 


13 In the present section, let the vector k be instead denoted by k to avoid clutter, with arrows above kets, subscripts, and primed 
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As long as something does not interfere with the scattering occurring, then the relaxation time r(k) is defined by 


FirS s < k ’ k '> 


(32.42) 


If something does interfere, like a high concentration of carriers or the occupation of the final state (as occurs for metals 
and degenerate semiconductors), then accommodations do have to be made. Now Eq. (32.42) governs how often col¬ 
lisions occur, but knowledge of how long either momentum or energy takes to randomize through collisions is also of 
interest, necessitating the introduction of a momentum and energy relaxation time [80, 81]. When the change in momen¬ 
tum matters, and if the initial direction is defined as the z direction, then evaluation of the momentum relaxation time 
r m introduces a factor 


t k £2 k 

that is appended to S(k,k') in Eq. (32.40); for energy, the insertion of 


(k-k') (k-k') , k ' 


g(k') _ (V\ 

' E{ k) V k ) 


(32.44) 


defines the energy relaxation time t f when appended to Eq. (32.40). Observe that for elastic scattering where there is no 
change in energy (i.e., k' = k) then l/r E ->• 0, meaning there is no relaxation in energy. 

Specifying the initial and final states, the nature of the potentials, the effects of band structure, and much else enor¬ 
mously complicates the evaluation of relaxation times for various processes that matter, more important ones being: ( i ) 
scattering off other electrons, (fi) acoustic phonon scattering, (iii) optical and polar optical scattering, (iv) ionized and 
neutral impurity scattering, and (v) others like piezoelectric scattering. The evaluation of the related relaxation times is 
sufficiently complex to have generated a large literature [80-82, 84, 85, 209, 294, 548, 559, 560, 617, 619, 620], For 
describing how a bunch of scattering electrons behaves in transport through a diamond layer, though, the exact form is 
less important than the qualitative effect, and that affords some flexibility, here expressed as allowing two simple mod¬ 
els in particular to stand in for the remainder in such a way that electron-electron and the electron-phonon scattering 
factors feel plausible when they are introduced rather than derived. The former have much in common with scattering 
off an ionized potential, while the latter are more like scattering off sinusoidal potentials. Although the discussion of the 
scattering times is motivated by considerations of secondary emission, the discussion is kept more general so as to be 
relevant to photoemission as well, particularly with regards to electron-electron scattering. 


32.3.1 Ionized center 

.. .somehow the arc of the moral universe is long hut it bends toward justice. 

- Martin Luther King, Jr. 14 

A bare Coulomb potential (4Q/r) can, allegedly like justice, bend the trajectory of a passing charged particle no matter 
how far away it is. In the parlance of scattering as a cross-section, the bare Coulomb potential therefore has an infinite 
cross-section [90], For scattering as a relaxation time, t can be related to cross-section a and a volume V by the velocity 
v as 

- = vaV (32.45) 

An infinite cross-section a therefore implies a vanishing relaxation time: a charged particle is always getting knocked off 
course. In materials, that will change due to the presence of screening. 

Modifying the Coulomb potential by adding a screening factor e~ Kr results in the Yukawa potential of Eq. (21.89): were 
the ionized core to be multiply charged, then Q -*• ZQ in the Coulomb part (e.g., a doubly ionized potential would have 
Z = 2), but for simplicity, ignore that complication. The operator U g is 

40 

U°= ^exp(-x|f|) (32.46) 


14 Martin Luther King, Jr., Sermon at the Temple Israel of Hollywood, 26 February 1965, http://www.americanrhetoric.com/speeches/ 
mlktempleisraelhollywood.htm. 
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Recalling the methods of Sections 8.2.1 and 8.2.4 extended to three dimensions, with familiarity of Section 22.6 and in 
particular Eq. (22.36), then in some detail 15 

(k|£yk') = J drdr' (k|r) <r|&Jr') (r'|k') 

= J drdr'^e^Sir - r') exp(zk ■ r - tk' • r') (32.47) 

= \j dA-"exp(i V -r) 

where V is volume and p = k - k\ Writing out the integral explicitly, using dr = r 2 sin 0 drdOd(/>, then 

(i2M) 

where x= cos 9 and the integrals are straightforward. As a result, using the continuum version of Eq. (32.42), and 
observing that p = |k - lc'| entails 

p = \/W^id 2 ^-2kidcosO -*■ = 2ksin(0/2) (32.49) 


because the energy 8 function demands k = k! , then 

1 _ 8 /2y\ r k 2 dk8[E(k)-E(kr)] 

r(k) V\h){ K s ) J 0 [ K 2 + 4 A; 2 sin 2 (0/2)] 2 


(32.50) 


There are four modifications that now must be made. First, the relation is for only one scattering center. In a crystal, there 
are generally JV,- = Vp t scattering centers, where p t is the number density of them, so that 1 /t -*• N t /r = 1/t /( . Second, to 
perform the integration over dk the relation 


(32.51) 


must be employed. Third, to get the momentum relaxation time, recall that the replacement (1 - cos 6) must be included 
in the integration. Lastly, introduce g = 2k/ k, whereupon 


1 _ nm n /4Q\ 2 7°° sin 0(1 - cos 0)d0 

Hi® ~ 2(M) 5 1 K s ) J 0 + s in 2 (0/2)] 2 


The integral is analytic, as can be shown by the substitution x = cos 0 and undoing the half-angle formula for sin 2 (0/2). 
It gives 


The result is the Brooks-Herring (BH) relation, for which 


3 sin 0(1 - cos 6)d6 


l2 = 4 In (1 + g 2 ) - -rr~2 

10/21 1+ f 


Tu(k) 


1 6xm n p i / 4 Q\ 2 i 
{Tik? \K S ) f 3 


(32.54) 


where the replacement k = qk/2 has been made so that the final result can be expressed in terms of the dimensionless g 
in the function G(g) defined by this equation. 

Much hinges on the value that k takes on. Recall that the screening factor k was introduced to prevent infinities that 
otherwise attend scattering from a Coulomb potential: were there no screening (x -* 0 or g -* oo) then the relaxation 
time would go to infinity as f 3 /(2 In (f)) as in Figure 32.12. Alternately, in the Conwell-Weisskopf ( CW) approximation, 
the infinities are avoided by truncating the offending parts of the integration at a lower scattering angle [80, 621], But 


15 Here, it is easier to use the bold face notation for r -*■ r to indicate vectors. 
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Figure 32.12 Behavior of G(2k/ic ) of Eq. (32.54), alongside its asymptotes. 


here, analogous to the BH model using k, then if the screening of the ionized impurities by electrons is governed by 
Debye's screening constant of Bq. (21.88), then 



(32.55) 


The transition between the BH and CW approximations is governed by the length scale associated with the average 
separation of the charged scattering centers: if l/(2p^ 3 ) > k, then use of the BH approximation is good [81, 621], 


EXAMPLE: For parameters loosely suggested by diamond, or m n = 0.57 m, T = 300 K, K s = 5.7, and assuming that 
the electron density and charged ion densities are p 0 = 10 17 cm -3 and p, = pj 10, find r u (k) when q = 1. 
SOLUTION: In [nefq] units of Table 2.2, Eq. (32.55) becomes 



Setting f = 1 entails that k = k/2 and 


209.56 ■ 250.71 =2 „ 157xl0 . fe 

256 jrninPi 256n ■ 3.2408 ■ 10 -5 
then z u (k = k/2) = r 0 G( 1) = 10.436 ns. 


32.3.2 Sinusoidal potential 

Trying to understand the way nature works involves a most terrible test of human reasoning ability. It involves subtle trickery, beautiful 
tightropes of logic on which one has to walk in order not to make a mistake in predicting what will happen. 


Richard R Feynman 16 


In contrast to the screened charge potential of Eq. (32.46), consider instead a potential that is oscillatory and extends 
over all space (it could, for example, be just one component of the Fourier representation of a real potential). The potential 
takes the form of 


U 0 = 2a q cos(q ■ r) = + U~ 


(32.56) 


16 Richard P. Feynman, The Meaning of it All: Thoughts of a Citizen Scientist. Reading, Mass.: Perseus Books, 1998, p. 15. 
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which joins the usual e iq ' T of Fourier transforms with its complex conjugate so as to render the potential (r| U\rr) real. 
It is quick work to show 

(k|0±|k') = ^| J dr dr' (k|r) (r|c ±iq ' f |r') (r'|k') 

= ^ J dr dr’e ik ' I e ±i<l ' t e~ ik ' I S(r - r') 

= a q <5(k-k'±q) (32.57) 

As a result, 

S^k, k') = |a q | 2 «5 [E(k) - E(k') ± too] 8 (k - k' ± q) (32.58) 

and S(k, k') = 5 + (k, k') + S“(k, k') as in Eq. (32.40). 

How co{ q) behaves is given by the dispersion relation. There are two types to consider, one being where w(q) = co 0 is 
a constant and the other being where w(q) = v s q, where v s is the sound velocity. Though not apparent yet, the second 
type occurs for small values of q and is associated with the collective motion of lattice atoms, and so is called the acoustic 
mode. The first type is correlated with lattice atoms moving out of phase with each other and because such motion 
interacts strongly with light, it is called the optical mode [53, 292]. Insofar as lattice motion can be treated in the language 
of particle interactions, the vibrations are termed "phonons". The literature [80-82, 294, 618] is extensive, whereas 
acquiring a feeling for what is involved will be more useful here (following ref. [85]). 

Consider a linear chain of atoms, but of two types, and let the harmonic forces holding them in place be visualized 
as little springs, much like in Figure 32.13. If the masses were equal, or M A =M B = M, then the force on the nth atom 
connected to its nearest neighbors by springs with a spring constant g would be 

Mu„ = g [(u n+l - u„) - (u n - i vO] = 9 K+i + k_i - 2u n ) (32.59) 

On the other hand, if the masses were different, then there would be two equations, one for atom n and the the other 
for its neighbor n + 1, given by 


M A u n =g{u n+l +u n _ 1 -2u n ) 

m bK+i = 9 K+2 + u„ ~ 2 u n+l ) (32.60) 

If solutions of the form Uj = Cj exp (iwt - ijqa) for j = n or n + 1, and where ja is the position of the jth atom, then 
Eq. (32.60) becomes 


M a co 2 C a = 2 g (C A - C B cos(qa)) 

M b co 2 C b = 2 g ( C B — C A cos(qa)) 

Two equations for two unknowns C A and C B suggests a matrix solution of the form 

M a oo 2 — 2g 2g cos (qa) 

2g cos (qa) M B m 2 - 2g 



(32.61) 


(32.62) 



Figure 32.13 An arrangement of two types of atoms (A = green, B = red) on a linear chain characterized by different masses M A and M B , and connected 
by springs with a spring constant g. The positions of the Kth mass is denoted u n (t ): the solid spheres show the equilibrium position, and the dashed 
faded spheres their positions at some time t. 
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Figure 32.14 The behavior of Eq. (32.64) for various values of r. The lines are labeled by [P, »], where P is either acoustic (A) or optical (0), and n refers 
to the power of r = 2 1_ " = 1, 1/2, 1/4, and 1/8 for the curves shown. The acoustic branch behaves as v s q, and the optical branch as a> 0 , for q small. 


For a solution, the determinant of the 2x2 matrix must equal zero, and so after a bit of manipulation, 

M a M b co 4 - 2 g (M a + M b ) co 2 + 4g 2 sin 2 (qa ) = 0 (32.63) 


for which the quadratic equation yields the solutions. With the introduction of the ratio r = MJM B , the "reduced mass" 
M = M a M b /(Ma + M b ), and the frequency co g = sJlg/M, the solutions take on the more compact form of 

(^) 132641 
The behavior of co ± /w g given in Eq. (32.64) is shown in Figure 32.14, from which it is seen that the small q values for r 
small to r « 1 are visually similar. 

Only because it is particularly simple to do so, consider the behavior of ® ± /® a for r = 1 rather than general r, for which 


— = - (i ± cos(<?a)) 


Finding the behavior of 
seen to be 


to order {qa) amounts to the replacements cos (qa) -> 1 and sm(qa) -* qa. For ® + , it is easily 
(l/2)(l + l) = e» e (32.66) 


The same replacements for co_ take a bit more finesse as the replacement appears to make the right-hand side vanish. 
Consider instead the equivalent expression 


sin(ga) 

i/2[l + cos(<?«)] 


qa qaw 0 



(32.67) 


which is easier than Taylor expanding the original relation to first order. 

The two different behaviors have now been identified, with w_ being the acoustic phonon mode and co + being the optical 
phonon mode. 

• Acoustic: the dispersion relation co_ = v s q gives the sound velocity as 


(32.68) 


where the A and B atoms are mostly moving in unison. In the spring metaphor, such coordinated motion does not 
overly compress the springs, so the energy of such oscillations is small. 

Optical: The frequency ® + » w„ is approximately constant for small q. It corresponds to when the A and B atoms are 
moving out of phase, that is, in opposite directions. As a result, the springs can be thought of as experiencing substantial 
compression (or elongation), so the energy of such oscillations is large. 
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Table 32.1 Sound velocity compared to the Bohm-Staver (BS) relation of Eq. (32.72) for 
longitudinal (long) waves. The ratio is measured over the BS values of v s . Sound velocities 
similar to ref. [55], Fermi energies based on ref. [86], p. 150, except for tungsten. 


Element 

v s (long) [m/s] 

p[eV] 

M [g/mole] 

v s (BS) v [m/s] 

Ratio (m/BS) 

Berylium 

12890 

14.14 

9.012 

10046.1 

1.28 

Magnesium 

5770 

7.13 

24.305 

4343.9 

1.33 

Aluminum 

6420 

11.63 

26.982 

5265.5 

1.22 

Iron 

4994 

11.10 

55.845 

3575.6 

1.40 

Copper 

4760 

7.00 

63.546 

2661.9 

1.79 

Zinc 

4210 

9.38 

65.380 

3037.8 

1.39 

Silver 

3650 

5.48 

107.868 

1807.7 

2.02 

Tungsten 

5220 

11.00 

183.840 

1961.8 

2.66 

Gold 

3240 

5.51 

196.967 

1341.4 

2.42 

Lead 

2160 

9.37 

207.200 

1705.5 

1.27 


The spring constant g can be related to the deformation potentials E for various crystals [80, 85, 292], Because the 
motion of the ions tends to drag the much lighter electrons with them, there is interaction between lattice vibrations and 
electrons, and such interaction can be described in the language of electron-phonon scattering. A quick understanding 
of what the sound velocity depends on can be understood from a statistical mechanical argument. 

A method to approximate the sound velocity in metals is, curiously enough, to treat the positive ions as a plasma and 
talk of their "plasma frequency" [622], where co p was discussed in the context of Bq. (31.55). The electrons are light 
and will therefore adhere to the ions as they vibrate. A conceptual shortcut is to run through the discussion as though 
electrons were being discussed, but swap the electron mass m for the ion mass M (equivalently, think of the ions as just 
fat electrons). Recalling that the plasma frequency is then 


/ q 2 Pc \ 1/2 = / 16k Qp g \ 1/2 
Wp \K s £ 0 m) V K s M ) 


(32.69) 


where the first form is in the conventional [MKSA] units and the later in [nefq] units (Table 2.2), M represents the 
mass of the ion and K s is the static dielectric constant. Recall that the dielectric constant gives rise to a shielded Coulomb 
potential V(r ) = 4Qe~ kTpr fr where k TF is the Thomas-Fermi wavenumber of Eq. (21.85). From the Fourier transforms v(k) 
and V 0 (k), evaluated much like Eq. (32.48), the relation V{k) = V 0 (k)/K s {k) results in 


which therefore entails that 


Taking the sound velocity as 


2 \67i Qp 0 k 2 


p M{k 2 + k 2 TF ) 

= do:> p /dk in the limit k -* 0 then gives 
2 _ 16 7rQp 0 _ 4a 0 Qkp _ 
Vs ~ Mk 2 TP 3M 


3 M 


(32.70) 

(32.71) 


(32.72) 


where the relations a 0 = h/amc, Q = ahc/4, p 0 = kp/(3n 3 ) (remember Eq. (6.24)) and p = h 2 k 2 /2m. Eq. (32.72) is known 
as the Bohm-Staver (BS) relation, and for the simple metals it gives a reasonable account of the sound velocity, as shown 
in Table 32.1. 


32.3.3 Electron-electron scattering 

Even if suffering makes you gentler and wiser, it is still bad for you. It is still a bad thing that this was the way you managed to become 
gentler and wiser, rather than some other way. 

- Terry Eagleton 17 


7 Terry Eagleton, On Evil. New Haven: Yale University Press, 2010, pp. 134-135. 
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What causes energetic and otherwise hot-headed electrons to calm down and equilibrate with their environment (or 
thermalize) is their mutually harsh interactions with each other. The scattering between carriers (particularly electrons) is 
quite important in photoemission models for metals as in Section 31.7, or in highly doped semiconductors, but is almost 
ignorable for scattering mechanisms operative in high purity diamond. Its importance though recommends r ee be treated 
in any discussion of scattering, and so it is. 

Conceptually, the scattering between electrons bears similarity to the scattering of an electron off an ionized impurity, 
with the difference being that the ions are much more massive. The evaluation of the scattering time r ee is arduous [80, 
85, 97, 548, 617, 623]. When undertaken, r ee is found to be 18 


--“-ssHHsol'Sr 


where n is the chemical potential, and the other terms have their meanings as in Table 2.3. The function y{x) is defined 
by 


} 4 (l + x 2 


arccosl \/(x 2 + 1) | ^ 

V* 2 + 2 


[./V?TT]) 


(32.74) 


and the parameter q 0 is from the Thomas-Fermi approximation to k TF (Eq. (21.85)) now modified by an inserted factor 
of K s (5.2 corresponds to copper) to account for d-band contributions, or 


qo = (^ y =(^ y 2 (32.75) 

0 \2K s E 0f iJ \nK s aJ 

These equations are busy, but can be simplified. For metals, y(x) is reasonably well approximated, when (2 k/q 0 ) 2 is large, 

'GWfT'ft) 


Consequently, if t 0 is defined by 


_ 2 ha 0 {KJnY 
nQ y(2k P /q 0 ) 


then (recalling that E = h 2 k 2 /2m and q = h 2 k 2 /2m) 




The performance of r ee (k) (Eq. (32.73)) compared to the approximation of Eq. (32.78) is shown in Figure 32.15, for which 
the agreement is good. 

Two limiting cases in the behavior of t ee {k) in terms of p{E - q) arise. First, when (E-n) » nk P T, then the (1 /k B T) 2 
factor in the curly brackets of Eq. (32.78) cancels the same factor in t 0 of Eq. (32.77), rendering r ee approximately 
temperature independent. In the opposite limit (E- /<)«: nk B T, when E hugs the Fermi energy n, causes a different 
behavior to come into play. The average relaxation time (r ee (A:)) due to e-e scattering is more germane: the averaging is 
done over the probability that the initial scattering state is occupied (governed by f PD (E) and the final scattering state is 
unoccupied (governed by 1 —f PD (E)). Because f PD (E)(l - f PD (E) acts like a Delta function S(E - q) as in Eq. (7.25), for low 
and room temperature conditions, then (r ee (k)) <x t 0 , and the temperature dependence of t 0 then suggests 


<*«(*)> 


2 ha 0 K? / , V 
xQyC2k p /q 0 )\k B T) 


A 0 T 2 


(32.79) 


The form 1/ (^ ee ) -*■ l/r ee = A 0 T 2 is commonly encountered [86, 292] and is useful in considering intense laser heating 
of metals [206, 624, 625] where the electron population heats faster than the background lattice. 


ls The cited references can differ depending on the approximations made. The present text follows refs [85, 548, 617]. 
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Figure 32.15 Comparison of t k (Eq. (32.73)) with its approximation (Eq. (32.78)) as a function of z = P(E - n)lic for values typical of copper {/j = 1 eV, 
m n = m, and K s = 5.2.) as performed in Section A3.31. The approximate formworks well over the range shown but begins to depart when E - /r 
becomes large with respect to k B T. 


32.3.4 Acoustic phonon scattering 

Good, good, good, good vibrations 
She's giving me excitations 

- The Beach Boys 19 

Phonons are lattice vibrations, so that a proper account of them should quantize the energy levels of coupled harmonic 
oscillators (governed by a Bose-Einstein distribution n(hm q )) and explore how oscillations in the surrounding electron 
density (governed by a Fermi-Dirac distribution) varies with their motion. Such a program is difficult [80, 82, 97, 560, 
617], but much can nevertheless be uncovered by starting more or less in the middle. If one is prepared to accept that 20 

- (ik /'^ + ”K + [1 + »-/-]■*-} <32.80) 

where notational simplicity is achieved by dropping q and FD subscripts and f FD (E 2 ± hoo q ) —► f ± , n(ha > q ) -*■ n, and 8 ± = 
8(E 2 —E 1 ± hoo q ). Looking at the ^-function arguments (the vector q and its magnitude q do not refer to the unit of charge 
also called q - the context should make the distinction apparent) 

E 2 — E r (q) ± hco q = (g 2 + 2k • q) ±hw q = ha>(^ cos0± 1^ (32.81) 

where v k = hk/m and v s is the familiar sound velocity from the dispersion relation co q = v s q, and the inconvenient q 
subscripts on w q are abandoned. But this means that the 8 functions only affect integrations over angle associated with 
q, not its magnitude q. When those integrations are performed [85, 623], what is left resembles 

1-t = 2Xn 2 f ° dw w 2 T [fi{E - h), phco ] © (v k - v,) 

T epW (2k p V s ) Jo 

_ (coshx + l)(coshy + 1) 

(cosh x + cosh y) sinh y 

where 2 is a constant (not a wavelength) born of |a q | 2 : it relates the dimensionless X to the deformation potentials that 
govern lattice vibrations [80, 85], The factor hco D = k B T D uses the Debye temperature T n of Section 7.4.3: the frequency 
co D is specified by the maximum number of modes that can exist for the vibrations of the lattice. In a one-dimensional 
chain of N atoms, for example, there could not be more than N modes, and similar arguments apply to three dimensions. 


(32.82) 

(32.83) 


19 Brian Wilson and Mike Love, lyrics to Good Vibrations, 1966. 
20 If one is not, see ref. [85], Section III.F for its origin. 
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The thrust has been to obtain the function T(x,y) because near the Fermi level E « n, and so 



(32.84) 


Working with it gives 



sinh(x)^ 


(32.85) 


where x D = T D /T and x = phco. If x D is small, as would be the case in a 


high-temperature limit, then 



B 0 T 


(32.86) 


as often encountered in discussions of conductivity (e.g„ the Drude model) and laser heating of metals that distinguish 
between the temperature of the lattice and that of the electrons over short times, as in Eq. (32.79). 

The temperature behavior of acoustic phonon scattering is a bit more complicated, though. In switching over to the 
momentum relaxation time as per Eq. (32.43) and performing the integrations, the net effect is to introduce yet another 
factor of w 2 into the integrand with T [97]. Recalling the evaluation of conductivity, which made use of (t) for which r ep 
is a dominant contribution, then the form gives rise to the Bloch-Grtineisen relation [561, 619, 626, 627] for which the 
integral form is 



(32.87) 


an equation which hearkens back to Eq. (7.44) in form. The form of the integral is a specific case of a more general one 
encountered for particles obeying Bose-Einstein and Fermi-Dirac statistics of the form [84] 



(32.88) 


so that the Bloch-Griineisen function is equivalent to W_(5,x). Approximating that function is pedagogically useful. 

Consider the high-temperature limit first, for which T D /T becomes small: the small x behavior of W_(5,x) is found by 
Taylor expanding the denominator, or 



(32.89) 


for which the leading order behavior is (T D /T) 4 / 4. When coupled with the coefficient of Eq. (32.87), the linear behavior 
of 1/ (r ep ) oc T for sufficiently high temperature results, as expected. 

For low temperature, T D /T becomes large. Rewrite the defining integral as W_( 5,x) = W_( 5, oo) - SW where 



(32.90) 


Consider W_(5, oo) first. Treating e x as an expansion parameter, then 



(32.91) 


where the Riemann f function of Section 6.3 makes yet another appearance, f(6) = 1.03693, and where T(6) = 5! = 120, 
so that W_( 5, oo) = 124.431. As for SW(x) it can be transformed into 



(32.92) 


An approximation may be had by noting s 5 can get rather large in Eq. (32.90) before e~ s dominates it (the maximum of 
s n e~ s occurs at s = n and e 5 = 148.413), so that the approximation e~ s <<: 1 in the denominator allows for the consideration 
of the simpler integral 


8W(x) s 5 e~ s ds 

= - (x 5 + 5x 4 + 20x 3 + 60x 2 + 120x + 120) e~ x 



(32.93) 
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Figure 32.16 The BIoch-Griineisen function W_( 5,x) compared to its small and large approximations given by Eqs (32.89) and (32.93), respectively. An 
algorithm for its evaluation is given in Section A3.32. 
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Figure 32.17 The resistivity of various metals. Although dominated by acoustic phonon scattering at most temperatures, at low temperatures other 
mechanisms come into play. 



The limiting case approximations of Eqs (32.89) and (32.93) are compared with the numerical evaluation of W_( 5,x) in 
Figure 32.16. 

Conductivity o(T) is the inverse of resistivity p(T), which for metals is rather varied, as in Figure 32.17. That variation 
seems to suggest an underlying simplification may be out of reach, but then Eq. (7.29) can be written as 


piX) _ («Tq)) 

P„X „) <t(T)> 


(32.94) 


where p 0 (T 0 ) is the resistivity at a reference temperature T 0 . Insofar as acoustic phonon scattering is the primary contri¬ 
bution to resistivity, the ratio of the relaxation times in Eq. (32.94) can be approximated by the acoustic relaxation times 
(except for low temperature, where other relaxation times contribute via Eq. (32.24)). It is possible to use W_(5,x) to 
ascertain the Debye temperature first encountered near Eqs. (7.43) and (7.44). If so, and using W_( 5,1) = 0.236616, then 


- = 4.22626 — W_ 5, — 


(32.95) 


which is a substantial simplification: one ratio (T/T D ) controls the shape of the resistivity. The value of T n must be found: 
a least squares algorithm can be used. Let the measured values of resistivity pj be at the temperatures 7} (i.e., p(Tj) = pj) 
and let the data be ordered by 7) > T)_j. Then the quantity S defined by 




w_( 5, T/Tj) 1 ) ; 
W_(5,T/T n ) I J 


(32.96) 
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Table 32.2 The Debye temperatures T D and the corresponding resistivity for various 
metals, as determined using the algorithms of Section A3.32. Values of T D from ref. [86] 
(Kittel) are also shown. 


Metal 

T d (fit) 

p D [pfl-cm] 

T d 186] 

Ag 

238.2 

1.258 

225 

Au 

184.6 

1.339 

165 

Cu 

349.0 

2.056 

343 

W 

385.8 

7.492 

400 

Pt 

188.8 

6.318 

240 

Ni 

519.2 

19.197 

450 

Al 

416.9 

4.060 

428 

Mo 

391.6 

7.811 

450 


is minimized when T = T 0 . An algorithm to do so, given in Section A3.32 for data culled from Table (9d-2a) of the AIP 
Handbook [55] for various metals, and from Matula [296] for copper, gold, and silver, is used to find the entries of T D 
shown in Table 32.2. When the resistivity data is plotted in the coordinates of pj/p D vs Tj/T d (where p D = p(J D )), then, as 
Figure 32.18 demonstrates, the resistivities of metals follow the behavior suggested by Eq. (32.95). As Table 32.2 shows 
in its comparisons to T D from the literature, acoustic phonons catch most, but not all, of the resistivity behavior for 
typical temperatures. In some cases, the form 1 /t « A 0 T 2 + B 0 T (Eqs (32.79) and (32.86)) does well in capturing what is 
of interest [206, 625], the coefficients being related to material parameters. 

The cold realization that diamond is quite far from being a metal can no longer be contained. But diamond is a crystal, 
as are the metals, and it is electron migration through the lattice that gives rise to acoustic scattering. The acoustic phonon 
relaxation time can be transformed from its previous metal-friendly form [85] to one more appropriate [80, 209, 561] 
for examining the diamond amplifier. That form is 


jrh^pv 2 r / t \ 
4m„S 2 mk B T D [\T^) 


W_(5, TJT) 


(32.97) 


where p = 3.51525 g/cm 3 is the mass density, H is the acoustic deformation potential (variously estimated at 8.8 eV [628], 
8.7 eV [603], and even 5.5 eV [629]), and v s is the sound velocity (18000 m/s) for diamond. Compared to metals, the 
Debye temperature for diamond is substantial, weighing in at 1860 K (or k B T = 0.1603 eV). Therefore, to leading order, 
r ac « 7uh 3 pv 2 /4mE 2 k B Tk(E) is good enough at room temperature. The hyperbolic relation (Eq. (23.27)) 


2m„ \ 



(32.98) 



Figure 32.18 The resistivity values of Figure 32.17 but as p(T)/p D vs T/T n , as in Eq. (32.95), for the Debye temperatures of Table 32.2. 
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relates k to E = E' — E g , but the size of diamond's band gap (E g = 5.5 eV) lessens the hyperbolic modification to the 
parabolic relation for electrons not much above the conduction band. For typical values of secondary electron energy, r ac 
starts in the range of 10-100 fs. 


EXAMPLE: Find r ac in diamond for a secondary electron for two cases: first, for the secondary energy at its creation 
of 13.75 eV and, second, after it thermalizes to k B T at room temperature. Use the values m n = 0.57m and E = 8.7 eV. 
SOLUTION: In [nefq] units, k = Vl4.961 x 13.75 X 19.25 = 26.833 nm” 1 . Next, with pvj = 7108.7 eV/nm 3 , then 
trh^pvj/4k B T D m n S 2 = 40.495 fs/nm and k B T D = 0.16028 eV. When E = 13.75 eV, k(E) = 26.833 [1/nm], whereas 
for E = k B T= 0.025852 eV, it is k{E) = 0.62336 [1/nm], With 7/7^ = 0.16129, then W_(5, T D /T) = 74.678. So, 
r ac = 185.14 fs for E = 13.75 eV and r ac = 7.9693 ps for E = 0.025852 eV. 


32.3.5 Optical phonon and ionized impurity scattering 

Well something's lost, hut something's gained 
In living every day. 

- Joni Mitchell 21 

The last two scattering processes to be considered are optical phonon and ionized impurity, although there are others 
[80-82, 306]. Non-polar optical phonon (NOP) relaxation times involve the emission or absorption of an optical phonon 
tico 0 and are given by [294] 


r NOP,e - 


r NOP,a ~ 


yflntfp hco 0 

DWJ 2 (1 + n ( phco 0 )) (E - hoo 0 ) 1/2 

\flnh 2 p Tico 0 

E>WJ 2 n(phm 0 ) (E + hco 0 ) l/2 


(32.99) 

(32.100) 


where e and a subscripts correspond to "emission" and "absorption" of a phonon with energy hco 0 , and n(x) = 1 /(e* - 1) is 
the Bose-Einstein distribution governing phonons, with phco g = hco 0 /k B T = 6.3051 at T = 300 K for diamond. The factor 
D 0 is the energy per atomic displacement [80]. When the energy is high, emission dominates, and secondary electrons 
that are just born lose most of their excess energy via optical phonon emission. If there is an internal field in the diamond, 
then energy acquired through acceleration in the field can likewise be lost this way. The secondaries are initially borne 
with a rather large energy of 2.5 E g = 13.75 eV: each non-polar optical scattering event associated with "emission" of a 
phonon releases hco 0 = 0.163 eV to the lattice (whereas absorption would have the electron acquire that energy). The 
dependence of the relaxation time on ( E - hoo) means that although NOP is very effective in reducing the electron energy 
while it is high, as the electron energy drops there comes a point where NOP fades in importance and acoustic phonon 
scattering dominates, as in Figure 32.21. 

Although the diamond is grown as pure as possible, boron doping invariably occurs, albeit at small levels. 22 Neverthe¬ 
less, it serves as a source of neutral and ionized impurity scattering. The neutral relaxation time is 


Qm n c 
5 WK 0 N n 


(32.101) 


The ionized impurity rate is more involved: 
compared to the band gap is 

1 


an asymptotic form [209, 308] valid when the phonon energy is small 


xti i 

Qm 2 „ 


E 2 a 

P i n (2) J 

^E{E + E g ) 


(32.102) 


21 Joni Mitchell, lyrics to Both Sides Now, 1969. 

22 A doping of 10 14 atoms/cm 3 is considered small. It corresponds to about 100 atoms per cubic micron or, put another way, a boron atom every 
0.215 pm. 
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In the absence of fields, the ionized impurity density N t is obtained from the neutral impurity density N n by Eq. (22.17), 
with E b = 0.365 eV. At room temperature, NjN b < 2 x 10 -7 . However, when fields are present the impurities are ionized 
within the depletion region (Section 24.4). These ionized boron atoms can shield out an externally applied field over a 
length (the depletion width) given by L D = F/i&nQNJ, where F is the vacuum field (using the vacuum field bypasses a 
formula requiring the dielectric constant K s ). For example, if F = 50 eV/pm (corresponding to £ = 50 MV/m), then L D at 
55.26 pm. As diamond films thinner than a depletion width will be seen to be desirable, then the boron is assumed to be 
ionized throughout [209], 


32.3.6 Scattering angles 


I shall be telling this with a sigh 
Somewhere ages and ages hence: 

Two roads diverged in a wood, and I - 
I took the one less traveled by, 

And that has made all the difference. 

- Robert Frost 23 

In a Monte Carlo simulation, knowing that a scattering event has occurred is only part of the problem. The other part 
is knowing down what road the scattered electron heads, and with what energy. Consider first a collision between equal 
mass particles (representing electrons in a metal) to understand how the energy and momentum are constrained by a 
possible lack of available final states, and then return to the problem of diamond which forbids final states being within 
its band-gap. What is allowed makes all the difference. 

The collision of a primary electron (a) with a target (b) electron is most easily understood by shifting from the lab frame 
to a frame where the z axis is in the direction of the incident particle and the target particle is at rest, as in Figure 32.19. 
The notational convention is as follows: the velocity of the A electron (primary) in the rest frame of the B electron is 
designated u. After the collisions, the daughter electrons A' and B' have velocities u' and v', respectively, in the B frame 
(primes therefore indicate post-collision electrons). Given that the masses of the particles are equal, the equations for 
conservation of energy and momentum become simple: in going from the lab frame to the rest frame, they are 


E a + E b = E' a + E' b =>u 2 = u' 2 + v' 2 (32.103) 

v a + v b = v' a + v' b ^u = id + ? (32.104) 


To satisfy both equations is to ensure that \u' + v'\ 2 = u' 2 + v' 2 , in other words the vectors u' and v’ must be orthogonal, 
or u' ■ v' = 0. In terms of the rotation matrices (Section 8.2.1) 


cos 0 
- sin# 


(32.105) 



The frame in which to analyze the collision is best accomplished in the rest frame of the b electron. The coordinate system is rotated by the 
rix R. In this rest frame, the daughter electrons emerge at 90° to each other, so the Pythagorean theorem can be used to find their energy. 


23 Robert Frost, The Road Not Taken, 1916. 


www.ebook3000.com 














514 Chapter 32 



0 30 60 90 120 150 180 

9 [deg] 

Figure 32.20 The final states of electrons A' and B', with energies E' a and E' b . The regions where one of them falls below the Fermi energy it are grayed 
out: such final states are forbidden in a metal, as those states are already occupied. 


cosi/r 
- sinyr 


si n i// 
cos i p 


then the rotation of the coordinate system corresponds to 

R (cp, 0, <p') = R^-q^Ry^R^cp') 


(32.106) 


(32.107) 


(32.108) 


Velocities parallel to u are not affected by a rotation R z (cp'), as no components along x' or y' are present. Hence, the 
angle cp' can be randomly chosen as a rotation about the polar axis. That leaves on it falls the burden of having to be 
compatible with conservation of energy and momentum for the final states or, to ensure that u! ■ v' = 0 and u' 2 + v' 2 = u 2 , 
then 


| sin#'\ 

° 

cos 6'} 

I -cos 

0 (32.109) 

sin#' j 

The values that \u'\ and |c'| can take are shown in the lines of Figure 32.20. 


32.3.7 Allowed final states 

The reason why they could never answer the question "How could it possibly happen?" is that it's the wrong question ... the question is 
“Why doesn't it happen more often?" 

-Woody Allen 24 


4 Woody Allen, Hannah and Her Sisters. New York: Vintage Books, 1987, p. 101. 
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Figure 32.21 Relaxation time components and the sum r via Eq. (32.24) for the dominant scattering mechanisms for diamond. Suffixes of "14" and 
"16" are log 10 JV ; [atoms/cm 3 ]. The ionized impurity relaxation time for N ( = 10 14 atoms/cm 3 is not visible in the range shown. Based on Figure 4 of 
ref. [209] but using the values in Table 32.3. 


Given the large number of electrons in a metal, it might seem that they should scatter incessantly, but their scattering 
is constrained to only be allowed if the final states are allowed; a similar constraint exists for semiconductors (recall 
Section 7.1). For metals, it is a reasonably good approximation that all states with energy below the Fermi energy are 
filled, and therefore neither of the scattered electrons (the primary one and its target) can have them as final states. As a 
consequence, not all of the solutions of Eq. (32.108) are possible. Rather, the "allowed final states" are - for metals - those 
for which E' b > g: the least energetic of the daughter electrons still must possess an energy higher than the Fermi energy. 
Those constraints result in the gray bands of Figure 32.20. 

For a semiconductor with a band gap E g , an analogous situation holds: final states within the band gap are forbidden, 
but with an additional wrinkle in that the target electron must originate in the valence band, but both daughters must 
be in the conduction band. Because the photon energy must exceed a threshold for this to occur, a "magic window" 
develops in which electron-electron scattering is precluded, so that phonon scattering represents the only energy loss 
mechanism. 

Using this understanding reveals that the final direction of the post-scattered daughter electron is not uniformly dis¬ 
tributed in the polar angle 6 (although it is in the azimuthal angle cp). Rather, the final state 0 is obtained by generating 
a random number r and evaluating 0 according to the Conwell-Weisskopf approach of [603] 




rE 2 - E 2 
rE 2 + i?f 


(32.110) 


or the Brooks-Herring approach of 


cos 0 BH 


4rE + (2r - l)E u 
4 rE + E a 


(32.111) 


where E ti is evaluated according to 


E ti 


K s \ 3 


1/3 


(32.112) 


where N~ is the number density of the ionized boron acceptors. For diamond with K s = 5.7, a value of N~ « 10 14 cm -3 
entails that E a = 1.89 meV, so that most scattering events will not change the direction of the primary electron very 
much. Although the Brooks-Herring approach is favored at high energy conditions when the unintentional boron 
doping is small, both Brooks-Herring and Conwell-Weisskopf lead to preferential forward-directed scattering and the 
Conwell-Weisskopf approach has simplicity. 

With the relaxation times in place, the effects of temperature, doping, and flake thickness can be investigated. For sim¬ 
plicity, ignore the band bending that boron doping entails (Section 24.4): although it makes a rather substantial difference 
in losses and transit times [209, 322], the constant-internal-field approximation (F = F vac /K s ) behind the simulations to 
be discussed below represents, as it were, a "best-case scenario" in that transport through the diamond will be at its most 
rapid and expansion of the secondary bunch will be reduced. 
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32.4 Monte Carlo and diamond amplifier response time 

.. .then the Father balanced his golden scales, and in them he set two fateful portions of death, which lays men prostrate, one for Achilleus, 
and one for Hektor, breaker of horses, and balanced it by the middle; and Hecktor's death-day was heavier and dragged downward toward 
death, and Phoibos Apollo forsook him. 

- Homer 25 

In consequential matters, Monte Carlo methods can provide practical answers. One such matter is the the character¬ 
istics of electron bunches created from a diamond flake after a high-energy primary beam is incident on it, particularly 
with regards to its temporal shape. Let the primary beam of electrons be characterized by an energy E 0 . Roughly E 0 /AE 
secondaries are generated, each with an initial energy of about 2.5 E = 13.75 eV (e.g., for E g = 3 keV, then 218 secon¬ 
daries are generated). At such high energies, the internal field across the flake will have little sway over the directions 
the electrons are inclined to go, so that unless the field is particularly strong and the electron penetration range R{E g ) 
rather deep, many of the secondaries will be lost to the back contact before they lose sufficient energy so that the field 
can direct their migration to the vacuum side of the diamond flake in earnest. If doping substantially reduces the field 
at the back contact, the losses can be up to 50%, another reason that the thinness of the flake is a design goal (recombi¬ 
nation effects, whereby electrons join with holes and are lost, can also reduce the number of electrons [209, 589, 630]). 
The back contact can sweep the holes out of the simulation as well, and more complete Monte Carlo simulations which 
include them [588] show this quite well, an example being shown in Figure 32.22, demonstrating that the hole and 
electron populations in a diamond amplifier simulation can separate in a thin diamond layer relatively rapidly (in that 
figure, it occurs in under 3 ps). As a result, for the simple Monte Carlo to be considered below for analyzing how an 
electron bunch exits a thin diamond flake, and thereby estimating its rise/fall time, attention need only be focused on 
the electron population. 

A constant field approximation is adequate when the boron concentration is < 10 14 cm -3 and the flake is under 10 pm. 
With the initial conditions and scattering mechanisms in place, Monte Carlo enables following the electrons' tortuous 
path through a thin diamond layer. Not surprisingly, and as previewed in Figure 32.22, the shape of the bunch expands 


i | 82 fs 

| 800 fs 

1 


II • Electrons || 

|| • Holes | 


| 1830 fs 

i 

| 2860 fs 


Figure 32.22 Electron (blue circles) and holes (red circles) generated by a 2.7 keV primary electron beam incident on a diamond surface from the left. A 
3 MV/m applied field induces the electron cloud to drift away from the surface so that after 3 ps the holes have left the diamond region. Grid lines are 
every 0.5 pm. (Images courtesy of D. Dimitrov (Tech-X), and based on Figure 6 from D.A. Dimitrov, R. Busby et al. Multiscale Three-Dimensional 
Simulations of Charge Gain and Transport in Diamond, J. Appl. Phys., 108, 073712, 2010, with the permission of AIP Publishing). 


’Homer (trans. Richmond Lattimore), Thellliad. Chicago: University of Chicago Press, 1951, Book 22, lines 209-213. 
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Figure 32.23 Side view of the transport of secondaries. Primaries are incident from the bottom and the field pulls them up to the top (vacuum side). 
Observe the apparent expansion of the bunch as it migrates. The numbers at the top give the time of the snapshot (1 ps, 2 ps, 3 ps, and 10 ps from left 
to right). Based on Figure 5(a) of ref. [322]. 



Figure 32.24 Sites of scattering (red dots) and the path of one electron (blue line). The vertical 2 scale is 8 pi, whereas the % and axes are 4 qm. The blue 
hue base is the side from which the primaries are incident; the yellow hue vacuum side is at the top of the figure. Many (red) scattering events occur 
near where the secondaries are generated (as in inset) as they lose their excess energy to optical phonon (emission) scattering. The blue line indicates 
the trajectory quickly becomes diffusive and scattering ever present, which causes the bunch radius to expand. Based on Figure 3(a) of ref. [322]. 


as it is pulled across the flake by the internal field F, further illustrated in Figure 32.23. A measure of the number of 
scattering events, of which there are many, is indicated in two ways in Figure 32.24: first, by representing the location of 
every scattering event by a red dot and, second, by following the path of one electron as it migrates. The distribution of red 
dots reveals a very tight concentration of events near the injection site, where optical phonon emission processes rapidly 
sap the excess energy of the secondaries. When the energy of the electrons has dropped to the point where the internal 
field can begin to exert an inexorable draw to the vacuum side, acoustic phonon scattering dominates and the electron 
bunch acquires a diffusive character, the sphere of electrons expanding as it moves. Even a bunch excited instantaneously 
(a delta-function primary pulse) will expand, and as that diffusive sphere of electrons crosses into vacuum, the size of 
the bunch will affect the apparent rise/fall time of the emission process. A simple model can be made to account for it. 
Monitoring the number of electrons in the simulation is therefore an effective way of estimating losses to the back contact, 
quantifying when the bunch has started its migration across the flake in earnest, and - most importantly - getting a feel 
for how much time the bunch requires to exit the flake. An example is shown in Figure 32.25, wherein 364 electrons 
generated by one primary at an energy E 0 = 5 keV are followed as they migrate through a 4 pm thick diamond flake 
subject to an internal field of F = 34 eV/pm (or £ = 34 MV/m). The regions naturally divide into (gray) losses to the 
back contact, (yellow) transport through diamond flake under field to the surface, and (blue) exit from the flake and the 
characterization of the rize/fall time. 

Imagine that a uniform sphere of charge moves past a boundary at L (the width of the diamond flake), and let the 
center of the sphere be given by z(t) and its radius by r(t). How much of the sphere has passed the boundary z = L is 
indicated by 

'■Sr 


Electrons to the right of s have moved past L; those to the left are still in the diamond flake. When s = 1, the sphere has 
just touched the right boundary, and when s = -1, the entire sphere has passed z = L. The fraction of the sphere that has 
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Figure 32.25 Characterization of the three regions of secondary generation, transport, and emission for E 0 = 5 keV and L = 4 pm in a Monte Carlo 
simulation. Diamond is assumed free of boron impurities. Losses to the back contact result in a decline of the initial number of secondaries (gray). After 
they thermalize, they are pulled across the flake by the internal field (hellow). On encountering the surface, they are emitted (blue). How fast their 
numbers decline characterizes the rise/fall time. 


passed into vacuum is then a summation over disks of area np 2 with p 2 =r 2 - z' 2 and differential thickness dz'. If the 
total charge in the sphere is Q 0 , then the uniform charge density is Q 0 /(4^t 3 /3). The summation over all disks for which 
z' > L — z is then 

AQ(s) =^ij ^ ~ z' 2 )*' = J(2 + *X1 - rfQo (32.114) 

which has the proper behavior: at s = 1, no charge has escaped (Q(s < -1) = 0), when s = 0 half of the charge has escaped 
Q(0) = QJ 2, and when s = -1, the entire sphere has just emerged (Q(s > 1) = Q 0 ). 

Such a model is limited from the outset: a bunch of secondaries are better modeled by a diffusive sphere with a density 
that decreases with radius much like Eq. (31.125). Still, a uniform sphere model provides a rapid and qualitatively useful 
representation, and makes understanding the the speed of the sphere perhaps more intuitive. In a uniform field, how 
fast the sphere moves is related to the saturation velocity v g of the semiconductor. 26 Why the electrons only accelerate to 
a limiting velocity value is heuristically understood to be when an equilibrium has set in between processes whereby 
the electron gains energy (acceleration in a field and optical phonon absorption) and processes whereby energy is lost 
(optical phonon emission). More concretely, an electron will accelerate in the internal field in diamond until it acquires 
enough energy to create a phonon. The mean free path for this to occur will be v 0 r e a , where the subscripts on t denote 
the different processes of absorption and emission. The rate at which the accelerated electrons shed their excess energy 
is therefore the difference between the absorption and emission rates, or 


Fv 0 


(32.115) 


On the other hand, the force on an electron is related to its change in momentum, which is the time between collisions 
(i.e., F = dp/dt « A pfr). For electrons at a mean energy, the change in momentum after an emission is close to the 
negative of the change after an absorption, and so 


(32.116) 


Taking the ratio of these equations, and observing that in Eq. (32.99), because of the behavior of the Bose-Einstein factor 
n{fitia> 0 ), the emission and absorption t are separated in magnitude by an exponential factor e x with x = ptico 0 , 


~ h(0 o ( e*-\ \ 
V ° m n v 0 \e* + 1 / 


(32.117) 


26 The saturation velocity is often designated v s in the sources, but will be called v 0 here so as not to conflate it unintentionally with the sound 
velocity. 
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Table 32.3 Parameters used in the calculation of the relaxation times for high purity 
diamond (after Table 1 of ref. [209]). 


Definition 

Symbol 

Value 

Unit 

Common 

Effective mass 

mjm 

0.57 


Internal field ( q£) 

F 

10 

eV/pm 

Primary energy (F 0 ) 

F 

10 

eV/pm 

Band gap 

£ g 

3 

keV 

Secondary 

Af 

2.5 E g 

eV 

Static dielectric 

K s 

5.7 

- 

Number density 

N 

0.2924 

moles/cm 3 

Mass density 

p 

3.51525 

g/cm 3 

Impurity density 

n a 

10 14 -10 16 

cm- 3 

Boron activation energy 

E b 

0.365 

eV 

Acoustic phonon 

Sound velocity 

v 

0.018 

nm/fs 

Deformation potential 

S 

8.4 

eV 

Debye temperature 

T d 

1860 

K 

Sound velocity 

V s 

0.018 

nm/fs 

Non-polar optical 

Phonon energy 

ha, 

0.163 

eV 

Optical deformation potential 

D a 

y/m(dJL 0 ) 

eV/nm 

Deformation term 

d a 

61.2 

eV 

Lattice constant 

L o 

0.357 

nm 


or, solving for v 0 , 


, fico, / e* _ 1 \ hco, 

v o ~ — v^n) = nr lanh(Jf/2) 


(32.118) 


Showing a greater concern regarding the coefficients, the final form becomes, on replacing x hco 0 /k B T = ptico 0 [294], 


[3 hw 0 n \] l/ 
[ta7 ,anh (2'"’"-)] 


(32.119) 


Making use of Table 32.3 parameters, v 0 = 1.9387 nm/fs. The charge center of the sphere can therefore be approximated 
by the saturation velocity, or z(t) » v 0 t. The current passing the surface boundary is then given by taking the time deriva¬ 
tive acting on z(t) (not r(t), as even though the spherical bunch is expanding, its center of charge remains at the center), 

AZ(s) = —-^Qo (l —s 2 ) 0 (l — s 2 ) (32.120) 


where v 0 /r{f) is a characteristic time that increases the longer the bunch spends in the flake and @(x) is the Heaviside step 
function (unity for positive argument, zero for negative), which insures that the current does not start until the sphere 
reaches the vacuum interface and stops when the sphere has passed into vacuum. 

The bunches of a Monte Carlo simulation of a cluster of frenetically scattering electrons does not resemble a fixed 
density sphere so much as it resembles a diffusively expanding one. Rather than the hard boundary of Figure 32.26, 
the fuzzy sphere of Figure 32.23 is more in keeping with the probable behavior. The behavior of Eq. (32.114) bears a 
similarity to a hyperbolic tangent function, that is 


(32.121) 


fory = 3 s + 0(s 3 ). Thus, there is reason to expect that 


Q(0 « 


Q o 

l+exp[-(f-0/<5t] 


= Q 0 Kt —1 0 ) 


(32.122) 


and in fact, monitoring the time dependence of particles passing into vacuum using a Monte Carlo simulation [209] 
shows the approximation works well, as in Figure 32.27: the surface of hydrogen-terminated diamond is NEA, so that 
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Figure 32.26 Side view of a sphere with uniformly distributed charge crossing the surface of a diamond flake 
into vacuum. AQ(s) is the portion of the sphere that has moved into vacuum, and s is the dimensionless 
measure of the position of the sphere with respect to the thickness L of the diamond flake. After on Figure 
13 in ref. [209]. 
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Figure 32.27 Fits to charge emitted into vacuum (Q(s)) for diamond-like parameters for an internal field F = 10 eV/pm and a flake thickness of 8 pm for 
two different primary energies of 3 and 13.75 keV. For optical phonons, tm 0 = 0.163 eV. Q 0 , t 0 , and St are as occur in Eq. (32.122). The 3 keV line 
corresponds to Figure 14 of ref. [209]. 



there is no barrier to emission and what passes the vacuum boundary therefore contributes to <2(0- The constant internal 
field approximation allows more of the initial secondaries for the higher primary energy E 0 = 13.75 keV to survive than 
for E 0 = 3 keV, as the penetration depth for the former is greater than for the latter, making the barrier to the back contact 
harder to surmount. 

The parameters t 0 and 8t are dependent upon the primary energy E 0 . For E 0 = 3 keV and a flake of thickness L = 8 pm, 
then Q 0 « 116, t 0 = 38 ps, and St = 0.966 ps, whereas for E 0 of 13.75 keV, Q 0 = 963, t 0 = 34.5 ps, and St = 1.62 ps. That 
is, a higher beam energy means fewer electrons are lost to the back contact and the greater penetration of the primaries 
results in a greater spread that affects St. The rapidity with which h(t) changes from near zero to near unity is the meaning 
of the rise/fall time, and St is the parameter which sets it. 

Consider now a string of bunches like pearls on a chain. The charge for a single bunch is as found above, namely, 
AQ(t) = Q g h{J: - t 0 ). The total current is then the sum over the individual bunch currents 

A/(0 = Q 0 jh(t-t 0 ) = Y t h{ -t-to ) [1 " W-Q\ (32.123) 

which, because of the similarity of the h(t) functions in form to the Fermi-Dirac functions, means the form of Eq. (32.123) 
is similar to that of Eq. (7.24) in its peakiness. The total current is the summation over all the spherical elements distributed 
in time, or t 0 -*■ t' and 

m = J P(t , )h(t-f)[l-h(t-t , )]dt' (32.124) 

where the distribution P(f') is normalized. That is, the integration over all time should give the total charge Q tot emitted, 
as easily shown: 
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j i(i)dt =Q ^ J P(t')dt' J Ht-^li-Ht-^dt 

= Q tot J P(t')dt f = Q m (32.125) 

The diffusive expansion of a bunch of electrons in the diamond flake has consequences for how fast current can be 
turned on (rise to its maximum value) and turned off (reduced to a negligible level) in comparison to the duration A t 
of the primary beam. In the simplest case, let P(t) be a top-hat distribution in which the beam of primaries turns on 
at t = —At/2 and turns off at t = Af/2. Pit) can be represented as a product of two Heaviside step functions, or Pit) = 
©(Af/2 + t)S(At/2 - t)/At, where the At in the denominator is from the normalization requirement. When inserted into 
Eq. (32.124) it follows 

Qm f At/2 

m h(t-s)[l-h(t-s)]ds 

StAt J_ At/2 

= ^ [h(t+ At/2)-hit-At/2)] (32.126) 

Because one h function is rising as the other falls, the choice of the integration region insures that the midpoint (at t = 0) 
corresponds to the maximum. Letting t a and t h be such that 


P 


Kt a ) 

7(0)’ 


(32.127) 


then the rise/fall time corresponds to t a - t h depending on the value p takes on (e.g., p = 1%). For small p, a rewriting of 
Eq. (32.126) is useful, using the notation u = exp(f/<5f) and C = exp(At/28f), to render it as 


Ijt) _ ujC+1) 2 

7(0) ~ iu+ l)iC 2 u + 1) 


(32.128) 


from which the limiting cases of pulse length large compared to rise/fall time (At » St) can be contrasted with the reverse 
when the pulse length is smaller than the rise/fall time (At«: St). When C -*■ 1, the primary beam of electrons is deposited 
all at once (At -* 0), but when C -*■ oo, the primary beam is very long compared to the processes that give rise to St. 


EXAMPLE: Find the rise/fall time for the A/8t = 0 line of Figure 32.28, an ad hoc definition being the time separa¬ 
tion between where lit)/1(0) changes from 99% to 1% (e = 0.01). 

SOLUTION: The center of 7(0, or t = 0, is close to where I(t h )/I(G) = 1 - e « 1. Therefore, t b /St « 0. For t a , inserting 
C = 1 into Eq. (32.128) results in, where a = exp(t a /8t). 


Thus, t a -t b K St In (4/e) =5.9915 St. 


4 a 

(a+ l) 2 


EXAMPLE: Find the rise/fall time for the A/St = 32 line of Figure 32.28, an ad hoc definition being the time sepa¬ 
ration between where I(t)/I(0) changes from 99% to 1% (e = 0.01). 

SOLUTION: Introduce a = exp(f fl /<50, h = exp(4/<50, and C = exp[Af/(2<50]. For t b , C » 1, and from Eq. (32.128), 

e « — U< ° 2 = — - *t a &6t In (—-) 

(a+l)C 2 u (fl+1) fl \ e J 

Analogously, for t a , 

1 - e « —--► L « St In ( —-— ) 

(b+ 1) b V 1 — e / 

and so t a - t b « 2 St In [(1 - e)/e] = 9.1902 St. 
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t/8 

Figure 32.28 Response characteristics of 7(7)/7(0) for A = nS with n indicated in the labels and 7(f) as defined in Eq. (32.126). The time is shifted such 
that the center of the pulse is at ( = 0. 


Therefore, regardless of whether the primary pulse is much longer than the characteristic time (At > St) or much shorter 
(At < <5f), the rise/fall time is nevertheless constrained to be several multiples of St. As an example, the rise/fall time for 
E 0 = 3 keV with L = 8 pm (as in the example following Eq. (32.122)), then t a - t b « 2 In (99) x 0.966 ps = 8.877 ps. 

A second analytically tractable model is the Gaussian distribution, P(t) oc exp(-t 2 /2At 2 ). When the characteristic time 
St is much larger than the characteristic time At of the distribution, then because Eq. (32.126) can be well-approximated 
by a Gaussian itself, their joint integration in 7(0 is possible. First attend to the Gaussian parameterization. Let f(x') = 
In [/z(x)(l - h{x))\ = In \h{x)h{-x)\. Performing a Taylor expansion about x = 0 gives 


-- ln(4) -i& 


(32.129) 


When inserted into the defining equation for 7(0 if Eq. (32.124), then 


m = Jitt L J " p [ - 5&] exp [-w] * 


(32.130) 


Using the general integral relation 

y^^^y S) 


(32.131) 


results in 


7(0 = 


2y/lSt 2 + At 2 


exp 


( _ 

V 4(5 f 2 + 2A t 2 J 


(32.132) 


As a result, a small pulse length At of the primary beam increases the width (in time) of the resulting Gaussian pulse. Put 
another way, even if the primary beam of electrons were deposited all at once with no spread in time, the bunch that 
would emerge would have a Gaussian shape as a result of diffusive expansion during transport through the flake. 

Intuitively, St is a measure of how long the diffusion is allowed to take place in the diamond flake and so is governed 
by both the internal field in the diamond, as well as the width of the flake itself. The simple Monte Carlo methods of 
Section 32.2 are enough to make an estimate of the center of charge position z{t) of the bunch as it travels through a layer 
of thickness L, and of its root mean squared (rms) radius r(t) about its center of charge, as the discussion surrounding 
Eq. (32.122) suggests. For a linear field across the flake, a constant saturation velocity and a diffusive expansion imply 
that z(t) ~ t and r(t) ~ yft, behaviors which simulation readily confirm as in Figure 32.29 for two different fields. If the 
current from the flake is to be delivered in short pulses, then the size of r(f) matters, and therefore how long the bunch 
dwells in the flake matters. Therefore, if short sculpted pulses are desired, fast rise/fall times entail that the flake should 
be as thin as practical. 
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Figure 32.29 Symbols, Monte Carlo evaluation of charge center z(f) and rms radius r(f) of a bunch of secondaries transporting through a thin flake 
under field for diamond-like parameters. Lines, equations show least squares fits to the forms of z(f) « v 0 t and r(t) = a\ft, based on a constant drift 
velocity (for z) or diffusion (for r). Red, F = 10 eV/pm; blue, F = 1 eV/pm. Based on Figure 10 in ref. [209]. 
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Figure 32.30 Rise/fall time t ab variation with flake thickness (calibrated by t 0 = 8.877 ps when L 0 = 8 pm) for L = 2" pm, as per Eq. (32.133). Also 
shown is a depletion width for boron concentrations of 10 14 cm -3 and typical fields. Based on Figure 16 in ref. [209]. 

The rise/fall time t ab = t a — t b can now be related to the thickness of the diamond flake itself. If the radius of the bunch 
does not change appreciably as the sphere passes into vacuum, then t ab should behave something like t ab oc r(t)/v 0 . The 
time for the sphere to cross the thickness L of the flake is f« L/v 0 , so that t ab oc \[h. As a result. 



(32.133) 


for t 0 = 8.877 ps and L 0 = 8 pm. The relationship between rise/fall time and flake thickness is indicated in Figure 32.30, 
where it is compared to a typical depletion width w = 55.26 pm for a boron concentration of 10 14 cm -3 (recall Eq. (16.4)). 
The boron concentration matters more than has been let on: the internal fields in the diamond flake are not linear as 
so far assumed: rather, the charged impurities tend to shield the back contact and retard the acceleration of the newly 
minted secondaries, contributing to losses and spreading the beam further [322]. Still, if performance is below par in the 
linear field model, matters are only going to get worse when doping (intentional or not) is considered, so the models are 
a best-case scenario. 

An obsession over rise/fall time compared to other features of emitted bunches of electrons is seemingly dispropor¬ 
tionate to its importance, but there are reasons for the interest. The diamond amplifier is an innovative solution to 
hard-to-meet demands placed on the electron sources for FELs and advanced light sources. The performance of an FEL 
depends on the charge within a bunch, and how well its spreading can be constrained, so rise/fall times matter. Spreading 
is related to emittance: high current densities from the cathode are demanded, and the very strong fields at the surface 
of a photocathode are to counter space-charge forces that exist in the bunch in an effort to keep spatial features of a 
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character to meet stringent FEL requirements. Bunch shape therefore matters, with a top-hat profile in particular being 
preferable to Gaussian or rounded shapes for emittance reduction [593, 631, 632]. The European X-ray FEL, for example, 
uses a trapezoidal pulse 20 ps long with a 4 ps rise/fall time [595], with other FELs seeking similar shapes and rise/fall 
times [633]. Therefore, usage of a diamond amplifier to provide a secondary electron emission boost to the performance 
of a photocathode, for example, entails more than just raw current or bunch charge needs to be practical. 



CHAPTER 33 

Electron beam physics 


His greatness weighed, his will is not his own. 

For he himself is subject to his birth. 

- William Shakespeare 1 

Liberating electrons from their bondage is not the end of the tale: the variety of electron sources speaks to the many 
technologies that use them, and so the nature of the beams produced becomes a deciding factor in their utility. The 
applications of radar, communications, and electronic countermeasures (Chapter 1) rely on the production of electro¬ 
magnetic radiation with wavelengths in the range of meters to millimeters [429] and shorter (terahertz and beyond) [13, 
634], Others such as scanning and tunneling electron microscopy [162], electron beam lithography [16], high power 
microwave [635] and directed energy [591], energy conversion and optoelectronics [636], and particle accelerator/ad¬ 
vanced light source and X-ray free electron lasers (FELs) [523] are no less exacting in the properties of the electron beam 
even as the needs of each application are phenomenally different in terms of bunch charge, pulse duration, response 
time, and beam spread - all of which will be recast in the language of current density, emittance, and brightness. 
How the beam is born figures strongly in what the beam does, and describing that is a hugely complex undertaking 
[148, 637, 638], 

A few simple considerations (such as the beam has cylindrical symmetry, and the magnetic field is static) do not 
undercut an appreciation of what awaits. Feral electrons are typically not well behaved: they flee in directions often not 
normal to the surface (leading to emittance), and they generally try to get away from each other (leading to space-charge 
effects). To corral the disbanding electrons, emission is often into a magnetic field region so that the transverse velocity 
components of the electron to the direction of the magnetic field B simply causes them to spiral about the field lines. 
Thus, if an electron with a velocity v is traveling through a region with a magnetic field B = Bz, then the force equation 
mr = F = —qv x B will describe their subsequent motion. By considering motion in a region without an electric field, the 
energy of the electron is not changing. By conservation of angular momentum, the angular velocity v g = pQ = pw 0 is 
likewise constant. So, setting the centrifugal force to the magnetic force, 

m— — mpa 2 = qpa> 0 B (33.1) 

from which the cyclotron resonance frequency co 0 is defined by 



which depends only on the magnitude of the magnetic field B, not the kinetic energy of the electron. As electrons are 
borne from the surface with a distribution of transverse velocities v e , 2 this means that all of them gyrate about the 
magnetic field lines in orbits with the same frequency co 0 , the radii of which are also distributed because v g = pco g . 


EXAMPLE: Find the radius p 0 of an electron for which the transverse kinetic energy (1/2 )m{v 2 + v 2 ) = mv 2 is equal 
to the average thermal energy k B T (Eq. (3.14)). Let the magnetic field B = 0.7 Tesla (a representative value for FELs 
[639]), and let the temperature of the thermionic emitter be T = 1400 K. 


^ef. [37]: Hamlet, I.iii.17-18. 

2 An angular velocity is not associated with a change in energy whereas a radial velocity is, so when electrons are not being accelerated use v e 
instead of i> ± . 


Introduction to the Physics of Electron Emission, First Edition. Kevin L. Jensen. 
© 2018 John Wiley & Sons Ltd. Published 2018 by John Wiley & Sons Ltd. 
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SOLUTION: Evaluate qcB = 0.20985 eV/nm and k B T = 0.12064 eV. Relating p 0 to the cyclotron resonance 
frequency m g from Eq. (33.2), then 


Pa 


1.1832 pm 


As the last example shows, the radii of the orbits compared to the radius of a standard thermionic dispenser cathode, 
with a diameter from millimeters to centimeters, is several orders of magnitude smaller. In simulation, the launch of a 
"cold beam" that neglects the initial transverse velocity components of the electrons [640] is easier and often sufficiently 
accurate for linear beam devices with large beam tunnels. When the emission area is very small, or when the beam tunnel 
is very small, neglecting the traverse velocity components during the launching of the electron beam can undermine 
the simulation. When high current density inside beam tunnels is desired, the problem is compounded. So as to not 
limit thermal cathode lifetimes by operating them at excessive temperatures, a substantial compression of the beam 
from the cross-section of the cathode to the cross-section of the beam tunnel is performed, for example using a Pierce 
gun [12, 638, 641]. Indeed, the high current density of field emitter arrays coupled with a rapid modulation capability 
motivated their consideration [480, 492, 642, 643] and demonstration [483] in inductive output amplifiers (IOAs). As 
the transverse velocity components of the beam do not go away, simulations of very small beam tunnels see their effects 
[644]. Why transverse velocity components matter will lead to the development of a beam envelope equation and a 
consideration of the effects of emittance. 


33.1 Electron orbits and cathode area 

“What is a Caucus-race?" said Alice; not that she wanted much to know... “Why," said the DoDo, “the best way to explain it is to do 
it ."... First, it marked out a race-course, in a sort of circle (“the exact shape doesn't matter," it said), and then all the party were placed 
on the course, here and there. There was no “One, two, three, and away," but they began running when they liked, and left off when 
they liked, so that it was not easy to know when the race was over. 

- Lewis Carroll 3 

As a result of the fact that the angular frequency of the electron's orbit about the field line depends only on the magnetic 
field, all the electrons return to their starting position in the x-y plane at the same time (but not along the z axis). If the 
electrons are emitted from the surface of a flat cathode, then those starting positions are distributed across the emitting 
area, with the initial transverse velocities randomly directed: when the electrons move away from the cathode surface, 
some will spiral to the center, but others will spiral away. Letting the radius of the electron orbit be p 0 , and the radius of 
the cathode area be R, then those orbits can be examined for various values of C = p 0 /R. 

If the emitted electrons are Maxwell-Boltzmann (MB) distributed, then a rather elegant method for calculating the 
starting transverse velocity components v x and v y all at once is possible. Recall from Eq. (3.12) (or Eq. (5.2), which is a 
variation) that for MB distributions in velocity, 

= (33.3) 

where z is reserved for the direction of the B field. This allows for the creation of an MB distribution of transverse velocities 
in a manner different than the algorithm of Section 5.1 when an MB distribution for one component of the velocity, v x , 
was generated. First, distributions in v 2 are required, for which an MB distribution is of the form e~^ F for E = (1/2 )mv 2 , 
meaning that something like Eq. (32.22) can be employed. The other requirement is that v 2 + v 2 = v 2 (where v 0 stands in 
for either v L or v g ), and therefore appending trigonometric functions to the random v x and v y once v 0 is obtained satisfies 
that condition. Thus, to generate v x and v y , a pair of numbers n l and n 2 is generated using two random numbers r, and 
r 2 , from which 

n 1 = y/-2 ln(ri) cos(2^t 2 ) 

n 2 = V-2 !n(r,) sin(2?rr 2 ) (33.4) 


3 Lewis Carroll, Alice's Adventures in Wonderland. Racine, Wis.: Whitman Publishing Company, 1945, Chapter III. 
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If v 0 is a characteristic velocity (say, v 0 = y/k B T/m, where the usual factor of 2 is missing because v 2 0 = v 2 x + v 2 ), then 
v x = v 0 n 1 and v y = v 0 n 2 . To take the next step and make v 0 correspond to v 0 takes a bit more. Because x = p cos 9 and 
y = p sin# then (letting the dot notation denote time derivative) 


v x = x=v p cos 9 - v e sin 9 

v y =y = v p sin9 + v e cos 9 (33.5) 


where p = v p and p9 = v e . Knowing v = + v y y and 6 = -x sin 0 + y cos 6 gives (ignoring any consideration of v z for the 

moment) 

v e = 9-v =-v x s\n9 + v y cos9 (33.6) 


In order for v 0 of Eq. (33.4) to correspond to v e then the identification 9 = lnn 2 + {n/2) is required to insure 9 ■ v 0 = 0. 4 
In short, then, for two random numbers r, and r 2 . 


l ’* = (~»r) V -2 ln(ri) cos (lictj + ^ 

"»=(¥) V-2 1n(r,)sin(2*r 2 + |) 


(33.7) 

(33.8) 


Assuming that the launch sites on a cathode of radius R are randomly placed, and the transverse velocity v e is MB 
distributed, then the spiraling electrons overlaid on the cathode area resemble Figure 33.1 for various values of C = p 0 /R, 
as calculated using the algorithm of Section A3.33: the starting points relative to the cathode area are the same in each 
sub-figure. The radius of the orbits need not be that much smaller than the cathode radius before significant excursions 
by spiraling electrons are noticeable beyond the radius of the cathode 

The starting points characterized by small black dots in Figure 33.2 are special: after a time t = 2n/co 0 all of the electrons 
regardless of their velocity v e return to those dots. As a result, the size of the circle enclosing all the trajectories oscillates 
periodically in time with a frequency co 0 . If the radius of the cathode is now the initial radius of the beam R-* R(t= 0), 
then the radius of the beam follows a harmonic oscillation. It is often desirable to bring the energy of the electrons in 
a beam up to their maximum energy as rapidly as possible to minimize longitudinal spread: when done, v z /c = /? is not 
negligible. The location of the electrons along z is a measure of how much time has passed, and so the time derivative of 
the harmonic oscillations that R(t) undergoes can be replaced instead by derivatives with respect to z via d/dt = v z d/dz. 
Because the total velocity is dominated by v z the paraxial equation [638] can be obtained. In the absence of effects that 
bedevil the final form of that equation, the harmonic oscillations of the radius of the beam that are made apparent by 



Velocity [au] 

Figure 33.1 An MB of transverse velocities v e evaluated using the algorithm of Section A3.33. The velocity is in units of yJk B T/m. The total number of 
particles is 256. From these velocities, the orbital radii of Figure 33.1 are calculated according to v e = pa> 0 . 


4 Were v 0 identified with v p , then 8 = 2im 2 directly, as can be confirmed by p • v. 
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Figure 33.2 Starting points (black dots) of 256 emitted electrons from a cathode area given by the red circle. Magnetic field is normal to the plane of the 
figure. Placement is random but the starting points are the same in each quadrant. The blue circles represent the trajectories the electrons take over one 
cycle. The figure is generated using the algorithm of Section A3.33 and labeled by pjR (lower left corner), where p 0 is the initial radius for the mean 
velocity (Figure 33.1) and R is the radius of the emitting surface. 


spiraling orbits shown in Figure 33.2 


described by 


d 2 p i ( & ' 

dz 2 \ 2 Pymc _ 


(33.9) 


where B is magnetic field and w 0 from Eq. 33.2 has been dressed with the relativistic factors p = v z /c and y = 1 /y/l - p 2 . 
Electrons accelerated rapidly to high energies make v z a fraction, and sometimes a large fraction, of the speed of light, 
for which the closeness of v z to the overall velocity magnitude v makes using v in evaluating p and y expedient. 


33.2 Beam envelope equation 

Fain would I run into the crowd for shelter and warmth; hut cannot prevail with myself to mix with such deformity. I call upon others 
to join me, in order to make a company apart; hut no one will hearken me. 

- David Hume 5 

As soon as the beam is born, however, the electrons inside it agitate to get away from each other due to their mutual 
repulsion, and so a space-charge modification must be added. A single particle gives rise to a 1 /r 2 force, as is well known; 
almost as well-known is that a line of charge gives rise to a 1/p force [99]. A fast way to such conclusions is to use the 
Gaussian pillbox argument [99] familiar from Eq. (16.35): for a sphere of radius r surrounding a point charge of charge 
q, then F is in the direction of r, and so equating the integrated force dotted to the surface normal over the area of the 
sphere, or f F ■ dS = q 2 /e a , gives 

, 4Q 

Akt 2 F= \6jcQ^F = (33.10) 

For a beam, the surface is a cylinder of length / (where X is the charge per unit length) and radius p which contains a 
charge XI, and so 

2nplF = \6nQ- F= — (33.11) 

q qp 


’David Hume, A Treatise of Human Nature, 2nd edn. Oxford: Oxford University Press, 1978, Book I, Part TV, Section VII, p. 264. 
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The charge in the beam, however, is moving, and so relativistic factors enter it, too. A careful derivation [638] results in 
the ratio of the beam current I a to the Alfven-Lawson current defined by 


(33.12) 


where the second form conjoins the fundamental time unit (c/a 0 = 1.76515 x 10 -4 fs), with a ratio of the energy scales 
associated with the electron (ratio of the potential energy 4 Q/a 0 of an electron in the hydrogen atom with the electron 
rest energy me 2 , or 4 Q/a 0 mc 2 = 1/a 2 ) as in Section 2.3. 


EXAMPLE: Evaluate the Alfven-Lawson current I 0 
SOLUTION: Using the second of the forms of Eq. (33.12) 


t _ 511000 eV/c 2 

0 ~ ^4(0.35999 eV-nm) 


= 1.0639 


lO 8 ^- = 17045 A 
fs 


where 1 A = 6241.5 q/ts from Table 2.2. Using the third of the forms of Eq. (33.12) is perhaps easier and involves the 
characteristic charge (q) and time (St = a 0 /c = 1.7651 x 10 -4 fs) of the hydrogen atom (Section 2.2.1), or q/a 2 St = 
17.045 kA. 


Electrons are ejected with transverse velocity as soon as they exit the surface, which cannot be corrected and that leads 
to the intrinsic emittance of the cathode: no amount of focusing or beam manipulation (aka beam optics) after emission 
will undo what the statistical nature of emission does to mess up a beam, in contrast to other causes of oscillations in 
the beam radius from which recovery is possible [645]. Emittance causes the cross-sectional area of the beam to seem to 
swell in the beam tunnel, as in Figure 33.2. More (much more) will be said or can be read [638] on emittance e, but for 
now it is sufficient to note that incorporating the effects of emittance in the beam envelope equation along with space 
charge results in [646]-[649] 


(33.13) 


where the factors after the gradient term d 2 R/dz 2 are the magnetic term, the space-charge term, and the emittance term, 
respectively: that they keep company thusly indicates their linked nature, as in Figure 33.3, in affecting the overall beam 
quality and therefore device operation. 

For electrons sprinting along at a good fraction of the speed of light (and even electrons at 1000 eV do so, with 
v/c = 0.063), the transverse velocity components are overshadowed (an electron at k B T for T = 1400 K has v/c = 6.9 x 
10 -4 ). It is therefore convenient to take v z « v. The relativistic factors p = v/c and y = 1 /\/\ - P 2 are then related to the 
acceleration potential V b = q<p b that brings the beam up to speed by 


ymc 2 = me 2 + V b 

Big as V b may be, it is generally less than me 2 , and so 



(33.14) 


(33.15) 


As a result, Py « \j2V b /mc 2 . 

Now return to Eq. (33.13): were there no space-charge or emittance effects, then the equation is of the form d 2 R + 
co 2 R = 0, indicating that R oscillates harmonically. On the other hand, as the combination of space-charge forces and 
emittance effects increases, at some point those terms match and cancel the co 2 term, resulting in d 2 R = 0. Such a matching 
is known as Brillouinflow and results in a smooth tapered beam entering the beam tunnel (although other physical effects 
rapidly complicate such ideal conditions [650]). As the interaction between the electrons in the beam and the circuit, be 
it a traveling wave tube (TWT), klystron, FEL, or other linear beam device, is strongest for electrons closest to the beam 
tunnel walls, this is good: the absence of oscillations keeps the electrons where they should be if they can be brought and 
kept there in the first place. In some devices, in fact, annular (or hollow) beams are advantageous [651, 652]. Electron 
beams are affected by the countervailing presence of ions [641] but that complication is ignored. Setting the sum of 
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beam quality, i 


nsverse velocity components of electrons and causes spreading. Bottom, how space charge, halo, and e: 
ith double arrows indicating feedback (based on a figure by P.G. O'Shea). 


e interact and affect 


the magnetic, space-charge, and emittance terms in Eq. (33.13) to zero, the factors can be regrouped and rearranged to 
give [451] 

4 _ 4 ( 2v t\ 1/2 (<lB\ 2 f 8mV b e 2 \ 
kR 2 8n\mc 2 ) \mcj \ q 2 B 2 R 4 J 


(33.16) 


where the f)y ss yJjVjmc 2 is used. The left-hand side is the current in the beam tunnel over the area of the beam, that 
is, the current density of the beam. The factor in curly brackets is therefore the degree to which emittance reduces the current 
density of the beam for Brillouin flow conditions, that is. 


Jbeam^) = Jbeam^){l-S(e)} 


(33.17) 


where the fractional reduction in emittance is characterized by the factor 


' q 2 B 2 R 4 


(33.18) 


For concreteness, consider a particular class of vacuum electronic devices used for microwave amplification and what 
constrains the push of the technology into the terahertz regime. Generally, the frequency of the device scales with the 
inverse radius of the beam tunnel. The quest for higher frequency therefore entails both smaller beam tunnels and higher 
current densities in those tunnels (Chapter 1 and Table 33.1): as the frequency range of the devices is pushed higher, the 


Table 33.1 Maximum tolerable emittance (in units of [mm-mrad]) such that 5 < 0.01 in Eq. (33.20). Pierce-type and magnetron 
injection are gun types described in Appendices A1 and A2 of ref. [638], Others are for generic cases in the literature. 


Type 

/[A] 

R [cm] 

V„ [q kV] 

B[T] 

f [GHz] 

Max. e 

Pierce-type [638] 

0.25 

0.6 

5.0 

0.01 

1-4 

75.3 

Magnetron [638] 

480.0 

0.6 

500 

0.08 

2-4 

49.5 

Gyrotron 

0.05 

0.344 

15.0 

9.7 

263 

13796 

S-band MBK 

32.0 

1.3 

0.53 

0.22 

2-4 

23940 

Generic TWT 

0.288 

0.1 

10.0 

0.09 

1-4 

13.2 

Generic TWT 

1.138 

0.128 

25.0 

0.09 

1-4 

13.6 

CCTWT [656] 

0.5 

0.0559 

19.2 

0.8 

27-40 

26.5 

Serpentine TWT [655] 

0.12 

0.019 

11.7 

0.3 

220 

1.48 
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maximum tolerable emittance goes from well in excess of what thermionic cathodes actually produce to a range far more 
challenging. If the maximum magnetic B field in a compact configuration achievable with permanent magnets is 1 Tesla, 
and V b is kept comparable to 20 kV to keep the device small while avoiding problems associated with voltage insulation 
[13], then the maximum achievable current density in the beam tunnel is limited by the emittance of the beam for a 
given beam tunnel radius. Noting that ( qcB ) has units of [eV/nm], let 


so that 



(33.19) 


(33.20) 


which reinforces that e has units of length. The actual units of emittance are [radians x length]: sometimes [mm-mrad] 
with mrad denoting milli-radians are used, sometimes [pm] is used. In the early literature, a factor of n would appear to 
reinforce the radians unit. The convention employed is not always obvious. Here, the unit is chosen to be [mm-mrad]. 
Clearly, then, emittance is a problem when e « (R 2 /R 0 )y/s for a particular allowable <5. For example purposes, taking <5 
to be approximately 1 % is agreeable to the human penchant for choosing pleasing magnitudes bereft of significance (it 
makes calculations easy), but actual emittance limits are more circumspect: given the conjunction of emittance with space 
charge in ejecting electrons from the core part of the beam (halo), and the damage that halo causes [653, 654], acceptable 
ranges are often much more discriminating. The beam tunnel diameters of the last row of Table 33.1, described in ref. 
[655], are comparable to 150 pm, with tunnels less than 100 pm anticipated in the future. For tunnels of comparable 
dimensions, emittance matters. 


EXAMPLE: Find R 0 assuming B = 1 Tesla and V b = 20 keV. From it, determine the maximum tolerable emittance 
if J beam (e) is 99% of the emittance-free J beam ( 0) (8 = 0.01) for a beam required to be R = 0.1 cm. 

SOLUTION: Using qcB = 0.29979 eV/nm in Eq. (33.19) gives 


R 0 




10 4 eV)(5.11 X 10 6 eV)] 1/2 
0.29979 eV/nm 


= 0.095378 cm 


Consequently, e < y/8R 2 /R 0 = 104.85 mm-mrad is required. 


EXAMPLE: Consider a high-frequency serpentine TWT [655] or an extended interaction Klystron [644], for which 
the beam current is on the order of 100 mA, the beam tunnel current density is desired to be 100-225 A/cm 2 , the 
acceleration potential is 25 kV, the magnetic field is 0.3 Tesla, and the beam tunnel radius is 130 pm [451], Find the 
maximum emittance if S < 1%. 

SOLUTION: Using Bqc = 0.089938 eV/nm, then 


or e < 0.47545 pm. 


(0.089938 eV/nm) 2 
(25 keV)(0.511 MeV) 


For FElLs [657], the laser beam optical mode in the wiggler constrains the electron beam to fit within it for the laser 
threshold to be reached. If the radius of the mode is w, the divergence angle is 0, and the wavelength of the FEL is X, 
then the waist-divergence product has to be such that wO = X/n. Because emittance describes the spread of the beam, 
then [633] 

(33.21) 
4 n 

The factor of 4 has to do with how emittance has been historically defined, and the factor of k has to do with how 
emittance is defined in phase space (Section 33.3 and refs [658, 659]). Sources with emittances on the order of a micron 
are objectives, and in the case of an X-ray FEL [20, 524, 660], even smaller [523, 661], 
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Figure 33.4 Modeling the compression of an electron beam from a curved thermal emitter into a beam tunnel for a Pierce-type diode using the 
PIC-code MICHELLE, similar to that considered in Ref. [583]. Electron trajectories are colored bright green: the apparent solidity of the beam is a 
consequence of many of them together. Sky-blue dot indicates the region of Fig. 33.5. Image and simulation courtesy of J. J. Petiilo (Leidos). 



Figure 33.5 Launch sites at the edge of the cathode (sky-blue circle region of Figure 33.4). Left, cold launch in which trajectories are normal to the 
surface of the cathode (no transverse velocity). Right, hot launch in which a distribution of transverse velocities is applied to the emitted electron at 
launch. Images courtesy of J. Petiilo (Leidos). 


A recitation of numbers and equations, though, is not nearly as effective as conveying the consequences of emittance as 
a numerical simulation of an electron beam. Launching electrons from a cathode, compressing the beam to the diameter 
of the beam tunnel, and following the bunches down the length of the device is the ambition of many codes, for example 
MAGIC, PARMELA, elegant, IMPACT, WARP, Neptune, Vorpal and MICHELLE, and others [588, 640, 656, 662-668] 
are the basis of studies of beam dynamics. Particle-in-cell (PIC) codes are a particular kind useful to model amplifiers 
and accelerators. Consider, therefore, a simulation to investigate the consequences of including intrinsic emittance using 
MICHELLE [640] for a Pierce-type diode employing a thermal cathode [583] in which the beam is compressed to the size 
of the beam tunnel, as in Figure 33.4. 

The simulation takes the gun to be at 5 kV with a thermal dispenser cathode (analogous to that shown in Figure 12.1) 
with a radius of 0.25 mm and producing 100 mA at a temperature of 1500 K. In a "cold launch," electrons are born 
with no transverse velocity components, so that their trajectories are normal to the cathode surface, whereas in a "hot 
launch," electrons are randomly given a transverse velocity component comparable to (1/2 )mv 2 x ~ k P T » 0.086 eY, but 
otherwise the simulation methodologies of the cold and hot launches are the same. The electrons are followed to the 
entrance of the beam tunnel: the effects of transverse velocity are clearly seen in the larger diameter of the beam, where 
the beam tunnels have been oriented so as to face each other with the cold launch on the left and the hot launch on the 
right of Figure 33.6. Ad hoc parameters of B = 0.7 Tesla, e = 45 mm-mrad, and V b = 5 q kV = 5 keV, then 8(e) = 0.24 are 
used. The increase in the beam radius is visually apparent. 
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Figure 33.6 Trajectories of the electrons from the cathode surface for the cold (left) and hot (right) conditions of Figure 33.5. Observe the larger beam 
diameter for the hot launch conditions. Images courtesy of J. Petillo (Leidos). 


33.3 Emittance for flat and uniform surfaces 

Each point on the surface of the source is emitting particles with different initial magnitude and direction of the velocity vector. This 
intrinsic thermal velocity spread remains present in the beam at any distance downstream from the source... The emittance provides 
a quantitative basis, or a figure of merit, for describing the quality of the beam. 

-Martin Reiser [638], p. 56 

...but as for fate, I think that no man yet has escaped it once it has taken its first form, neither brave man nor coward. 

- Homer 6 

Emittance was previously characterized (Chapter 31) as a tendency of a beam to spread as it propagates. Although 
beams can be focused, the initial emittance of the beam, or the intrinsic emittance, cannot be undone in the same way 
that beam optics can undo spreading, and so the emittance generated at the cathode becomes important because it can be 
limiting. In thinking of emittance, keeping an optical analogy in mind is useful. The difference between the emittance of 
a flashlight and that of a laser pointer is instructive: flashlight beams spread out, but laser beams stay tightly collimated. 
Flashlights are said to have a large emittance and lasers a small one. 

To be precise, spreading is how the transverse extent of the beam changes as the beam moves along the z axis. The 
transverse axis can be either x or y, though the emittance of each is the same if the beam is rotationally symmetric so 
that mean values of x 2 and y 2 or k 2 and k 2 , are particularly simple. For ease of discussion, then, assume a rotationally 
symmetric beam and speak of the x emittance in place of both. Emittance should give a measure of x' = dx/dz in addition 
to x for each electron in the beam. The empirical definition of emittance is then the area of the shape that encloses all of 
the (x,x') pairs for a beam. 

Difficulties arise almost immediately in such a definition. For electrons accelerated in fields, for example, two problems 
are easy to see. First, all the points are distributed and therefore the area is difficult to quantify: should it enclose most 
of the (x, x') points, or include the odd outliers? Second, the points can be so distributed that two beams may seem to 
have comparable areas enclosing the (x, x') points, but the one exhibiting more painful contortions to enclose the points 
is difficult to work with. That is, as expressed by J. Buon [669], 

The usual definition of emittance as the area of a limiting contour suffers not only from the arbitrariness of the chosen contour, but also 
from its distortion when time is running. It may become so twisted and bent that its area is no more representative of the spread of the 
particles. 


6 Homer, The Iliad (trans. Richmond Lattimore). Chicago: University of Chicago Press, 1962, Book 6, lines 488-489. 
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A working definition of emittance should account for whatever torture of the shape enclosing the (x, x') points is in store. 
Lastly, as the electrons become relativistic, their spread in velocity is such that emittance appears to decrease markedly 
as v -*■ c for all the electrons in the bunch: an emittance that changes thus is not as useful a metric of beam quality as one 
that describes the beam over its evolution consistently, that is, a metric that remains invariant as the beam accelerates. 


33.3.1 Liouville's theorem and normalized rms emittance 


.. .Epaminondas demonstrated that the phalanx system could he adapted to achieve decisive tactical manoeuvre in the face of the enemy. 

At Leuctra, outnumbered 11,000 to 6000, he quadrupled the strength of his left wing and, masking his weakness on the right, led his 
massed column in a charge... the Spartans failed to reinforce the threatened section in time and were broken, for considerable loss to 
themselves and almost none to the Thebans. 

- John Keegan 7 

Phalanx warfare in 371 BC, when the battle of Leuctra was fought, normally proceeded by spreading the phalanx 
evenly across the boundary of engagement. By deceptively massing his outnumbered troops on one side, Epaminondas 
was able to overwhelmed the Spartans on the left wing and thereby carry the battle, as the losses inflicted on the side 
that broke and ran in phalanx engagements tended to be disproportionately heavy. The effective deployment of men on 
a battlefield, or arrangement of electrons in phase space, affects their utility, particularly if the manner of their grouping 
makes the phase space area seem more foreboding than it is. 

The grouping of electrons, as it were, has the added complication of keeping track of the velocity, and because visualiza¬ 
tion is perplexing on first encounter, consider a rather trivial and therefore artificial example, but one which nevertheless 
catches the spirit of the difficulties. Suppose that the distribution of (x,x') points is such that the initial distribution of 
phase space points is enclosed by a square region of size LL'. Consider linear forces first, so that the x coordinate x(f) of 
an electron can be described by 



letting a = F/m be only in the x direction. If there is no acceleration in the z direction, then x' = dx/dz = {dx/dt)/{dz/dt) = 
v x (t)/v z with v x (t) = v 0 + at (this is a highly artificial condition simply to illustrate a point; in real linear beam devices, the 
acceleration that matters is in the z direction, but that is for later). 

In fact, suppose that there is an N x N array of electrons as in Figure 33.7, positioned such that all in the same row 
have the same velocity, and all in the same column have the same starting position, as in Figure 33.8: the total area is 
then found by summing over all the small phase space rhombus areas dA where 

dA(t) = Ax(t)Ax'(0 (33.23) 
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Figure 33.7 Initial positions (blue diamonds) of an IV x N array of electrons in x-x! phase space, and their position at a time (later, assuming motion in a 
constant force F = Fx environment: there is no acceleration in the z direction, so that dx/dz - v x (t)/v z . Units are such that the initial x and x' positions 
are integers, and a = 2.5. The area of the red parallelogram is the same as the area of the blue rectangle. 


7 John Keegan, A History of Warfare. New York: Vintage Books, 1994, p. 258. 
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Figure 33.8 Evolution of phase space points from Figure 33.7 in close up: blue are at time = 0, red are at time = t, with the index i denoting row 
element and j denoting column element. The area defined by the four points is dA(0) and dA(t) for blue and red, respectively. 


For the (zj)th point, with i denoting position in row and j denoting position 
the ;th row equal to Vj(0), 

Xj(t) = Xj(0) + vfG)t + ^at 2 
x'it) = j- (vj( 0) + at) 

and so 

dA i} (t) = [x i+l (t) -x t (f)] \x' ]+l (t) - *'(f)] 

= [x !+1 (0) -x,(0)j [x' + ,(0) - x'(0)] = dAy(0) (33.26) 

because of the cancelation of the f-dependent terms. In other words, the area of the phase space element is unchanging 
(recall Eq. (7.15)), and so the sum of all the small dA areas to get the area of the beam in phase space does not change 
either. Another way of saying this is that as time progresses, the phase space distribution f(x,x', 0) maps on to a new dis¬ 
tribution t) and the mapping, being governed by a Hamiltonian, occurs by equations of motion that leave volumes 
in phase space invariant. Such is the essence of Liouville's theorem. 

What about when the distribution of phase space points gets, in the words of Buon, "twisted and bent," and how does 
that complicate the empirical definition of emittance as an area that encloses the phase space points? Consider just one 
row of Figure 33.7, but let the force acting on the particles be something more complicated, for example one in which 
the force is a function of position. The twisting that may result might resemble something like Figure 33.9. Although an 
5-like enclosing area can be found for the red points at a later time t compared to the straight line on which the blue 
points lie at time 0 (for which the area of the enclosing ellipse is vanishingly small), as a practical consideration as far as 
the utility of the resulting beam is concerned, the enclosing shape is better represented as a fat ellipse: the effective area 


in column, and all the initial velocities in 

(33.24) 

(33.25) 



Figure 33.9 Evolution of a line of phase space points with very little phase space area to a contorted shape that requir 
points, with dashed green lines showing the mapping. Even though a boundary that hugs the shape of the S-like curt 
ellipse better entails the difficulties of dealing with the resulting beam. 


s a larger ellipse to enclose the 
: would have the same area, the 
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Figure 33.10 Position (blue dots) and transverse velocity (red arrows) of electrons. The length of 
the red arrows corresponds to the magnitude | | and the arrow indicates the direction. The data 

for the representation is recast in Figure 33.11. The green circle shows the boundary of the beam. 


in phase space has increased so that emittance (like the lop-sided phalanx of Epaminondas) can appear as though it were 
larger than it is. A good definition of emittance should take that behavior into account. 

A definition of emittance that computationally captures the impact of twisted and bent distributions is to use rms 
values of x and x'. It is often represented as [638, 646, 658, 670, 671] 

e=sj{x 2 )(x' 2 )-{xx') 2 (33.27) 


Sometimes the definition has a An out front, sometimes just a 4, depending on the ulterior motives of the author. Such 
factors appear in the trace space definition of emittance, but the factor of 4 is eliminated when considering the rms form 
of Eq. (33.27) (the treatment by Reiser [638] of trace space, rms factors, and invariance is invaluable). Although this 
captures the effects of twisted distributions, it does not (yet) account for the shrinking of emittance as the beam is 
accelerated in the z direction. Therefore, return to the small black dots of Figure 33.2 representing the starting points of 
the cyclical electron motion on top of a circle the size of which gives an indication of the magnitude of the transverse 
velocity v x . Now let the black dots show the starting point and a red vector indicate the direction and magnitude of v ± , as 
in Figure 33.10: such a representation captures four dimensions (x,y, v x , v y ) of the six-dimensional distribution function 
f(r,v) encountered in Section 5.1. A representation more in keeping with (x, x') is to use one axis for position and another 
for velocity: such slices of phase space are shown in Figure 33.11. The shape and size of the area enclosing the points 
representing the particles in the beam therefore gives a measure of emittance. Still, v x and x! differ. 

Taking a cue from Eq. (33.25) relating x' to v x , make the replacement of 


Ax _ _ ££ 

Az “ ^Af ” v z 


(33.28) 



Figure 33.11 Position and velocity pairs, red corresponding to the pair (x, v x ) and blue to the pair (y, vi y ), where Xj denotes x or y and similarly with a. lor 
the r and v x of Figure 33.10. The factors r 0 and v 0 are the radius of the emission area and the average transverse velocity. The gray ellipse is an area in 
phase space that encloses most of the dots, and will be related to e. 
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For electron beam devices, the electrons are aggressively accelerated in the z direction to avoid emittance growth and 
space-charge issues. Consequently, the difference between v z and the total velocity magnitude v is not all that great, and 
so making the replacement v z -*■ v is useful in the relativistic factors p = v/c and y = 1 /yj 1 - p 2 . Additionally, it is not 
the velocity that matters in the phase space descriptions so much as it is the momentum m\v\ = h\k\. As the beam becomes 
relativistic, 

hk z = mv z -* ( ym)(Pc ) = pymc (33.29) 


where the behavior of Py in the denominator of Eq. (33.28) is what causes the emittance to decrease as the kinetic 
energy increases. Therefore, the normalized emittance takes out the Py factor and genuflects to the usage of Tik over x', and 
so [672] 


pye = e njms = A <£) - (xk x ) 2 


(33.30) 


Emittance can be brought to very low values simply by reducing the current in the beam: space-charge effects fade, 
small cathode areas can be used, and so on. But low current beams are often contrary to, or at least not only, what is 
desired. As a result, a related figure of merit gets both current and emittance together, and is called brightness B. Brightness 
relates the amount of current I to the area in which it can be focused A and how it diverges into a solid angle £2 by [658] 


(33.31) 


where the constant of proportionality depends on the distribution of the particles in the bunch, for example the pro¬ 
portionality constant for a hollow shell (the so-called Kapchinskij and Vladimirskij distribution) is 2 [659], Using the 
normalized rms emittance in Eq. (33.31) results in the normalized brightness. 


33.3.2 Thermal emittance 

The way things are does not determine the way they ought to be. 

- Michael J. Sandel 8 

For symmetrical distributions in (x, k x ) space, (xk x ) vanishes, as Figure 33.11 makes plausible: for every point on the 
right side (x > 0) a point on the left side (x < 0) matches it (or would for a sufficiently dense distribution of points), and 
so for beams from uniformly emitting rotationally symmetric surfaces, the factor (xk x ) -> 0. Additionally, for such beams 
(x 2 ) = (p 2 ) /2 and (k 2 ) = (k 2 ) /2 as is straightforward to show. 


EXAMPLE: For a rotationally symmetric and uniformly emitting cathode of radius R, first show that (x 2 ) = R 2 /2 
where R is the rms radius of the cathode, and second show that h 2 (k 2 ) /2m = E L /2. 

SOLUTION: Define p 2 = x 2 + y 2 and k\ = k 2 + k 2 . Then 

> = Ji 2 r* 2 r &(p2 -* 2 - y2)dxdy={y2) 

where the last equality follows from the symmetrical nature of x and y. As a result. 

For R = \J{p 2 ) then the result follows. Applying the same logic to (k 2 ) and letting E x = h 2 k x /2m and E ± = E x + E y , 
then 


s Michael J. Sandel, Justice: What’s the Right Thing to Do. New York: Farrar, Straus and Giroux, 2009, p. 165. 
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For uniformly emitting and rotationally symmetric emission areas, it is then common to define the mean transverse 
energy (MTE) as (E L ) and so write the normalized rms emittance as 

£ n,rms = ^VMTE (33.32) 

For thermal emission, the emitted energy distribution is MB. In cylindrical coordinates k p takes over for the transverse 
k L , and so 9 

, , k 2 exy(-E„/k„T)2jrk n dk n 

(£) i m (33.33) 

UZ exp(~E x /k B T)dk x } 2 

The integrations can be performed analytically to give 

An(mk B T/h 2 ) 2 _ 2mk B T 


W) = 


2 nmk B T/Ti 2 h 2 


(33.34) 


which might have been expected from the equipartition theorem of Eq. (5.13). The result identifies MTE = k B T, and so 
in the case of thermal emission, 

R fk»f 

£.™(thennal) = - \ (33.35) 

' 2 V me 2 

so that emittance is often identified by a temperature for thermal emitters. 


33.3.3 Photoemission emittance 

The application of the mean transverse energy approach to calculating the emittance of a photocathode is a bit more 
nuanced [181, 673], as the evaluation of the moments in Eq. (31.21) require a consideration of the photon energy's 
contribution to the emitted electron energy. For a flat surface (one-dimensional emission barrier), continuity of the 
electron wave function and its first derivative require that the transverse momentum component k L be conserved across 
the barrier. The total energy of the electron prior to encountering that barrier, however, is E + Tico. This means for metals, 
h 2 k 2 

—— =(E+ Tied) sin 2 # (33.36) 


As a result, and very much like the calculation of quantum efficiency i: 
emittance for photoemission becomes 


Eq. (31.21) using the moments approach. 


= R(±\ I M 2^ k l) 

£n ’ rms 2\mc)]j 2M 0 (k ± ) 


where, recall, the subscript of M n specifies the power to which its argument is taken in the integrand as per Eq. (7.8). 
Substituting the various approximations behind the integrand terms into Eq. (31.21) as applied to M n (k ± ) gives [673] 


,, ,, , 1 / 2 m \<”+ 3 )/2 j 

" ( l)W 8a: 2 V h 2 ) (« + 2)(H 


X / 


^/j^+-s(s+hw - d>y n+2 v 2 


(33.38) 


For photon energy hco close to the Schottky-reduced work function 4>, the integral may be approximated by 


\]2mg/ti 2 [2m(to - $)/ft 2 ] ( ” +2 ^ 2 

An 2 {g + hco)( 1 + p 0 )(n + 4 ){n + 2) 


(33.39) 


Much of what dresses this approximation 
most of it. What is left is 


M n is stuff independent of n\ as a result, ratios of M 2 with M 0 will eliminate 


£„, rms (photo) ft 


(33.40) 


9 One may wonder what became of the k and k z integrations in the denominator. In cylindrical coordinates for the MB distribution, all of 
the kj integrals are equal (hence the square of the term in the denominator). Further, the integrals containing k z appear the same in both the 
numerator and denominator, and so cancel. 
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which is the Dowell-Schmerge formula for photocathode emittance [181]. 10 Like the thermal emittance of Eq. (33.35), Eq. 
(33.40) is the product of a characteristic radius with the square root of a characteristic energy factor (k B T for thermal, 
(tico - (f>)/3 for photo) in ratio with the electron rest energy me 2 , an intuitively pleasing result. 


33.3.4 Field emission emittance 


And ere a man hath power to say “Behold!" 

The jaws of darkness do devour it up: 

So quick bright things come to confusion. 

- William Shakespeare 11 


The evaluation of emittance from a planar surface for field emission is similar to the moments approach for photoe¬ 
mission but without the addition of a photon energy and using the transmission probability associated with tunneling 
rather than a step function for emission over the barrier. Recall that in the Fowler-Nordheim model, the transmission 
probability can be obtained (e.g., Eqs (13.11) or (17.11)) as 

-In D(k z )= V2»z0 3 V (y)~ 2j-yj2m® t(y){E z -g) (33.41) 


Although the full behavior of v(y) captured in Eq. (13.23) will be used, the variation of t(y) is weak and so approximating 
it by t(y) —<■ t(y 0 ) = 1 + (1 /6e) = 1.06131 is convenient. When used in the moments approach, the MTE for field emission 
becomes 


<* 2 > = 


fo F dk z D {k z ) k)dk p 

f k " dk z D ( k z ) k p dk p 


(33.42) 


The integrations over k p are easily dispatched, giving 


= l /c \ T dk z P(k z ) Q <4-kj) 2 ^ 2k 2 

{ P 2 f 0 kr dk z D ( k z ) (k 2 p - k \) ~ [4ghF(2m0) l/2 t(y) - l] 


(33.43) 


and so the emittance associated with a one-dimensional field emission barrier becomes 


Field emitters have always been described as exceptionally bright sources, but their brightness comes not from the MTE 
values but rather their smaller emission areas, as will be seen in Section 33.4. 


33.3.5 Comparisons of the one-dimensional emittance equations 

.. .since statistical data are rarely uniform, they may be grouped or “interpreted * in different ways. Do not hesitate to use only that part 
of the statistical data which suits your case, especially if you do not expect to be challenged. 

- Nicholas Capaldi 12 

Insofar as the photo, field, and thermal emission equations for current density and emittance can be related to a 
moments-based model, they can be compared. Secondary emission (from diamond) is a breed apart, and so will be taken 
up separately. By virtue of the approximation of a planar emission area, comparisons of e n rms amount to comparisons 
between \jMTE for the three emission mechanisms: ratios between e will simply factor out the radius of the circular 
emitting area. This is just as well: comparing the emittances of different electron sources with different emission areas 


10 The mistaken neglect of the photon energy hxo contribution to k 2 in ref. [674] introduced a factor of -\Ju/(u + ho>) that was identified and 
corrected in ref. [181]. Science is, as Carl Sagan approvingly observed, "self-correcting" (Carl Sagan, Cosmos. New York: Random House, 1980, 
p. 333). 

11 William Shakespeare, A Midsummer Night's Dream, 1.1.147-149. 

12 Nicholas Capaldi, The Art of Deception: An Introduction to Critical Thinking. Buffalo, NY: Prometheus Books, 1987, p. 58. 
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and operating conditions is cherry-picking of data in the worst way. Removing the dependence on cathode area by first 
considering equal emission area conditions allows for greater appreciation for which cathodes are bright and which are 
not when emission area is re-introduced. 

Each emission mechanism has a characteristic parameter on which e depends: thermal emission involves k B T, pho¬ 
toemission of course involves hco, and field emission most certainly will squeeze in B„® 3/2 /F. Once the characteristic 
parameter is identified based on typical currents drawn from typical applications, the variation in emittance as parameters 
change can be addressed. Representative cases for the various processes are considered in the next few examples. 


EXAMPLE: Find \[MTE for standard thermal emission conditions. 

SOLUTION: A typical current density as used in a generic TWT (Table 33.1) is given by 
0.288 A 


J = - 


w(0.1 cm) 2 


- = 9.16732- 


For typical parameters O = 2 eV and F = 0.1 eV/pm, then J RIiD {T) as per Eq. (12.3) will occur for T = 1357.09 K. As 
a result, 

V-MTEf thermal) = ^ = 0.000478388 

For a cathode of radius 0.1 cm, then e n ^(thermal) = 0.239194 mm-mrad. 


EXAMPLE: Find \JMTE for standard photoemission conditions. 

SOLUTION: Consider a metal photoemitter of radius r = 0.5 cm with <t> = 4.5 eV and n = 7 eV that is R = 50% 
reflective with a penetration depth of <5 = 12.8 nm at a wavelength of 2 = 266 nm. Let the relaxation time be t = 4 
fs, typical of e-e collisions, as in Figure 32.15. Let the maximum field correspond to F = 115 eV/pm. It follows 
that 0 = ® - i/4 QF = 4.09307 eV, k 0 = y/2m(fi + 4>)/h = 17.0634 nm” 1 , and so p = m8/hk 0 r = 1.61994. When such 
values are used in Eq. (31.27), then QE = 2.12268 x 10 -4 . Let J(fua) correspond to a charge of 0.01 nC emitted in 
50 ps from a circular area with a diameter of 1 cm. Collectively, such values, nominally suggested by refs [675] and 
[676], entail that the laser intensity be 7 i = 5.59165 kW/cm 2 as per J = ( q/Tico ) QE I x . Then, using Eq. (33.40), 

V MTE(photo) = ( (4-66106-4.09307) eV j 1/2 = 0 000608697 
v ^ \ 3 X 510999 eV / 


EXAMPLE: Find \jMTE for standard field emission conditions. 

SOLUTION: For a field emitter with ® = 4.5 eV and n = 7 eV subject to an apex field of F = 6 eV/nm for an apex 
radius such that the the notional emission area is nr 2 = n{20 nm) 2 , then J(F)nr 2 = 52.28 pA, a high but not atypical 
value [460]. Expressing emittance in terms of B = (4/3h)y/2m® 3 = B 0 <t> 3 / 2 = 63.0458 eV/nm, where B 0 is as per Table 
2.3 and t(y 0 ) from Eq. (13.24), then 

VMTE(field) = | ^ [3f(y 0 )| (|) - l] 'j ' 

/ 8X7 \ 1/2 

= - ^2 - ] = 0.00144032 

\ 510999[53.8264 — 1] / 


Each of the MTE relations can be put in the form of MTE/MTE g = x/x 0 , where x is given by k B T (thermal: k B T 0 = 
0.116945 eV), FO/B (field: F^O/B = 0.414064 eV), or A/A 0 with A = hco-4> (photo: A 0 = hco 0 - (p = 0.567995 eV). 
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Observe that in the examples, the values of \JMTE for (thermal, photo, field) are in the proportions (1,1.272,3.011), 
that is, MTB(thermal) is the smallest of them, and MTE’(field) is the largest for the chosen values of T 0 , F 0 , and co 0 (which 
are themselves somewhat arbitrarily chosen). This is not in contradiction to the expectation that field emission sources 
have unparalleled brightness: what has not been included in the analysis - yet - is the consequences of emission area. 
By increasing the laser power and substantially reducing the emission area, the emittance of photocathodes can be made 
better than thermal sources. Field emission, with its nanoscale emission areas, is breathtakingly small, on the order of 
e nm(S (field) = 14 nm-mrad for r = 20 nm and other parameters of the field emission example above. 

However - and this is a rather consequential "however" - field emitters are sharp little structures from which electrons 
are borne with a rather substantial transverse energy kick. Of course, thermal and photocathodes also have structure, 
and such surface "roughness" is implicated in their emittance values exceeding the theoretical limits, particularly for 
photocathodes [677-679], but the sharpness of field emitters is particularly legendary, and how curvature of the surface 
contributes to the emittance, especially for an array of emission sites, merits its own treatment in Section 33.4. Trans¬ 
verse emittance, though, is not the only concern. Longitudinal energy spread leads to longitudinal emittance, and it can 
interact with space charge in particularly mischievous ways for the high current density associated with photocathodes 
[514, 680-682]. All told, thermal emitters have advocates [683], and field emitters have opportunity [20, 24] apart from 
what one-dimensional models imply. 

33.3.6 Space charge and emittance 

... men do not make events, but events make men. 

- Clarence Darrow 13 

What electrons do depends on what is going on around them. Although transverse emittance has received most of 
the attention, a gentle familiarization with some issues surrounding longitudinal emittance merit a look because of how 
space charge affects it. Longitudinal emittance is caused by differences in forward energy. The position of a non-relativistic 
electron in a bunch is given by the kinematic equation Eq. (33.24) with a = F/m. For a pulse length of St, the separation 
between an electron at the head of a bunch to one in the tail is then simply dictated by the difference in launch times, 
where the tail electron is emitted at a time St after the head electron, or 

x(t) - x(t - St) = v 0 St + —(2 t-St) (33.45) 


where v 0 can be the maximum energy of a photoexcited electron v 0 = \J2{hco - (j))/m or perhaps the thermal energy v 0 = 
y/k B T/m. Although the head and tail electrons can have different velocities (e.g., if the tail electron has a larger transverse 
energy on emission so that its forward energy is close to zero) and therefore induce a larger separation x{t) - x{t - St), at 
surface fields in the MV/m range, such distinctions get trifling, and it is simpler to take v 0 as the same for both head and 
tail (e.g., in a 1 MV/m field, an electron is accelerated to the 2.6x the room temperature thermal velocity in 1 ps). 

Compare such separations with the radius R of the emission area. For representative parameters of photoinjectors 
[684, 685], take F = 1 eV/nm and R = 0.1 cm for a St = 10 ps pulse, and let the electrons emerge with a thermal velocity 
for room temperature conditions (v g = \Jk R T/m = 67.431 nm/ps). Under such conditions, \x(t) — x(t - St)\/R = 0.062233: 
the thickness of the disk of charge compared to its radius is such that the bunch is a pancake (sometimes described as 
"all ends"). 

Demands on the photoinjectors to reduce emittance make matters more interesting. A reduction in area decreases 
transverse emittance, and the acceleration of the beam to near-light velocities as rapidly as possible has all the electrons 
in the bunch traveling at nearly the same speed so that further elongation of the pulse does not continue. Space charge, 
however, creates issues. Keeping the visualization of the pancake of charge in mind, if the charge density of the disk is a, 
then a field on each side of the disk of SF = 8 nQo = q 2 o/ 2e 0 is created (Section 16.1 and the Gaussian pillbox argument 
of Eq. (16.35), and recall that a is a number charge density). Electrons at the head of the pulse therefore feel a force 
pushing them forward, whereas those in the tail feel a force pushing them back, relative to a charge in the center of the 
pulse. 


13 Quoted in Irving Stone, Clarence Darrow for the Defense. New York: Signet, 1971, p. 133. 
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EXAMPLE: Find the space-charge field associated with a disk of radius r = 0.05 cm containing Aq = 0.1 nC of 
charge. Compare its magnitude to a background field corresponding to F = 50 eV/pm. 

SOLUTION: Using 8F = 8 nQc with a being the number charge per unit area, then 

8F = 8xQ ( — \ i = 0.00719 — 

\ q J 7rr 2 nm 

or 14.38% of F. 


The field SF is added to the force accelerating the electrons in the head, and taken away from that for the electrons in 
the tail, causing Eq. (33.45) to become 

x{t) - x(t - St) = v a 8t + F ~ 6F (2t - 8t)8t + —t 2 (33.46) 

2m m 

Now consider the slightly more interesting ad hoc example of a Aq = 1 nC bunch emitted in 4 ps, still with R = 0.1 cm, 
where F = 50 eV/pm (corresponding to a field of 50 MV/m). In the absence of space-charge forces, after a time t = A8t, 
then x(t) - x(t - St) = 0.49274, but when the space-charge contributions are included, then it becomes 1.4913: right after 
emission, space-charge forces elongate the pulse. 

That is not all space charge does. For the expected circumstance of when the electrons have different velocities on 
emission as a consequence of how far the photon energy ha exceeds the barrier height 4> (if a metal) or E a (if a semicon¬ 
ductor), those that emerge at a sprint separate themselves from those that dawdle (either because of a different normal 
energy component, or a smaller total energy), and that difference is exacerbated by the space charge such that the energy 
differences can be made much more consequential, increase the longitudinal emittance, and put complicated structure 
on the beam [686]. Such changes greatly complicate how the beam is accelerated and couples in with non-uniformity 
of emission and its relation to both longitudinal and transverse emittance [687]. As a result, and particularly for the 
treatment of RF photoinjectors for particle accelerators and FELs where the bunch charges can be large, the accelerating 
fields intense, and the emission areas by no means uniform, the implicit assumptions behind the gentle theory here (or 
elsewhere) are not adequate. Understanding the evolution of the beams then requires the intervention of beam optics 
simulation codes to make headway. 


33.3.7 Secondary emission and emittance 


All right then. I'll go to hell... 


- Mark Twain 14 


At the moral climax of Twain's masterpiece novel, Huck agonizes over acting in accordance with prevailing social norms 
of his time and place, but resolves instead to do what he naively thinks is egregiously wrong (but ethically exemplary) 
by choosing not to betray his raft-mate Jim to the authorities. Perhaps no other literary example distills so well that 
sometimes what seems to be the worst decision actually represents the right choice when viewed beyond a narrow 
purview. In a way, secondary emission provides another example. 

The treatment of photoemission (Chapter 31) looked askance at scattering because of its effects on diminishing quan¬ 
tum yield (particularly for metals) while at the same time elongating response time (particularly for semiconductors). It 
seems suspicious that scattering can be put to good use after it has been decried, but with emittance, scattering represents 
an opportunity for a short and simple reason: for NEA surfaces such as hydrogen-terminated diamond, electrons that 
have thermalized with the lattice can nevertheless be emitted, and that improves the transverse emittance, as well as 
the longitudinal emittance. This is no surprise in the literature: the virtues of thermalization of hot electrons for high 
quantum efficiency NEA DC photocathodes (like GaAs) are appreciated [526, 529, 688, 689], Bringing the subject up in 
the context of secondary emission, though, is because the secondaries in diamond are created with such a large energy 
above the conduction band minimum (about 13.75 eV or so), so that reducing such a magnificent energy distribution 
to a thermal spread is rather dramatic and offers advantages for diamond amplifiers [324, 529, 586, 587, 590, 690, 690] 
beyond simply allowing for a narrative with more panache. 


^ark Twain, Greenwich Unabridged Library Classics: The Adventures of Huckleberry Finn. New York: Chatham River Press, 1982, p. 282. 
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The consequences of scattering on secondary emission are clearly evident by comparing secondary emission in 
reflection mode with those in transmission mode (Section 32.1). The behavior of the electron energy distribution 
curves (EDCs) show the non-thermalized emitted electrons by the high-energy tail of the reflected EDC compared to 
the transmission EDC where it is absent, as was also apparent in Figure 32.3: such tails are counter to the demand for 
narrow energy spreads needed by some applications, but they do contribute to high yields in reflection mode operation. 
To enable a thermalized beam, the diamond flake must be thick enough to eliminate the non-thermal component, yet 
not so thick that the secondary yield is reduced too far. Relying on transmission current rather than reflected has one 
other advantage: it protects the NEA surface of the hydrogen-terminated diamond flake, or allows the diamond flake 
to protect in turn a more sensitive electron source such as a photocathode from a damaging environment such as an 
RF photoinjector. A diamond flake thinner than the depletion width L D (Section 32.4) allows a field to be maintained 
across it, which further improves the yield. For such reasons, the range over which thermalization takes place needs to 
be compared to the penetration depth of the primary beam as a function of its energy E 0 in order to maximize the yield. 

Measurements of the EDCs in transmission and reflection mode of high purity and very thin (< 5 pm) boron-doped 
diamond films using chemical vapor deposition, when compared to Monte Carlo simulations, determine the minimum 
thickness diamond flakes that will allow for complete thermalization of the secondaries before they are emitted [144]. 15 
In that study, two fixed-thickness nanocrystalline diamond flakes, one of 2.3 pm thickness and one of 0.65 pm thickness, 
were subjected to a range of primary electron beam energies. The penetration depth R(E 0 ) decreased when E 0 was small, 
and at some point the high-energy tail of the EDC was eliminated, leaving a narrow distribution with a full width at half 
maximum (FWHM) of about 0.35 eV composed of secondaries that had thermalized with the diamond lattice. 

For the thicker (2.3 pm) flake, no transmission occurred until the primary beam energy reached E 0 = 13.5 keV, and 
the transmission that did occur did not have a high energy tail. When the beam energy is increased to E 0 = 17 keV, the 
first signs of a non-thermal tail became evident. For E 0 > 17 keV, the total transmission continued to increase, but the 
relative proportion of emitted electrons in the tail stayed the same. The measurements are shown in Figure 33.12, where 
the beam is normalized to its maximum value. More accurately, because experimental measurement has uncertainty, the 
beam is normalized rather toy = [7(E) — I m i„\/\I max — 7 mi „] where I max and I min are the maximum intensity for the former 
and the minimum (or representative minimum because of the noise behavior) of the latter. The thermalized electrons 
all show the same behavior at the large peak near E = 6.5 eV. The non-thermalized electrons make up the tail, and 
are progressively eliminated as E 0 decreases due to both decreased penetration of the diamond flake as per Eq. (32.4) 
and the fewer number of collisions needed to thermalize the primary. To underscore this interpretation, the difference 
between reflection mode for a much lower primary energy of E 0 = 1 keV and the E 0 = 16 keV transmission mode line 
of Figure 33.12 is shown in Figure 33.13. The thermalized component is the same for both, but the non-thermalized tail 
apparent in the reflection data is absent in the transmission data, understandable because the reflection mode secondaries 
are borne very close to the surface and are able to escape before they are brought to heel by scattering. 



Energy [eV] 


Figure 33.12 Emission intensity as a function of energy for various primary beam energies E 0 normalized so that the maximum value of all lines 
coincide. As E 0 declines, the tail of non-thermalized electrons is progressively eliminated. Based on Figure 3 of J.E. Yater, K.L. Jensen, T.I. Feygelson, 
and B.B. Pate, MRS Advances, 2016. Data courtesy of J. Yater (NRL). 


15 The CASINO Monte Carlo program [691] was used by Yater et al. in contrast to the Monte Carlo methods of Section 32.2. 
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Figure 33.13 Emission intensity as a function of energy for various primary beam energies E 0 normalized so that the maximum value of all lines 
coincide. As E 0 declines, the tail of non-thermalized electrons is progressively eliminated. Based on Figure 3 of J.E. Yater, K.L. Jensen, T.I. Feygelson, 
and B.B. Pate, MRS Advances, 2016. Data courtesy of J. Yater (NRL). 

The manner in which the secondaries loose their energy and thermalize, however, dictates that the diamond flake 
thickness is an example of the Goldilocks principle: 16 in weighing between the extremes of thermalization and losses, 
there is a region that is "just right" where both the emission and thermalization are jointly maximized, and Monte Carlo, 
by allowing the secondaries to be followed and their energy loss monitored, shows how. Matching the experimental 
arrangement, different primary energies can be, via simulation, injected into a flake region and contours of their energy 
loss monitored: when the losses have dropped enough, say to just past where 5% of the initial energy contour is, then that 
is the Goldilocks thickness. Two such examples are given in Figure 33.14, which shows the cross-section of a diamond 
flake for which high-energy primary electrons are incident from the bottom and generate a line of secondaries each with 
an initial energy of» 13.75 eV. The secondaries then diffuse and lose energy, and the contour lines mark the extent of 
their loss as a percentage of their initial value. The broccoli-like contour shapes show the fanning out of the secondaries 
as they drop in energy, with the largest contour being that for which the energy has dropped to 5% of its initial value of 
13.75 eV, the average energy at which secondaries are created by the primary. The injection sites of the primary are offset 
so as to enable easier comparison of the contours. If the thickness of the diamond flake is made to coincide with the largest 
contour (the 5% line), then the electrons will have thermalized just as they are emitted into vacuum (top boundary of 



Figure 33.14 Energy contours for various primary energies, each color associated with a different E 0 . Contours are drawn at the values of 5%, 10%, 
25%, 50%, 75% and 90% for largest to smallest contours of a given color, respectively, with the various cases spread out along the x axis. The left-hand 
figure is for a box of size 1200 nm x 1200 nm, whereas the right is 2400 nm x 2400 nm. The left shows the primary energies for E 0 = 5.5, 7.0, 9.6 and 
13.5 keV, whereas the right shows 13.5, 16.0, and 19.0 keV because a greater area is being shown. 


16 The "Goldilocks principle", named after the children’s story of a larcenous young girl resorting to breaking and entering to obtain edibles and 
bedding of intermediate temperature and firmness, respectively, has wider application: it is used to describe the just right conditions of certain 
planets like Earth that support liquid water and enable life, up to the minute balancing of the physical constants of Nature that have to be just 
so if life in the Universe can develop - the later discussion invariably being associated with the philosophically fecund "Anthropic Principle". 
See John D. Barrow, The Constants of Nature (Vintage Books, New York, 2004) or Brian Greene, The Hidden Reality (Knopf, New York, 2011). 
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figure). Differences with respect to field across the flake and the thickness of the simulation region occur with respect 
to the left and right images, just as variation in flake thickness is associated with differences in grain boundaries and 
band bending in the nanaocrystalline diamond. Thus, for example, if the flake is 650 nm thick, then the most energetic 
primary to be used is 9.6 keV, as the 5% contour is just at the 600 nm mark. Such studies conclude, therefore, that for 
primary beam energies of 5 keV, a minimum flake thickness of 1 pm is required to insure thermalization of the beam 
(although a flake twice that thickness may be preferable as it adds mechanical strength) [590], 


33.4 Emittance for a bump 


Once more unto the breach, dear friends, once more... 

- William Shakespeare 17 

As the Bard tells it, the exhortations of the English king Henry V to his men to yet again storm a break in the walls of 
the French port of Harfleur enabled the taking of the town in the year 1415, the opening move in Henry V's ambitions to 
press his claims on the French throne. Incensed, the French overtook the English at Agincourt, intent on exterminating 
the invaders. Astonishingly, the wearied English force defeated the substantially larger French force in the resulting battle, 
an outcome that owed much to the use of the longbow and a preponderance of archers, whose task, in a description by 
John Keegan, 18 was to 


.. .provoke the French into attacking, and it was therefore essential that their arrows should “group " as closely as possible on target. To 
translate their purpose into modern artillery language, they had to achieve a very narrow 100° zone (i.e., that belt of territory into which 
all missiles fell) and a Time on Target effect (l.e., all their missiles had to arrive simultaneously). 


The arrow barrage precipitated its desired effect. Longbow arrows could punch through armor for ranges closer than 
100 m, but at the much longer distances of the first volley that provoked the French to charge, the plate steel making up 
the helmets and armor of men afforded greater protection. The effect on the less protected and therefore wounded and 
panicking horses of the French cavalry was an entirely different matter. The descriptions of a volley of arrows in a small 
area descending at a sharp angle and arriving on a plane at a set time is very reminiscent of the description of "emittance". 
Where the arrows landed and the angle at which they protruded from the ground (or whatever they lodged in) brings 
to mind x and dx/dz in a tightly clustered pin cushion of phase space rendered in the relatively antiseptic Figure 33.10 
compared to the carnage of the battlefield, and invites rumination on what impact on the trajectories of arrows - or 
electrons - that being launched from an uneven surface causes. 19 So, once more unto the breach. 

In the absence of wind resistance (drag is neglected), arrows follow a parabolic path making the kinematics of their 
ascent symmetric to their descent, and so for the sheer convenience of it, consider just the descent part of the trajectory. 
From apogee to ground, the path is equivalent to shooting a projectile horizontally from a height h equivalent to the 
apogee. Setting t = 0 to be when the arrows are at apogee, then the initial velocity is v 0 , the initial height is found from 
z(t) = 0 = h - (1/2 )gt 2 (where g = 9.8 m/s is the gravitational constant), and the range fromx(f) = v 0 t. The velocity in the 
z direction is simply v z (t) = -gt. Thus, it is easy to show 


dx _ Vp _ v 0 
dz \j2g{h - z) v z® 


(33.47) 


As a result, v z (dx/dz) = v g , which hearkens back to Eq. 33.30 for the capturing of the behavior of a property that does 
not change as the "beam" of arrows accelerates, just as if the radius of the cathode is R and the rms velocity is v 0 , then 
(Syt = Rv 0 /c, that is, it is invariant. So far, so good. 

Arrows released horizontally with a velocity v 0 from an elevated height h follow the same trajectory, but launching 
from a height suggests variations. If the arrows are launched normal to the surface from which they are fired (say, 
straight out of a window), the change of the surface from flat to curved means that many of the arrows come out in 
different directions. Variations in the strength of the archer means that there are also variations in the value of v 0 for 
each projectile. If the stronger archers cluster, then the initial velocity and the number of particles changes across the 


17 Ref. [37]: The Life of King Henry the Fifth, m.i.l. 

18 John Keegan, The Face of Battle. New York: Viking Press, 1976, Chapter 2: Archers versus Infantry and Cavalry. 

19 It is to be hoped that such speculation would follow, rather than lead, deep pondering about whether pressed claims over territory and an 
asserted right to rule justify four-digit body counts in a single battle, a fraction of which were captured prisoners. 
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Figure 33.15 Electron emission from a carbon fiber field emitter as simulated using the beam optics code 
MICHELLE. The diameter of the carbon fiber is 35 pm and the length of the fiber (not all of which is 
shown) is 1.5 mm, comparable to experimentally tested carbon fibers. Light blue lines denote the 
equipotentials. For the trajectories, the deep blue lines contain the most current, green are intermediate 
amounts, and red contain the least current. Based on Figure 19 of ref. [451], Image courtesy of J.J. Petillo 
(Liedos). 


surface. All of these are readily understandable in how they affect the "time on target", but less analogous modifications, 
such as gravity varying in magnitude and direction near the surface, the projectiles repelling each other in flight, and 
their spiraling about the Earth's magnetic field, really complicate matters: by the time these particles get to ground, they 
will arrive at various times (x) and stick into the ground at various angles (dx/dz). The sophomoric understanding of 
emittance as a cluster of phase space points seemingly considered at one point in time in one plane gets convoluted. 

Following the trajectory of particles subject to such variations is the task of beam optics codes, and such codes have 
become remarkably versatile. An example of electrons emitted from a wire-like surface subject to many of the afore¬ 
mentioned complications above is shown in Figure 33.15 for the simulation of field emission from a 35 pm diameter 
and 1.5 mm long carbon fiber [448, 451], with even more modifications: the effects of the adjacent carbon fiber off to 
the right, not seen in the view shown, are inferred from the unsymmetric blue equipotential lines hugging the surface 
representing the fiber (the lines on the right do not drop as quickly, signaling the presence of the neighbor). Such codes 
can follow the growth in emittance from all the varied complications through the life of the beam or electron bunch. 

Nevertheless, what happens near the surface of the cathode, the so-called initial conditions of the beam or bunch's 
subsequent trajectory, is tough, particularly when the emission sites are on the nanoscale and the small geometric shapes 
making up the surface of the emitter in Figure 33.15 are pm sized and typically larger. The contributions to emittance 
due to unruly features and conditions at the surface of the cathode are termed intrinsic emittance-. no amount of beam 
optics will undo the damage to phase space that the cathode puts on the distribution, and so intrinsic emittance is an 
irreducible part of e and, sometimes, the dominant part. Simple models relating to how it manifests itself are therefore 
still useful, first for simple understanding and second for figuring out how the initial conditions (the cathode's "virtual 
anode") in beam optics codes can be constructed. 

It seems obvious that care must be taken with each complication. The paths the electrons take to the (virtual) anode 
and their time of arrival means that a method of calculating trajectories in the high fields near the apex of the emitter is 
needed if emittance is to be addressed. The changing of the current density over the surface and the launch direction (the 
normal to the surface) affect the transverse velocity components that survive to the anode. Lastly, the presence of more 
than one emitter bodes ominously for how (x 2 ) is to be calculated. Begin systematically and start with the trajectories. 


33.4.1 Trajectory evaluation 

Prevent trouble before it arises. 

Put things in order before they exist... 

The giant pine tree 
grows from a tiny sprout. 

The journey of a thousand miles starts from beneath your feet. 

- Lao Tzu 20 


3 Lao Tzu, Tao Te Ching: By Lao Tzu (trans. Stephen Mitchell). London: Francis Lincoln Limited, 2013, p. 64. 
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The trajectory equations above, namely, z(t) = h - (1/2 )gt 2 , v z (f) = -gt, and x(t) = v 0 t, are the simple solutions to New¬ 
ton's equation for constant force expressed as 

= F (33.48) 

with r(f) = x(t)x + z(t)z, F = —mgz, and where h, v 0 , and g are constant. Because gravity is constant and directed along 
one of the cartesian coordinate axes, the equation separates into separate equations for x(t) and z(f), and each is solved 
separately. The methods to be discussed apply to either x(f) or z(t), but for simplicity pick x(f) and dx/dt = v(t). The task 
is then to numerically integrate the equations: naively, it may seem that the algorithms of Section A1.4 may be useful, 
but alas, in those algorithms both boundaries x(t = 0) and x(t = t max ) are known, whereas in the present case only the 
initial conditions x(f = 0) and v(t = 0) are known, and that makes a difference. Still, the finite difference methods there 
are useful to understand before undertaking the approaches described here. 

Accurate methods such as the Runge-Kutta (RK) approach [236] are useful for numerically solving ordinary differ¬ 
ential equations (ODEs). A commonly used form of RK is fourth-order accurate. Nevertheless, consider the leap-frog 
and average value methods here for calculating the trajectories even though they are only second-order accurate: their 
simplicity makes their implementation easy, and accuracy can always be improved simply by reducing the time step size. 

The leap-frog method is easier to introduce. The first step is to recognize that there are really two differential equations 
inside Eq.(33.48), one for x(f) and one for v(t), and they satisfy 

|x( t) = v(t) (33.49) 

£ (33.50) 

where F = F • x. Of course, the force may be position and time dependent. Now the artistry comes in: by the central 
difference approximation methods of Section Al.3.2, the derivatives are 

x(f + Ar)-x(r) = o(r+y)Ar (33.51) 

= (33.52) 

The notation is already distasteful, but a method to bring it to elegant simplicity is to let the time coordinate, and therefore 
the position and velocity terms, be discretized such that 


t -> tj = jAt 

x(t)^x(tj) = Xj (33.53) 

v(t- At/2) -► v(tj_ 1/2 ) = Vj 

It is essential to observe that whereas position is defined on the time steps tj, the velocity is defined on the halfway point 
between the time steps, or at tj_ , /2 . Using Eq. (33.53) in Eq. (33.51) gives 

X;ti ~ x j = At Vj (33.54) 

v j+l - Vj = At a } (33.55) 

where the acceleration = F(Xj)/m is introduced. Is it good? An nth-order method is exact for a polynomial for which 
the largest power present is n. Motion in a constant gravitational field is parabolic (second-order in time), and so the 
leap-frog and average value methods are exact under such circumstances. Consider how this is shown for the leap-frog 
approach. From Eq. (33.54), x ;+1 = x ; - + At Vj but now recall that Vj is defined at the time step between the x ; - time steps, 
that is, 

x[(j + l)At] = x(jAt) + v[(j + 1/2)At] At (33.56) 

From Eq. (33.55), though, v[(j + l/2)Af] = v(jAt) - (g/2)At and so 

x[(j + l)At] = x(jAt) + At \v(jAt) - i#At] 

= x(jAt) + v(jAt) At - ^gAt 2 (33.57) 
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which is the solution xft + At) = x(t) + v(t)At - (g/2)At 2 to motion in a gravitational field and confirms the leap-frog 
approach is accurate to second order. The final step is to express the leap-frog algorithm in a manner suitable for repeated 
evaluations by formulating it as a matrix equation. When put in a form that will accommodate the second (or average 
value) approach as well, it is 

/1 _a A /v \ /v\ / n n\ / xi \ 

(33.58) 


for which the solution is 


(: r)fe)-0*(: 
te)-(; r)K 


v w 


Unfortunately the leap-frog approach has complexity below the surface that may be confusing: the indices on x and v 
refer to different times, and so a facile reading of the equations Eq. (33.54) and (33.55) that result leads to trouble for the 
unaware. That, plus having to specify what v 0 = v(-At/2) is requires one to be careful in setting up the calculation of a 
trajectory. Such demerits, as it were, should be weighed against the advantage of only needing the acceleration term aft) 
evaluated at the time t (and, as shall be seen below, fewer computations). 

The second approach conveniently defines xft) and v(t), and by extension aft) = Fft)/m, at the same time steps t —► 
tj = jAt, or Xj = x(jAt), Vj = vfjAt) and a } = aijAt). Doing so, however, means that the central difference approximation of 
Eq. (A1.85) has lost its midpoint (why v[(j + l/2)Af] was introduced in the first place). Observing that the midpoint can be 
approximated by the average of the end points, as in Eq. (A1.9), then a reasonable approximation is sj +1 / 2 = if + s j+\)/ 2, 
where s stands in for x, v and a. The defining relations are now 


Xj+i = x j + ^(.Vj + v ]+l )At (33.60) 

v j+i= v j+\^j + a j + i)At (33.61) 


To show that the method is (like leap-frog) second-order accurate, use the gravitational example for which a ; - = a ;+1 = —g. 
Then it immediately follows that v j+l = Vj - gAt and so Xj +1 = Xj + VjAt - (g/2)At 2 simply and naturally. These are, of 
course, the usual elementary equations for motion in a constant gravitational field that bedevil undergrads, just written 
differently. Lastly, the solution of the associated matrix equation is 




The advantage of having x and v defined at the same values of time has come at the cost of needing to know a j+1 in 
the average value approach, something the leap-frog algorithm did not require. For constant acceleration, that is not a 
concern, but it is for more general problems involving forces that change with position (such as space charge or fields near 
a protrusion). Approximating a j+1 « aj renders the entire simulation only first order in accuracy because the equation 
for v j+1 becomes only first-order accurate, independent of the fact that the equation for x ]+l is second order: accuracy is 
always set by the worst performer, much like a chain being only as strong as its weakest link. 

The weakest link can be fixed. Let the value of x J+l that comes about from using the first-order approximation a j+1 -» 
a* = aj in Eq. (33.62) be denoted x*, that is, x* is the first order approximation to x ;+1 . The second-order approximation to 
a j+l is then a j+1 = a(x*) = F(x*)/m, and that value is used when Eq. (33.62) is evaluated a second time. In such a way, 
a good bit of the second-order accuracy is recovered. The price of having the same time points used for x(f) and vft) is 
at the cost of having to perform the matrix evaluation twice in order to achieve second-order accuracy. The result of 
an average value method approach compared to a leap-frog approach is compared to the analytic result in Figure 33.16 
for the harmonic oscillator, evaluated using the code of Section A3.34: both are seen to be good. The results would be 
better (the small drift away from the analytic result over time) if an additional constraint like conservation of energy 
were imposed at each time step, a method often employed in particle simulations [606]. 

The accuracy of the leap-frog and average value methods can be assessed by applying them to a quintessential problem, 
namely the harmonic oscillator of Section 9.3, which is an excellent problem on which to test them: it has a known 
analytical solution and it is not monotonically increasing or decreasing, and therefore tends to flush out how errors like 
to propagate. As a practical matter, the importance of having an analytically exact solution to a problem on which numerical 
solutions are applied to test the quality of the numerical approaches cannot be overemphasized. Insert the known solution 
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0 16 32 48 

Time [At] 


Figure 33.16 Comparison of the leap-frog and average value methods for evaluating the trajectory x(t) of a harmonic oscillator for units of x(0) = 1 and 
At = 1 using the algorithm of Section A3.34 for N = 48 and <»Af = 4ir/N. 


x{t) = A cos(®t + $) into the discrete equations Eqs (33.54) and (33.60) for the purposes of discerning what the error 
terms are. It follows that v(t) = -coA sin(®f + <■/>) and a(t) = -m 2 x{t) with t = nAt. Let 4> = 0 and A = 1 for convenience. For 
the leap-frog method, Eq. (33.54) becomes 21 

-^® 4 Af 3 cos(wAf) - ^w 3 Af 2 sin(wAf) « 0 (33.63) 

^-® 4 Af 2 cos(wAt) » 0 (33.64) 

where the first equation derives from x J+ , and the second from Vj +l . For the average value method, Eq. (33.60) becomes 

i® 4 Af 3 cos(ttAf) + i® 3 Af 2 sin(«Af) « 0 (33.65) 

i® 4 Af 2 cos(«Af) - i® 5 Af 3 sin(ttAf) « 0 (33.66) 

where use is made of the small angle relations sin 0 « 9 - 0 3 /6 and cos 6 « 1 - 6 2 /2 only for those terms not containing n, 
as t = nAt can be large. Therefore, both methods are of order At 2 accurate as already known, but the leap-frog approach 
has the smaller error, perhaps expected due to the roughness of approximating a midpoint by an average value of the 
endpoints of a time step. 

Comparing the output of Figure 33.16 suggests that the average value method is performing somewhat better than 
the leap frog method, and this may appear perplexing. There is a reason for it: the average value method is a somewhat 
ham-handed effort to claim the benefits of implicit methods over explicit methods in solving a differential equation [236, 
242], Suppose that instead of solving for one trajectory x(t), one required trajectories for N particles, and therefore had 
an IV-component vector |x) where x n (t) is the nth component of the vector representing the history of the mh particle. 
Parenthetically, the ket notation |x) is used only as a convenient notation of a vector quantity on which matrices will act 
without implying a relationship to anything quantum mechanical, even though the implicit methods discussed below 
are essential, for example, in time-evolving quantum phase space distributions [126, 264, 325] or modeling the heating 
of an electron gas in photoemission [85]. The distinction between implicit and explicit methods concerns the language 
of matrices. Let y be a matrix that in some way scrambles the values of a vector. For example, in the solution of the 
diffusion equation, where d t u(x, t) = Kd 2 u(x, t ), then y would be the finite difference representation of Kd 2 . Letting \u(t)) 
be the vector of u(x n , t ) values, where n denotes the individual particles, then the difference between the schemes is that 
• explicit methods rely only on past values to calculate future values, so that 

\u(t + At))=\u(t)) + M\u(t)) (33.67) 


21 This can be shown by a straightforward and routine albeit tedious analysis using the relations of Eq. (A2.1), the details of which are omitted 
as an act of kindness. 
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• implicit methods use future values in the solution and solve 

\u(t + At)) = | u(t)) + ^W {|w(f + Af)) + | u(t)) } (33.68) 

The implicit scheme looks much like the average value method in that the average of a future |w(f + At)) and past | u(t)) 
are used to create the vector on which y acts, but it demands that a much harder matrix equation must be solved. After 
rearranging, Eq. (33.68) becomes 

|w(t+AO) = {o-iy} {i+iy}|M(f)) (33.69) 

where D is the identity matrix. As an aside, if Schrdinger's equation was being solved rather than the heat diffusion 
equation, then the matrix y -* -zM in Eq. (33.69). The right-hand side is then just the second-order accurate approxi¬ 
mation to 

Wit + Af)> = exp(-zAf y) Wit)) (33.70) 

and the propagator representation of Eq. (8.22) is recovered. 

Matrix inversions can be hard and time-consuming, and are not usually considered unless there is good cause. The 
good cause is that much larger time steps are enabled and stability is achieved at such a level that the additional effort 
is well worth the computational expense (e.g., finding out how an electron gas heats during photoemission [85, 206]). 
The average value method, then, is seen to try to filch the benefits of implicit schemes by taking a guess as to what the 
future value is and using the guess in the calculation of x J+1 , a disposition it shares with RK methods [236], 

The actual evaluation of the trajectories is on hold until, first, the initial velocity of the electron is specified and, second, 
the analytical impulse approximation is introduced. Both will naturally emerge in a consideration of the emittance and 
how it is to be calculated for a curved surface. 


33.4.2 Hemispheres and emittance 

. ..in their endeavor to show that nature does nothing in vain, i.e., nothing which is useless to man, they only seem to have demonstrated 
that nature, the gods, and men are all mad together. 

- Benedictus de Spinoza 22 

The tranquil and elegant derivations of emittance for planar surfaces are shattered when the primary feature of surfaces, 
namely, their roughness, are considered, and any understanding of a useful cathode must deal with it. Roughness comes 
in two flavors, as shown in Figure 33.17: the physical roughness is due to undulations and protrusions on the surface, 
and chemical roughness is caused by work function variation with crystal faces that cause fluctuations in equipotential 
lines near the cathode surface [321, 692], In fact, both occur at the same time, as images from a field emitter tip quickly 
demonstrate in Figure 33.18: on the left, a field emission electron microscope (FEEM) image shows regions of high 



Figure 33.17 Emittance due to trajectory alterations by physical variations in surface field due to physical structure (left) and chemical variations in 
crystal face work function (right). 


22 Benedictus de Spinoza andDagobert D. Runes, Spinoza: Dictionary. New York: Philosophical Library, 1951, pp. 100-101. This source is all the 
more interesting for having its foreward authored by Albert Einstein. 
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Figure 33.18 Images from a Spindt-type field emitter tip comparable to that shown in Figure 13.1. Left, field emission electron microscope image 
(FEEM). Middle, field ion microscope image (FIM). Right, FEEM plus FIM. Images for the analysis courtesy of Capp A. Spindt and Paul R. Schwoebel 
(SRI International). 

geometric field enhancement and low work function sites where electron emission is likeliest, in the middle, a field ion 
microscope (FIM) image shows regions of high geometric field enhancement and high work function where ion emission 
is likeliest, and lastly, on the right, the combined FEEM plus FIM images show where conditions are favorable to both 
(yellow), namely, where high field enhancement and moderate work function overlap. Where both regions are dark is 
where the work function is high, thwarting emission regardless of the field. 

Finding field variations for work function changes is in essence a numerical boundary value problem for which numer¬ 
ical packages are available, but a discussion about it is less enlightening for that reason. Field variation due to geometric 
structure, on the other hand, is easier to take in on first encounter because the idea of an electron being emitted normal 
to an undulating surface suggests right away the idea of a transverse velocity. Not only that, but the models which can 
be brought to bear to treat emittance from a bump for examining the effects of bending trajectories have analogs to the 
curved equipotential lines near work function patches, so the trajectory methods to be described below are similar in 
both cases. Therefore, choosing to treat the model with a well-defined surface is simultaneously more enlightening and 
easier. 

To compare the emittance from a surface with structure for the various emission mechanisms of thermal, field, and 
photoemission, having a good analytical model on which to test all three is desirable. A useful one is the hemispherical 
model of Section 30.2, and using it will follow ref. [451]. 23 As found in Eq. (30.9), the potential outside the sphere is, in 
spherical coordinates. 



(33.71) 


The relations z= rcosO and p = r sin 8 take the spherical coordinates over into the cylindrical coordinates. Finding the 
fields in the p and z directions is straightforward 



(33.72) 



(33.73) 


where the first relation is in cylindrical and the second in spherical notation. At the surface, F z = 3F 0 cos 2 8 and F p = 
3 F 0 cos 9 sin 6, so that the force is radially outward with a magnitude F r = 3 F g cos 0 as per Eq. (30.10). Electrons ejected 
from the surface and accelerated by such forces will follow trajectories that can be evaluated usin g the met hods of 
Section 33.4.1. Recalling that emittance for a rotationally symmetric problem goes as e n rms = (h/2mc)J(p 2 ) (k 2 ) because 
(x 2 ) = (p 2 ) /2 and (k 2 ) = (k 2 ) /2 in Eq. (33.30) (the cross term vanishes), dealing with p and k p is a bit simpler as F p 
can be used. 

In a beam optics code, all the particles in a bunch have known positions and velocities, and so the evaluation of the 
emittance of the bunch is well defined. In contrast, treating emission as continuous makes the evaluation of emittance 


23 Saying "mostly follow" would have been more accurate. First, ref. [451] introduces the impulse approximation late in the development, but 
here, it is introduced from the outset. Second er(0) has changed to Eq. (33.77), which has consequences albeit small ones. Third, Eq. (33.115) 
is mildly different compared to its progenitor. 
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less well-defined. Take a cue from Agincourt (Section 33.3.1), but instead of evaluating the position and angle of arrows 
after they have stuck into the ground (or, regrettably, something standing on the ground), consider instead the group of 
arrows that have the same time of flight or, here, the electrons that have traveled for a time t from the cathode. Such a 
definition will differ from electrons that have crossed a plane (such as a phosphor screen) because of small differences 
in arrival time, but at least the definition gives an approximate measure that can be used to compare disparate sources 
with very different surfaces and operating conditions. 


33.4.3 Impulse approximation 


If we cannot bear too much reality, we might at least avoid too much fantasy. 


- John Dominic Crossan 24 


Finding the trajectories of electrons near surfaces with geometric structure is very difficult and is the bane of simula¬ 
tion, particularly when that structure is nanoscale, but whatever strategy concocted to do so should not be too fantastical. 
Far away from the surface, F -* F 0 z and the familiar ballistic equations analogous to the gravitational problem take over. 
How far out is suggested by Figure 33.19, where the magnitude of the force components F z and F p are evaluated for 
the hemispherical potential of Eq. 33.71: the later becomes negligible and the former transitions over to the background 
field F 0 over a length scale of na where a is the radius of the hemisphere and n is between 2 and 3: the modifications 
to the constant field are proportional to (fl/r) 3 , for which (a/r) 3 = 1/n 3 or between 1/8 and 1/27 for 2 < n < 3. Conse¬ 
quently, the influence of the hemisphere is only at the initial part of the trajectory, suggesting that the initial conditions 
encapsulated in v 0 might be profitably modified. But how to modify v 0 is ambiguous. The initial velocity v 0 in the ballistic 
equations appears to have been given a boost, or impulse, to account for acceleration near the hemisphere. Therefore, a 
simple approximation is then to make the replacement v 0 -*■ o 0 v 0 , where a 0 is a dimensionless factor that dictates the size 
of the impulse. Fantasy, to be sure, but not an egregious one. 

The estimation of o 0 can be made by setting the change in kinetic energy to the change in potential energy as an 
electron moves from the enhanced field near the surface of the emitter to a region where the potential has more or less 
become that due to the constant background field. That is, using Eq. (33.71), let 

|mu 2 [<r(0) 2 - 1] = V f {a, 0) - V f (na, 6) (33.74) 

and so 

a(0) 2 = 1+ (^l) ( ! ^ : )cosW (33.75) 



Figure 33.19 The field components associated with the hemispherical potential shown in Figure 30.2. The blue region represents the hemispherical 
emitter. The size of the region shown is 3 a where a is the radius of the hemisphere. Contours are equispaced between the maximum (3 F a for F z , 3F„/2 
for F p , white) and minimum (0, black). Based on Figures 10 and II of ref. [451]. 


24 John Dominic Crossan, God&Empire. New York: HarperCoilins, 2007, p. 202. 
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which makes sense: emission from the apex gets a good impulse (<r(0) = cr 0 ), but at the base of the hemisphere, where 
the fields vanish, then the impulse should not occur (ct(^/ 2) -*■ 1). Observe the natural introduction of a dimensionless 
factor k defined as the ratio of the energy scale associated with the field and that associated with the initial kinetic 
energy 

F„a 


which, coupled with an estimate drawn from Figure 33.19 suggesting n = 2, gives 

m= [l + (y) cos^] V2 (33.77) 

As a result, replace v 0 -*■ o(Q)v 0 in the ballistic trajectory equations, which thereby accounts for field enhancement effects 
near the bump, and so 


pit ) = a sin 9 + a{6)v 0 t sin 9 

Fn , 

z(0 = a cos 9 + u(0)v t cos 9 H- -t 2 (33.78) 

2m 

The impulse approximation compared to the numerical trajectory evaluation of Eq. (33.60) treated in Section 33.4.1 is 
shown in Figure 33.20. 25 Letting t = mv 0 /F be a characteristic time scale, then define t = pr. In terms of the dimensionless 
factors k and p, then Eq. (33.78) becomes 


With both xit) = p{t) i 
seen to be 


^ = (1 +<j(0)p)sin0 (33.79) 

- =(l+o-(0)p)cos0+ ^cp 2 (33.80) 

and z(f) known, then it is finally possible to obtain the much sought factor x'(t), now readily 


v x {t) _ ui0)v o sin 0 cos (p _ sin 0 cos cp 
v z (t) a{9)v g cos 9 + (F 0 t/m) ~ cos 9 + ( p/o{9 )) 


(33.81) 


Observe that if /(f) were under consideration, the relation would be much the same but for cos cp —> sin <p, where <p is 
the azimuthal angle such that x = p cos cp and y = p sin cp. 



25 The prior version of the impulse approximation (Eq. (68) and (69) of ref. [451]) made use of the ad hoc form o a o 0 cos 9, suggested by the 
intuition that the impulse factor a must decline as 9 increases. The present treatment shows that cos 9 by itself is a tad too aggressive, and that 
Eq. (33.77), being based on Eq. (33.75), gives a better account. 
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33.4.4 Single hemisphere emittance parameters 


A critical factor in the operation of coherent radiation sources such as free electron lasers (FEL), gyrotrons, etc., is the quality of the 
electron beams ...the problems which are cathode-related such as patchiness, nonuniform emission, roughness, and nonconstant work 
function, etc., may in many cases place a fundamental limit on the brightness of an electron beam. 

- Yue-Ying Lau [693] 


When comparing the thermal, photo, and field emission mechanisms, the development of parameters depends on 
scales characteristic of each process so that direct comparisons between them can be made. If distances are given in units 
of a and time in units of t such that t = pc, then P z = vjc = (p + 1 )v 0 /c. Next, emission shall vary down the side of the 
hemisphere to a degree specified by the emission mechanism being considered. The notional angle cos 9 n = q suggested 
by Eq. (30.17) and defined by 

\-q = jj sin0dd = P(9) sin Odd (33.82) 


becomes useful to develop scaling relations using the methods of Section 30.2. Because each emission mechanism (ther¬ 
mal, field, or photoemission) differs, so will the forms of P(9) = J\F(9)\/J(F tip ), q, and v 0 differ. From P(9), A(0), defined 
after the integral of Eq. (30.13), is 

A(0) = — = f P(0')sm0'd0' (33.83) 

hoss 1 - n Jo 


where I boss is the total integrated current over the surface of the hemisphere. A (9) therefore gives a measure of what 
fraction of the current is emitted by the surface of the hemisphere inside the polar angle 9. 

The velocity v 0 needs a bit more effort: using methods motivated by Eq. (3.5), but in a way closer to the spirit of Eq. 
(7.9), of J = qp{v), then, from Eq. (11.1) 


f°°(hk/m)D(k)f(k)dk 
fo° D(k)f(k)dk 


(33.84) 


where D{k) is the transmission probability, f(k) is the supply function, and hk/m is a velocity. Likewise, the evaluation of 
( v 2 ) would involve a factor of (hk/m) 2 in the numerator, so that, for example, the rms velocity is 


v ms = Viv 2 ) = (h/m)yfW) 


(33.85) 


The velocities v 0 and v rim are not the same, although they can be close: the former is required to launch the electrons in 
a trajectory calculation, but the latter is required in the evaluation of emittance right at the surface. 

In what follows, the calculation of the notional angle factor q, the integrated current fraction A(0), the launch velocity 
v gt and the rms velocity v rms is undertaken. Add to that list a dimensionless characteristic energy ratio parameter going 
by the designation b: it is crafted from the field- and/or temperature- and/or frequency-dependent characteristic energy 
parameters of each mechanism on which the other terms will depend, and will be found on a case-by-case basis. The 
evaluations require terms drawn from Tables 2.3 and 33.2. 


Table 33.2 Emission parameters used in the evaluation of emittance (see also Tables 2.2, 2.3, and 33.3). 


Symbol 

Definition 

Relation 

Unit 

k F 

Fermi wavenumber 

y/2i^/h 

1/nm 


Inverse temperature 

w 

eV" 1 

<t> 

Schottky lowered® 

®--\/4 QF 

eV 

a 

Hemisphere radius 

- 

nm 

<m 

Impulse factor 

Eq. (33.77) 

nm 

V a 

Launch velocity 

Eq. (33.84) 

- 

v rms 

rms velocity 

Eq. (33.85) 

- 

n 

Notional cosine angle 

Eq. (33.88) 

- 

m 

Integrated current fraction 

Eq. (33.83) 

- 

mn 

Characteristic energy ratio 

Process dependent 

- 
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33.4.4.1 Thermal emittance 

Begin by considering the current density J of the Richardson-Laue-Dushman equation given by Bq. (12.3). The field 
dependence resides in the Schottky lowered work function <£ = ® - \JAQF. Knowing that F{9)/F 0 = 3 cos# for the field 
along the surface of a hemisphere (Eq. (30.10)), then the field- and temperature-dependent energy ratio parameter 
b(F, D is 


b(F, T) = 


\[lQF 
k B T 


= /?V4 QF 


(33.86) 


in terms of which the emission probability factor P(#) is given by 

P(#) = exp |# ^ Vcos 6 - l^j 

The notional cosine angle is defined by 


/■*/2 _ 

= 1 - / e Ky/cos0-i) sin edQ 


(33.87) 


(33.88) 


where the second line has used a change of integration variable to u = b{sj cos 9 - 1). The integrated current fraction A 9 
follows a similar evaluation, and results in 


A(#) = 1 - 


t- (b\/cos9- 1 


e by/cos9 


(33.89) 


1 +{b- l)e b 

Using methods already considered in the evaluation of Eq. (3.14), the launch velocity and rms velocity are given by 

v 0 = (2k B T/jim) l/2 (33.90) 

Vrm S ={k B T/m) 112 (33.91) 


EXAMPLE: Evaluate b, r), v 0 , and v rms for typical parameters associated with a thermal emitter, namely, an electric 
field of £ = 333.33 kV/m (or 3 F 0 = 1 eV/pm) and T = 1200 K (or k B T = 0.103408 eV). 

SOLUTION: The evaluations follow sequentially: 

• The field at the apex of the hemisphere is F = 3 F 0 . Therefore, b = \fAQF/(k B T) = (0.0379469 eV)/(0.103408 eV) 
= 0.366962. 

• By Eq. (33.88), r\ = 0.111875, or 9 n = 83.56°. 

• v 0 = (0.206816/;r510999) 1 / 2 (299.792 nm/fs) = 0.134862 nm/fs. 

• v „ = (0.103408/510999) 1 / 2 (299.792 nm/fs) = 0.190723 nm/fs. 


33.4.4.2 Field emittance 

Next consider the current density J of the Fowler-Nordheim equation given by Eq. (13.25), a form which shows the 
field dependence explicitly. As before F(9)/F 0 = 3 cos# for the field along the surface of a hemisphere (Eq. (30.10)). The 
energy ratio parameter b{F) depends only on field and is 


(33.92) 


in terms of which the emission probability factor P{9) is given by 


P(#) = (cos#) 2 “ v exp [#( 1 - sec#)] 


(33.93) 


The notional cosine angle is 


already encountered in Eq. (30.17) and is 


n = 


b+ 3-v 
b + 4-v 


(33.94) 
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The integrated current fraction A(0) makes use of the same approximations encountered in evaluating Eq. (30.15) and is 
A(0) = 1 -exp[-(fc + 4- v)(sec6>- 1)] (33.95) 

Lastly, the launch velocity and rms velocity are best evaluated by assuming that the chemical potential p is large and 
emission is concentrated about k « k p so that 



hkp (T>t 0 bn- ®y 

m \ 3f„2>p + ® / 


where t 0 is familiar from Eq. (13.24) and the discussion following it. 


(33.96) 

(33.97) 


— 
EXAMPLE: Evaluate b, rj, v 0 , and v rms for typical parameters associated with a field emitter, namely, an apex field 
of F = 8 eV/nm and copper-like parameters. 

SOLUTION: The evaluations follow sequentially. With B„® 3/2 = 65.2073 eV/nm and v = 2B 0 Q/3\/<S> = 0.772808 
then 

• b= (65.2073/8) = 8.15091 

• r\ = 0.912112, or 9 n = 24.2° 

• using 3 bt 0 = 25.952, then v 0 = 1.49241 nm/fs 

• = 1.42113 nm/fs. 

_ 


33.4.4.3 Photo emittance 

Lastly, consider the current density J based on the low temperature limit of the Fowler-DuBridge equation of Eq. (14.16). 
The field dependence resides in the Schottky lowered work function 4> = ® - y/4QF. The field- and frequency-dependent 
energy ratio parameter b(F, w) is 

b(F, ffl) = (33.98) 

V4QB 

in terms of which the emission probability factor P{6) is given by 


P{e) = b~ 2 (b- l + Vcos6>) 


(33.99) 


The notional cosine angle is defined by 


The integrated current fraction A (0) is long-winded to evaluate but relies only on simple integrations, and is 
[ 6{b - l) 2 + &(b - l)-\/cos0 + 3 cos#] 


Lastly, the evaluation of v 0 and u rms contains a feature that thermal and field evaluations do not: the photoemitted 
electron has absorbed the photon's energy, and it is the excited energy that matters. The approximations behind the 
Fowler-DuBridge approach are adequate to deal with that complication: the difference between the vacuum level and 
the conduction band minimum is the height of the barrier n + cp, and it is removed from the energy of the photoexcited 
electron by the means of E' = E + ha - + 4>), where the prime indicates the vacuum level energy. The remaining 

integrals are a challenge to evaluate, and so under the approximation that {Tico - <j))/n is a small parameter, the integrands 
in the velocity equations can be expanded to leading order to find 






tico — 4> ( 

tico — cb 

r^v~ \ 

-~L2\ 


(33.102) 


(33.103) 
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where the primes are to remind one that the quantities are defined on the vacuum side, not the material side. For 
expediency, the field dependence of 4> is not shown, but evaluation requires knowing that 4> = ® - y / 4QF is the Schottky 
lowered work function. 


EXAMPLE: Evaluate b, rj, v 0 , and v rms for typical parameters associated with photoemission, namely, F = 75 eV/pm 
and X = 250 nm. 

SOLUTION: The required factors are the Schottky factor QF = 0.328629 eV, ha = 4.95937 eV, and fcw - </> = 
0.787997 eV. Then 

• b = (4.95937 - 4.17137)/(0.328629) = 2.39783 

• r\ = 0.249042, or 9 n = 75.58° 

• from {ha - 4>)/n = 0.112571, then v 0 = 0.301821 nm/fs 

• = 0.302539 nm/fs. 


33.4.4.4 Parameter comparisons 

Each of the emission mechanisms (thermal, field, photo) are employed with characteristic sizes, fields, temperatures, 
and conditions. For the purposes of comparison, each technology should be evaluated under the conditions in which it 
operates. Therefore, consider the following representative values: 

• Thermal: Dispenser cathodes used in high-frequency microwave amplifiers operate space-charge limited using an 
extraction grid that plays the role of the "anode" in Child's law (Eq. (16.18)). For J CL = 2 A/cm 2 and an anode-cathode 
gap of D = 250pm, then VJD = 0.2638 eV/pm [492], A background field of F 0 = 0.333 eV/pm is therefore in excess 
of the space-charge limit but comparable to extraction grid technology. Machining of the cathode surface results in 
grooves and bumps with feature sizes of a few microns [281], Let a = 2 pm. 

• Field: A molybdenum field emitter array used in a 100 WTWT generated 100 mA using 50,000 emitters [483]. Assume 
that 40% of the emitters produce all the current, and that they have a characteristic apex radius of a = 5 nm. Apex 
fields of 8 eV/nm are therefore required, so let F 0 = (8/3) eV/nm. 

• Photo: A multi-alkali antimonide photocathode was used in the first high average power photoinjector [520]. The 
drive laser had a wavelength of 527 nm, and the surface field was 26 MV/m, corresponding to F 0 = 26 eV/pm. Such 
fields are comparable to more recent X-ray FEL fields [676]. Recent metal photocathode characterizations [695] suggest 
a surface roughness equivalent to a = 0.2 pm. 

With these values, the characteristic parameters can be obtained, as in Table 33.3. Additionally, the characteristic inte¬ 
grated current fraction A (0) (Eq. (33.83)) can be compared, as in Figure 33.21. 


Table 33.3 Emittance parameters and values for the evaluation of emittance based on copper. When 
appears in the equation and/or units column, that parameter is not needed for the emittance 
parameters for that emission mechanism. See also Table 33.2. 


Term 

Equation and/or unit 

Thermal 

Field 

Photo 

„ 

[eV] 

- 

7 

7 

<6 

[eV] 

- 

4.5 

4.5 

a 

[nm] 

2000 

5 

200 

F 

3 F 0 [eV/nm] 

0.001 

8 

0.075 

T 

[K] 

1200 

- 

- 

X 

[nm] 

- 

- 

250 

0 

4> - y/4Q(3F 0 ) [eV] 

4.4621 

1.1059 

4.1714 

v a 

33.90, 33.96, 33.102, [nm/fs] 

0.1076 

1.4924 

0.30182 


mvJFl fs] 

1835.4 

3.1820 

68.642 

K 

F 0 a/mvl 

30.381 

3.1587 

72.402 

a 0 

Vl+(7«r/2) 

10.360 

3.4721 

15.950 

b 

33.86, 33.92, 33.98 

0.36696 

8.1509 

2.3978 


33.88,33.94,33.100 

0.11187 

0.91211 

0.24904 

e n 

arccos(»/) [degrees] 

83.577° 

24.201° 

75.579° 

A(«y 

33.89,33.95, 33.101 

90.524% 

66.591% 

82.647% 
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Figure 33.21 Evaluation of A(0) using Eqs (33.89), (33.95), and (33.101), and the parameters of Table 33.3. Dashed vertical labeled lines indicate the 
location of 6 n . Based on Figure 4 of ref. [451]. 


33.4.5 A notional emittance for a hemisphere 

Consider whence each thing is come, and of what it consists, and into what it changes, and what kind of a thing it will be when it has 
changed, and that it will sustain no harm. 

- Marcus Aurelius 26 

If a factor 0(0, cp ) is specified over the surface of the hemisphere, then the calculation of its mean value proceeds 
according to 

sin 6d6 /?* 0(0, cp)P(0, cp)dcp 

(O) = — ---r- (33.104) 

f* sin OdO f 0 x P(0, cp)dcp 

where P(0, cp) is the weighting factor. For the rotationally symmetric hemisphere, field enhancement varies only with 0, 
and so P(0, cp) -* P(0) = J[F(0)]/J[F(0)\, where J is the current density. After experiencing what goes behind the evaluation 
of field enhancement, and the varied forms the canonical equations of emission take, finding (O) is promising to be an 
ordeal. The same was said of the evaluation of emission area in Section 30.2. Then, as now, the consideration of the 
notional quantity served as a convenient stand-in to the quantity itself: in the case of area, although the notional area 
is always smaller than the actual emission area, both scale the same way as the field (or other governing parameter) 
changes. In the same way, the otherwise frightful complexity of the evaluation of (x(t) 2 ) and the other terms of e„ rms 
can be sidestepped by considering instead their notional values and observing that these values serve as lower bound 
estimates, but which will also scale in the same manner as the more accurate representation involving P(0). If 0 n is the 
notional angle, then the notional average quantities will be defined by 

(O) = — f ” sinOdO f O(0,cp)dcp (33.105) 

4^ Jo Jo 

a relation that provides a profoundly easier analytical representation. 

Letting 0(0, cp) = x(t) 2 is a perfect place to start. From Eq. (33.78), x(t) = a( 1 + op) sin 0 cos cp but this can be approx¬ 
imated by x(t) « aop sin 0 cos cp because in general op » 1. Therefore, the corrections due to the launch site on the 
hemisphere being a fraction of a radius a off-axis is largely irrelevant, and so it can be ignored. Inserting the approximate 
form into Eq. (33.105) results in 

(x(0 2 ) = J <r(0) 2 sin 3 OdO J cos 2 cpdcp 

(po.v. t) 2 , 

« g a (2 + >?)(1 -1?) 2 (33.106) 

where the second line follows after using p = cos 0 n and replacing t -* pr. 


26 Marcus Aurelius, The Meditations of Marcus Aurelius, The Harvard Classics: Plato, Epictetus, Marcus Aurelius, ed. Charles W. Eliot, Collector's Edition. 
Danbury, CT: Grolier Enterprises Inc., 1980, p. 289. 
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The next term corresponds to x'(t) 2 = ( vjv z ) 2 , which can use Eq. (33.81). The 6 dependence of a (9) is an impediment 
to analytic integration of what remains. Observe, however, that as 6 increases, so does the denominator, and so making 
the swap a(9) ->• a 0 gives the lower bound estimate to (x'(f) 2 ), not to mention its virtue in enabling an analytic solution 
to the integral. Letting s = cos 9 then 

(33.107) 


where S(p,r() is defined by (with p = p/o 0 ) 


(*'<0 2 > = is( JA) 

S(p,p)= [ . ~ * ds = 2p\n (+ 

Jr, (p + s ) 2 \ri + p) 


(1 — i?)(l — 1 — 2p) 


The cross term (x(f)x'(f)) is the final piece, and the same substitutions yields 

x ' 4 J 0 (j(9)cose + p 4 \<y 0 ) 

where 

R(p,p) = ( 1 -p 2 )ln ^ [(P-4) 2 - (l -P 2 ) 2 ] 


(33.108) 

(33.109) 

(33.110) 


and so, when the velocity of the electrons in the current is non-relativistic, the emittance for a solitary hemispherical 
bump is 

e x > Upo 0 v 0 yj±(\ - >0 2 (2 + p) 2 S(p) - R(r,) 2 (33.111) 

which is satisfying until one actually tries to use it. The two groups of terms in the radical are very closely equal to each 
other so that ferreting out the scaling behavior is not easy, the invariance of pye x is not discernible at all in the large p 
limit, and thoughts (contra Marcus Aurelius) turn unkind. But before indulging dark musings, it is worth reflecting on the 
consequences of such cancelation: that the "direct" term (x 2 ) (x' 2 ) is largely canceled by the "cross" term (xx') 2 is not just 
a peculiarity of hemispheres, as the cancellation likewise occurs for needle-like field emitters modeled by ellipsoids where 
the electron trajectories follow field lines in a prolate spheroidal geometry [661]). The emittance of single protrusions is 
quite small as a result of the cancelation, and therefore the brightness (Eq. (33.31)) of needle-like cathodes, particularly 
field and Schottky emitters [154], is significant even though the transverse velocity off the apex of the emitter can be 
comparatively large when the current is high. There is evidently more to the story. 

A pleasing method to coax out the remnants after the cancellation is by treating the integrals as a kind of distribution 
function in conjunction with another term for which the moments are taken. Consider the following definitions 


(x 2 ) = ifV( W T) 2 [°g(9)de 

4 Jo 

(33.1 

112) 

( x ' 2 ) = ^o^) 2 J Q a mfgmde 

(33.1 

113) 

{■XX') = ] -Na 0 v 0 pr J ° f (9)g {9) do 

(33.1 

114) 

where the functions f(6), g(9) and normalization N are defined by 




(33.1 

115) 

g(0) = |sin 3 0 

(33.1 

116) 

N=±(l-r,) 2 (.2 + r,) 

(33.1 

117) 


where N is so defined that g(9) is normalized to unity, and g{9) takes on the role of a distribution for which / (0) becomes 
the term for which moments are taken. Why do that? Because with (f n ) defined by 


/„ = (f”) = f" U 


(33.118) 
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then use can be made of (in a manner completely synonymous with Eq. (8.65)) 

A/ 2 = (A/ 2 ) = [ ” - (f)) n g(0)d0 

Jo 

= (/ 2 - 2/(0 + ( O 2 ) =/a - 2ft +/ 2 =/ 2 -/ 2 


| (jw,u,r) VA^ (33.120) 

Because A f 2 is well behaved (it is not the difference of two very nearly equal terms), the scaling behavior of emittance 
can be found. To obtain an approximate analytical solution in terms of s = cos 0 and g = cos 0 n , where 6 n is the notional 
angle (Eq. (30.13)) and /(cos 0) is defined by Eq. (33.115) 


m = - 


1 


then because /(l) < /^ < f(p), a middle value s g = cos 0 o exists where f y = f(s 0 ), which in ti 

The evaluation of A f 2 will therefore encounter an integral of the form 

1 (1 - s 2 )(s - s 0 ) : 


A f 2 = 


N[s 0 + (P/(T 0 )\ 


f 1 (1-y 


[s + (p/s 0 )\ 


—ds 


Such an integral undoubtedly has a closed-form solution, but that is not the point: how it depends on (1 - rf) is, and so 
consider approximations that are more or less sufficient to capture the flavor even while introducing a smidgen of error. 
Two approximations that are reasonable for doing so are first, the location of s 0 is between 1 and r\ so approximate it by 
s g « (1 + if)/2 and, second, evaluate the denominator of the integrand at s = s 0 and pull it out, leaving an integral that is 
trivial to evaluate: 

1 (1 -s 2 )(s-s 0 ) 2 ^ (2ri + 3) (1 — r(f 


[s + (p/o 0 )} 2 


—ds a 


15[l+^ + (2p/a„] 2 


(33.124) 


As crude as the approximations are, they capture most of the behavior as seen in Figure 33.22. When put into the 
equation for emittance, and considering the long time limit (p » 1), then 


a C>)V(2+>/)(2 i, + 3)(1 - n? 

yfi r(7 g p 2 v g \pb T(r g v g 

30c[(t o (1 + p) + 2p] 2 120c 


(33.125) 

(33.126) 



Figure 33.22 The values of (p/a 0 )f ( = (p/<? 0 ) and [A/.] 1 / 4 as a function of notional angle (tj = cos 6 n ), evaluated numerically (symbols) and using 

the approximations behind Eq. (33.124) (lines). 
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where the large time (p -*■ oo) limit of C 0 (p) is shown. 27 The factor C 0 (oo) serves as a characteristic emittance coefficient 
for each process of thermal, field, and photoemission. 

Emittance for a hemisphere has several distinct features. First, the emittance depends on the characteristic time con¬ 
stant r = mv 0 /F, which in turn depends on the launch velocity v 0 and the typical field associated with that emission 
mechanism. Second, emittance depends on the behavior of the notional emission angle 6 n through the factor (1 - r/) 3 , 
and that is a strong function of how rapidly emission falls off down the side of the bump. As field emitters have small 
6 n , the emittance of a single field emitter is small and field emitters are therefore quite bright sources, particularly when 
the substantial current density on apex is taken into account. Third, for a hemisphere, the emittance is independent of 
hemisphere radius a. Lastly, the emittance caused by electrons being launched at an angle to the beam direction due to 
surface roughness is governed by v g , whereas the intrinsic one-dimensional emittance due to the emission process itself 
is governed by v rms , and these quantities are comparable. In treatments in the literature, these two differing kinds of 
emittance contributions are sometimes termed '"slope" and "field distortion" or "tilt" effects [677, 692], Treating field 
distortion is generally difficult due to an absence of knowledge about how the field lines behave except in special circum¬ 
stances in which they can be exactly determined in theoretical models (e.g„ the tour deforce by Lau [693]). Additionally, 
how the field away from the hemisphere (or any protrusion) induces further variations complicate how the electrons are 
accelerated, leading to, for example, an c 2 c|d term [696]. An account of the total emittance from a rough cathode surface 
must include them all, and a method to do so is 



where "ID" is the emittance due to transverse energy contributions in the one-dimensional model, "bump"' is due to 
the effects of a transversal launch from a rough (or hemisphere-like) surface, "field" is for field non-uniformity effects 
during acceleration close to the cathode surface, and "■ ■ is for whatever else Nature chooses to bedevil emittance with, 
be it near the cathode or acquired elsewhere during the beam's propagation. 

Knowing that p < 1 for thermal, field and photoemission sources, a comparison of C 0 ( oo) for them using the parameters 
of Table 33.3 results in C 0 ( oo) [nm] = 1.4688, 0.018439, and 1.5771 for thermal, field, and photoemission, respectively. 
If only emission was strictly normal to the surface (it is not), then it would seem that field emission sources are far and 
away the brightest emitters of the bunch because of both C 0 and (1 - //) 3 , and for single emitters, they are. But for most 
electron sources, emission is not from a single bump, but rather (and particularly for field emitters) from an array of sites, 
and that small observation has consequences for the (1 - p) dependence by removing the ability of the cross term (xx 1 ) 2 
to offset (x 2 ) <x' 2 ). 

33.4.6 An array of emitters 

... if their actions be directed according to their particular judgements, and particular appetites, they can expect thereby no defense ...(f)or 
being distracted in opinions concerning the best use and application of their strength, they do not help, but hinder one another _ 

- Thomas Hobbes 28 

Emitter surfaces are not atomically flat. They have structure and variation that can be significant, as images of various 
thermal, field, and photoemitter surfaces show [176, 382, 480, 677-679, 697-704], The cumulative effect of all those 
busybody protrusions or emission sites is to complicate making a good beam. There is nothing simple in how they do so, 
but simple models nevertheless give a feel. 

Realistic thermionic and photoemission surfaces have varying surface feature size and random placement. In contrast, 
arrays of field emitters are made in a triangular array rather than a square lattice so as to increase packing density: for a 
tip-to-tip spacing, or pitch, oil, a square array has (1 emitter)// 2 , whereas a triangular array has (1 emitter)/)/ 2 sin(x/3)] = 
1.1547 emitters /l 2 . Be that as it may, nevertheless model the emitters as sites on a square lattice that are uniformly sized 
with the number of sites being M 2 and the area of the array being L 2 , such that 

L 2 = [(M - l)/] 2 (33.128) 


27 The form of this equation is similar to that of ref. [451], but differs in that here/, = l/[s 0 + (p/o g )] is retained and s 0 ss (1 + p)/2 is not used. 
The crudeness of the estimate s 0 rj (1 + rj)/2 makes Eq. (33.125) differ by about a factor of 2 compared to a more careful treatment (the value 
of s 0 really does matter), but tedious Taylor expansions are avoided. If the point is to recover the (1 - >j) 3 dependence, the price is a bargain. 
28 Thomas Hobbes and C.B. Macpherson, Leviathan. Harmondsworth: Penguin, 1968, "Of Common-wealth", Part n. Chapter 17, p. 224. 
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Lastly, let the trajectories of a single boss be unaffected by its neighbors, 29 which takes l on the order of several hemisphere 
radii a at least. By way of comparison: (z) the microstructure on scandate dispenser cathodes can have feature sizes from 
5 to 20 pm [702], (z'z) pitches of Spindt-type field emitters are a few micrometers [176], (Hi) atomic force microscopy 
images of SLAC's linac coherent light source (LCLS) photocathode [696] have undulations with a period of about 10 pm, 
(zv) roughness parameters assumed in a theoretical treatment [705] took the periodic bumps to be tens of nanometers 
apart on a photocathode, (v) sputter-deposited and solid lead photocathodes [706] have feature sizes on the order of 1 
pm, and (vz) aligned carbon nanotube fibers can be placed 10 pm apart [701], Lastly, assume that the M 2 protrusions are 
uniformly sized and emitting, the latter a condition which can be relaxed afterwards. 

Because knowledge of the emitter's location in an array is now required, consider a redefinition of x(t) to account for 
it [451, 661, 707]: let the location of the emitter in the zth row and;th column be designated by appending the location 
of the center to x(f), and the same with the location y(t), and so 


x(t)^X lJ (t)=x I] +x(i) 

y(0 -* Y ij (t) = y ij +y(t) (33.129) 

Xy = (2/ — M— 1)/ 

y ij = (2i-M-l)l (33.130) 

where, as expected, Xy{t) is independent of the row (z) and Yy(t) is independent of the column (j ). Now, (Xy(t) 2 ) takes 
the place of (x(f) 2 ), where 

M M r 

= 4^211 /< 33 - i3i > 

where d£l /4 n includes all the factors which appear in Eq. 3 3.104 apart from M 2 , which is needed for turning the summa¬ 
tions into averages. For a macroscopic array, then L 2 » (x(f) 2 ), leading to the independence of Xy on;. As a consequence 
of (x^) = 0 (in an infinite array, emitters on one side of the emitter under examination are balanced by emitters on the 
other side) then 

{Xyit ) 2 ) = (x 2 ) + <x(f) 2 ) = \{M 2 - 1) l 2 + <x(f) 2 ) (33.132) 


EXAMPLE: Show /x 2 .^ = (1/3) (M 2 - l) Z 2 as in Eq. (33.132). 
SOLUTION: From the definition of averaging and using Eq. (A1.2), 


= S Z N 2 - 4 * M + !) + ( M + l) 2 ] 


= l 2 (|(M+ 1)(2M+ 1)- (M+ l) 2 ] = ‘-(M 2 - l) 


Convert over to L by ( M 2 - l)/ 2 /3 = (M + 1)L 2 /12(M - 1) and then taking the large M limit to get L 2 / 12. Now I? 
is big, certainly far more so than (x(f) 2 ), which is evaluated over one emission site. Therefore, (x^t) 2 ) is of the size 
L 2 /12. Now, the factor (x'(0 2 ) is not affected by the array and so Eq. (33.107) can be used. That leaves the cross-term 
(Xy(Z)x'(0) = (x(f)x'(Z)), which therefore does not change in size and is therefore negligible compared to L 2 (x'(f) 2 ) /12. 
The important thing is that (x(f) 2 ) cx N whereas (x^(f) 2 ) oc L 2 /12 so that a factor of IV <x (1 - rj) 2 (seen in Eq. (33.112)) 
seems to have gone missing, meaning in short that 

e„ !rms ocd-z?)^ (33.133) 


29 This approximation is not as bad as it sounds: apart from being reasonable for widely spaced bumps, for the extreme case of field emitters on 
a (perfect) regular lattice, apart from shielding, the neighbor emitters are symmetrically placed and have a reduced effect on trajectories. 
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instead of the (1 - rfy > as before: the effect of the array has been to enlarge the emittance due to an emission area factor 
L 2 via a weaker dependence on (1 - ff), an effect particularly consequential for field emission as rj for it is near unity. 


33.5 Emittance and realistic surfaces 


33.5.1 Emittance and undulating surfaces 


... we are now reaching the thermal emittance as the limit to increasing the beam brightness. However, the effective thermal, or “intrinsic ", 
emittance depends on several effects, including the crystallinity, surface roughness, surface impurities, and QE non-uniformity. Thus it is 
very challenging to measure and combine all these phenomena into a complete and useful physical model. 

- David H. Dowell et al. [523] 


Machining or processing of a cathode surface can leave grooves that have a spacing of approximately 10 pm for example 
the dispenser cathode surfaces of refs [281] and [282], but such structure has analogs in the aforementioned LCLS 
photocathode of ref. [696], The impact of grooves and undulations is a recurrent topic of theoretical treatments [693, 
705, 708] and is particularly of interest when limits on emittance are essential [679], 

Consideration of a sinusoidally undulating surface by Dowell et al. [696] assessing several contributions to emit¬ 
tance gives a feel for the relative weight of each to the final emittance. The thermal component (intrinsic e { ) is as 
already encountered in Eq. (33.40). Including the cosine-modulated (tilt e t ) of the surface then amends the thermal 
component by 


_ R I ha — <t> r 

~ 2 V 3 me 2 i 


where the surface height is modeled by z{x) = z 0 cos(foe). The impact of the electron accelerating through the curved fields 
caused by F near the ridges [692] is approximated by 


and so 


_ R 

E f~ 2 



(33.135) 





jckz*F 
Amc 2 


(33.136) 


EXAMPLE: Evaluate e ; - using the values of ref. [696] of A = 255 nm, F = 57.5 eV/pm, and <b = 4.8 eV. Let R = 2 mm. 
SOLUTION: The photon energy is ha = h{2n/X) = 4.8621 eV. The Schottky factor is y/AQF = 0.44X 0.0575 = 
0.28775 eV, so that <f> = 4.5123 eV. Therefore, 


/4.8621 -4.5123 


EXAMPLE: Evaluate the tilt correction (kz 0 ) 2 /2 using the values of ref. [696] of z 0 = 17 nm and k = {n/ 5) pm ' . 
Let R = 2 mm. 

SOLUTION: Straight substitution yields 

i (fe ” ,2 4(^) s = 5 ' 7046><ir! 

making the correction very small so that e t is less than 1% of e ; . 
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Chapter 33 


EXAMPLE: Evaluate the field emittance using the values of ref. [696] given in the previous examples. Find the 
total emittance. Let R = 2 mm. 

SOLUTION: Using kz 0 from before and Fz 0 = 0.9775 eV, then 

. jnx 0.9775X0.010681 , 

e r = (1 mm)-i/-= 0.12668 mm-mrad 

} V 4x511000 

and so 

£ i+t+ f = + e 2 f = 0.49426 mm-mrad 

The thermal emittance e ; dominates. 


33.5.2 Intrinsic emittance and space charge 

. ..beam dynamics in an accelerator can be very sensitive to the details of the 6D distribution at the source. Even if a beam appears to be 
■well behaved in configuration space, distortions in velocity space can reappear as density structures later downstream. Note that this fine 
structure persists for a long time. 

- Rami A. Kishek et al. [666] 

With such an understanding in hand, the linkages of space-charge, halo, and emittance in Figure 33.3 can be bet¬ 
ter understood. The evolution of a bunch of charge represents the interplay between the initial spatial variations or 
non-uniformities of emission at the cathode, the fluctuations of current density induced by space-charge interactions, 
and the spatial and energy spreads caused by emittance. Particle-in-cell codes can capture that interplay for a presumed 
value of emittance at the cathode. Could not, then, the emittance of the cathode be "predicted" if the initial distribution 
can be specified and monitored as it propagates? That, in fact, was the goal of an experiment performed at the Uni¬ 
versity of Maryland's Electron Ring (UMER) designed to understand the waves and collective oscillations carried by a 
space-charge-dominated beam [666]. 

By passing the electron beam through a mask containing five holes, very clear structure was put on the beam in the 
shape of a quincunx (four items in the corners of a square or rectangle with the fifth one in the center). That beam was 
then allowed to propagate a distance and imaged on a phosphor plane, as shown in Figure 33.23. The starting emittance 
of the beam affects its subsequent evolution through its interplay with space-charge forces, and by assuming various 



Figure 33.23 Left, the experimental launching and mixing of five beamlets (top image) over a length of 104 cm to a phosphor screen (bottom image). 
Right, simulations using PIC code WARP with different starting emittances but otherwise modeled after the experimental arrangement. Of the four 
cases considered, only the one marked with a check replicated the experimental behavior, showing the beam preserved details of the density structure 
and the emittance affected the evolution [666]. Images courtesy of R. Kishek (UMD). Right-hand side based on Figure 2 of R.A. Kishek, S. Bernal, C.L. 
Bohn, D. Grote, I. Haber, H. Li, P.G. O'Shea, M. Reiser, and M. Walter, Phys. Plasmas.10, 2016, 2003. 
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starting values of that emittance and comparing the final simulated shape with the experimentally measured one, a 
fair match of the starting emittance was thereby discerned. The experimental/simulation comparison therefore provides 
a rather striking example of the realization of the underlying physics behind many of the simple models behind the 
treatments herein, and serves as fair warning for the consequences of foolishly taking the initial conditions of a beam for 
granted. What is more tantalizing is that such "chaotic mixing" characterizes more than just electron beams: gravitational 
interactions also exhibit a 1 /r 2 force, and so, amazingly, beam experiments such as those at UMER can address hypotheses 
coming from as unexpected directions as galactic dynamics, with a twist: gravity is attractive, so the dynamics of the beam 
interaction have to be played in reverse. 
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PART VI 

Appendices 


It is also reported that Ptolemy once asked Euclid if there was not a shorter road to geometry than through the Elements, and Euclid 
replied that there was no royal road to geometry. 

- Proclus 1 

The practical implementation of formulae requires numerical computation, and what Euclid allegedly opined about 
geometry is true of mathematics in general: there is no Royal Road. 2 There are, however, ways to ease the road one is 
on. These appendices provide assistance to do so in the form of numerical methods, algorithms, and means to evaluate 
functions of particular interest for emission physics. 


1 Proclus, A Commentary on the First Book of Euclids Elements (trans. Glenn R. Morrow). Princeton, NJ: Princeton University Press, 1970, p. 57. 

2 The Royal Road built by Darius I in the 5th century BC allowed rapid travel across a section of the Persian Empire, almost from the Aegean Sea 
to the Persian Gulf. Herodotus (Herodotus, The History of Herodotus (trans. George Rawlinson, Great Books of the Western World, ed. Robert Maynard 
Hutchins, Vol. 6, Herodotus, Thucydides. Chicago: Encyclopaedia Britannica, 1952, Book 8, passage 98, p. 277)) speaks of the unparalleled speed 
of the Persian messengers along the Royal Road, observing "these men will not be hindered from accomplishing at their best speed the distance 
which they have to go, either by snow, or rain, or heat, or by the darkness of night”, from which the inscription on the James Farley Post Office 
(New York City) drew its inspiration and became the creed associated with the US Post Office. 


APPENDIX 1 

Summation, integration, and differentiation 


For all the wisdom of a lifetime, n 


A1.1 Series 

The following series are useful. 

(AM) 

£; 2 = ^N(N+ l)(2N+ 1) (A1.2) 

The following Taylor series are encountered with regularity. Derivatives of each side give further series relations of 




(A1.3) 


e ±x = ^ ( ±x y 

j= 0 

The following series are associated with Riemann's f function: 

j=i ■/ 

” (-iy +1 




A1.2 Integration 

A1.2.1 Series summation 

The trapezoidal approximation [236] to the numerical integration of a function/(x) between x 0 and x,, and perhaps the 
simplest of the methods usually considered, amounts to approximating/(x) « ax + b and finding 

h= J x 1 fOQdx * i(f(x 1 )+/(x 0 ))(x 1 -x Q ) (A1.9) 

a formula often said to be exact for polynomials up to and including degree 1 (a precise way of saying "linear"). 

1 Jim Cregan, Kevin Savigar, Bob Dylan, and Rod Stewart, lyrics to Forever Young, 1988. 
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Appendix 


Extending the integration over N regions bounded by Xj that are equally spaced such that x j+1 - Xj = A for 
0 <j<N results in the composite summation formula to approximate the integral J N by (where the notation f{xf) = fj 
is used) 

/■% N * 

I N = f{x)dx^S N = Aj j f j -^(f 0 +f N ) (A1.10) 

j =0 1 

where the designation S N reinforces that a simple summation of the function f(Xj) is involved. 

Simpson's rule is a modification of the trapezoidal rule by evaluating/(x) in Eq. (A1.9) at the midpoint: a curve that 
passes through three points is a quadratic, so that Simpson's rule is expected to be exact up to degree 2, and is given by 

h = f * fWdx a Q 2 = | (f 2 + 4/i +/ 0 ) (Al.ll) 

In fact, letting /(x) = x“ shows that Eq. (A1.11) is exact for 

|^-=|(2“ + 4) (A1.12) 

that is, for a = 2 and 3: the approximation is therefore good up to degree 3. Using Simpson's rule to generalize Eq. (A1.10) 
results in a summation for which the odd; and even; terms have coefficients of 2 or 1, respectively, and give rise to the 
widely used composite Simpson's rule to estimate I N [236], The series formula using Simpson's rule will be designated 
Q n and is defined by 

Qn=j |-/o+/* + 2 g (2/ 2j+1 +/ 2i ) | (A1.13) 

which evidently is a bit more involved to code. Higher-order polynomials result in inserting more intermediate points 
(e.g„ Bode's rule uses five evaluations and is accurate up to degree 5, but the resulting generalization of Eq. (A1.10) 
results in sums with more complex coefficients). All involve algorithms that are increasingly of greater difficulty than S N . 
The higher-order approaches all achieve their accuracy by including more points in the integration region, as it is known 
that a polynomial of order N requires N + 1 points to specify it [243]. 

It would be convenient to have a summation formula close to the simplicity of Eq. (A1.10) with an accuracy 
closer to the higher degree approximations, but without the alternating coefficients. The method to do this is to 
make use of the derivatives of /(x), and leads to the Euler-MacLauren series method. This would seem to make 
the development of a simple summation formula harder, not easier, because now in addition to evaluations of /(x), 
evaluations of df/dx = /'(x) at x ; - are also required, but this expectation will be shown to be subverted. It is found 
that 

h^M x = |(/i +/ 0 ) + y (f[ -f 0 ) (A1.14) 

h*M 2 = | (f 2 + 2/i +/ 0 ) + y </'-/ 0 ') (A1.15) 

where/(x,) =/■ and df /dx =/'. It is seen that for/(x) = x“, then explicit evaluation of I 2 gives (compare Eq. (A1.12)) 

= 1 + |J(12 - a) (A1.16) 

which is likewise satisfied exactly for a = 2 or 3. For higher values of a, the relative error characteristic of Simpson's rule 
compared to the modified trapezoidal rule of Eq. (A1.15) is shown in Figure Al.l, and so the modified trapezoidal rule 
outperforms Simpson's rule. 

Observing that Eq. (Al.l5) is not dependent on/', the composite modified trapezoidal rule can be created, in which 
the prime terms in Eq. (Al.l5) are opposite in sign. Cancellation thereby occurs, resulting in only the derivatives at the 
integration limits requiring evaluation. Thus, I N is approximated by M N defined by 

M n = S n + y (f' -//) 


(A1.17) 
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Figure A1.1 Trapezoidal approximation applied to integrals. Comparison of the errors associated with the modified trapezoidal approximation of 
Eq. (A1.15) and Simpson's rule of Eq. (At.11) for/(x) = x°. 


and is exact up to degree 3, as shown by explicit evaluation with fix) = x a , yet the formula is no more difficult to evaluate 
than the trapezoidal rule S N augmented by two additional derivative terms, and is therefore simpler to implement than 
Q n and generally more accurate. 

It is worth pointing out that often the trapezoidal rule S N is treated as a lower-order approximation than the composite 
Simpson's rule Q N , but this comes from neglecting the endpoint derivatives in the factor ~ f' N ) of M N : S N is good to 
degree 3 apart from the boundary terms. Two examples are chosen so that = f 2 = 0, and so M. N ->• S n , to compare S N 
to Q n and show that the former is closer to the actual integral. 

• For f{x) = cos 2 (x) for A = n/ 2. Explicit evaluation shows that I 2 = S 2 = n/2 but Q 2 = x/3. Similarly, for A = n/N and 
N = 8, I g = S 8 = n/2. 

• For fix) = [x(l -x)] 2 and let iVA = 1. Explicit evaluation shows that I 2 = 16/15, whereas S 2 = 1 and Q 2 = 4/3. Also, 
I S =I 2 = 1.0667 but S g = 273/256 = 1.0664. 


EXAMPLE: Compare S g , M g , and Q g for A = 3/8, x 0 = 0, and fix) = x 4 to the analytical result I g . 

SOLUTION: Direct integration shows I s = 243/5 = 48.6. Series summation shows that S g = 408483/8192 = 
49.8636, Mg = 398115/8192 = 48.5980, and Q g = 99549/2084 = 48.6079. It is seen that M g is the most accurate 
approximation. 


A1.2.2 Series expansion 

If an integrand can be Taylor expanded in terms of quantities that can be term-by-term integrated, another method 
for the evaluation of integrals is possible. Examples based on series expansions of ln(l +x) and 1/(1 +x), as found in 
Chapter 17, are a case in point. Let a function gix) be such that 


g(x) = J^fj(x) 
j =o 


(A1.18) 


Let G be the integral of gix) and Ffx) = J fix)dx. Explicitly, 

G = [ gix)dx=^(F j (b)-Fj(a)) (A1.19) 

Ja j =0 

which is a seemingly obvious finding. The utility arises if the series involving F n is summable, and especially if Fj at one 
of the limits vanishes. Examples give the flavor. 
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EXAMPLE: Let g{x) = 1/(1 - x) with a = 0 and b<\ . Find G. 

SOLUTION: The Taylor expansion of g(x) reveals fAx) = x j so that Fj(x) = x> +1 /(j + 1). Therefore 

G = I'TZ~ =-X^ = - 1 n (1 -fe) (A1.20) 

J0 1 x j-i J 

where the summation has been re-indexed. 


EXAMPLE: Using the previous example, find G for g(x ) = x 3 /(l — x) with a = 0 and b < 1. 

SOLUTION: Although the integral can be evaluated by repeated integration by parts, the series method is faster. 
From fj(x) = x ;+3 and Fj(x) = x ;+4 /0 + 4) it follows 

G=/"f*=-|^=-ln(l-»)-2^ (A1.21) 

J o 1 X ;=4 J 7=1 J 

where Eq. (A1.20) has been used. 


EXAMPLE: Find the integral of x/(e* - 1) with a = 0 and b -*■ oo. 

SOLUTION: To use the series method, the integrand must be recast in terms of a g(x) that allows for a convergent 
series expansion. Observe that 

The term e~ jx < 1 for the range of integration so a convergent series expansion is straightforward. With fj(x) = xe~’ x 
then Fj(x) = -(1 +jx)e~ ix /j 2 and so 

[ FTZl dx = X l (1 " (1 ( AL23 > 

Jo e 1 7=1 J 

which, in the large b limit, is YfjLij 2 = C(2). 


Observe a lesson from the last example. Consider the integral f* xdx /(1 - e~ x ): as b -* oo the integral diverges. Situations 
arise in which the large, possibly even divergent, part of the integral may cancel the divergent part of another integral, 
leaving finite remainders that are of interest. For the integral in question, if over most of the integration region the 
denominator is well approximated by unity, then remove unity from the integrand. Thus, for b » 1, 

£&*£’*+£7*1*1*+™ < ai - 24 » 

a method that is useful in the development of the Fowler function of Eq. (14.9), albeit with a different integrand. 

Lastly, the choice of (1 —x) and (1 - e~ x ) in the denominator of the example integrals was for convenience: the more 
topical cases (1 + x) and (1 + e~ x ) represents the inclusion of a factor (— iy +1 in/■(*), the consequences of which are left to 
the pleasure of self-discovery. 










Summation, integration, and differentiation 


A1.2.3 Gaussian quadrature 

A smooth function fix) can be expanded in a series of orthogonal polynomials where the locations at which fix) are 
evaluated, Xj, and the weights given to those points, Wj, are to be determined. It is useful to first rescale x in Eq. (A1.9) 
and consider instead the equation 

G = fix)dx * G„ = WjfiXj) (A1.2 5) 

for a given n. There are 2 n unknowns (n apiece for the Wj and xf, and to determine them, in the case of the Legendre 
polynomials (for which the Xj are the zeros) let f(x) be represented by a power series of 2 n terms, or 

(A1.26) 

The values of are arbitrary, so n equations have to be satisfied. The equation associated with each is of the form 

0 = Z W A V = odd ) (A1.27) 

= X ~ even ) (A1.28) 

k= 1 

Consider the lowest order cases n = 2 and n = 3 below (the case n = 1 corresponds to the trapezoidal approximation). 


EXAMPLE: Find the n roots Xj and the weights Wj for n = 2. 

SOLUTION: For; odd and even, Eqs (A1.27) and (A1.28) are concisely represented as (left and right, respectively): 



The top (odd) matrix equation is satisfied if the determinant of the 2x2 matrix vanishes, or x,x 2 (x 2 - xj) = 0, there¬ 
fore x 1 = -x 2 . The top row of the same matrix then demands in, = w 2 . Making these substitutions into the bottom 
(even) matrix shows that 2 w l = 2 and 2x,in, = 2/3, which sets w l = 1 and x, = 1 Observe, therefore, that for 
n = 2, there are two different roots x, = -x 2 = 1 /yfi and one weight in, = w 2 = 1 . 


EXAMPLE: Find the n roots Xj and the weights Wj for n = 3. 

SOLUTION: For; odd, Eqs (A1.27) and (A1.28) are again concisely represented as 3 x 3 matricies (top and bottom, 
respectively): 



'O' 

0 

0 


1 1 1 

xf 4 x? 


2/3 

.2/5. 


Setting the determinant of the odd (top) matrix to 0, it is found x, = 0 and x 2 = —x 3 . The topmost row of the odd (top) 
matrix then entails that w 2 = w 3 . Inserting these values into the bottom (even) matrix results in three equations: 
in, + 2w 2 =2, 2 xjw 2 =2/3, and 2x 2 w 2 =2/5. The ratio of the second and third entail thatx 2 = -y/3/5, which, when 
inserted into the second equation, sets w 2 = 5/9 and lastly w 1 = 8/9. Observe, therefore, that for n = 3 there are 
three different roots, x, = 0, x 2 = -x 3 = \/3/5, and two different weights, w 2 = w 3 = 5/9 and in, = 8/9. 
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Higher-order equations are more involved, although induction holds. First, the (odd) matrix determines that the roots 
come in ± pairs for n even and the unpaired root for n odd is zero, and that the weights are likewise paired. Second, the 
(even) matrix then allows for the determination of the independent roots and weights. By n = 4, the procedure for their 
numerical determination has become non-trivial, and so consulting the authoritative Abramowitz and Stegun tables [103] 
for n > 4 provides roots and weights for higher-order Gaussian quadratures. An algorithm for their calculation is given 
in Press eta/. [236], 

That such induction is valid is reinforced by observing that the Xj are roots of the Legendre polynomials that form the 
basis of the Gaussian quadrature method (others exist depending on the limits of integration). The Legendre polynomial 
P„(x) is defined by the recurrence relationship [103] 

in + 1 )P n+] (x) = (2 n + 1 )xP„(x) - nP^Qc) (A1.29) 

with P 0 (x) = 1 and P l (x) = x. Thus, P 2 (x) = (3x 2 - l)/2 and P 3 (x) = x(5x 2 - 3)/2, from which the roots can readily be 
inferred as given in the examples above, which shows that the ± root characteristic is a consequence of P 2n being a 
polynomial in x 2 and P 2n+ \ the product of x with a polynomial in x 2 . In the case of n = 4, P 4 (x) = (35X 4 - 30x 2 + 3)/8 has 
the roots x, = -x 2 = (2 V30 - 15)/35 and x 3 = -x 4 = (2\/30 + 15)/35. The reduced 2x2 matrix entailed by Eq. (A1.28) 
is then trivially inverted to show w y =w 2 = ( 18 + \/30)/36 = 0.65214515 and w 3 = w 4 = (18 - V30)/36 = 0.34785485. 

As with the composite trapezoidal method, so too can a composite Gaussian quadrature method be used to evaluate 
subregions of a larger integral, and so achieve numerical results of high accuracy for smoothly varying /(x). 


A1.2.4 Sharply peaked integrands 

The Dirac delta function S(x ) operates according to the relation 

f(0) = j f{x)8(x)dx (A1.30) 

Certain functions, such as <5,(x) = ]/(<P + \){e~ x +1), when sharply peaked by comparison to other functions in the inte¬ 
grand, act much like a Dirac delta function, for example Eq. (7.24) in the treatment of electrical conductivity and leading 
to Eq. (7.27). The method of evaluating the slowly varying function where the integrand is maximum so that it can 
be removed from the integral is a generally useful technique, an example being in the treatment of N(n, s ) function in 
Section (17.5). In short, 

f(x m ) » j f(x)S,(x)dx (A1.31) 

where x m is the location of the maximum of the integrand, which is close to but nevertheless different than 0. The 
statement is less profound than it may appear, as it is simply the consequence of Taylor expanding/(x) about x m and only 
keeping the first (constant) term, but at the (sometimes unbearable) cost that x m must be found. Numerical evaluation 
can assess the quality of the approximation, but the test case example here is analytically tractable. 

Consider the case/(x) = e ax for a small and <5;(x) = l/(e* + \)ie~ x + 1), so that the integral in question is 

m = fj {x)Sl(x)dX = f_l {e * + Y){e- x +Y) dX (AL32) 

where the assertion to be shown is then that F,(a) « e ax ™ = f(x m ), where x m (a) is the location of the maximum of the inte¬ 
grand f{x)8[(x). Using the following integral relations, derived from the observation that <5/(x) = —d x (e x + 1) _1 , integration 
by parts, and Reimann's zeta function (Eq. (A2.6)) 

J Hx) = 1 

j x 2 S t (x)dx= y = 2f(2) 

J x%(x)^ = ^ = 42£(4) 

(integrals with odd powers of x are identically zero because <5 ; (x) is an even function) along with a Taylor expansion 
e™ « 1 + ax + (ax) 2 12 show that to leading order for small a, 

Flip) * 1 + \{xa) 2 + j^inaf 


(A1.33) 

(A1.34) 

(A1.35) 


(A1.36) 
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Figure A1.2 Demonstration that the integration of a slowly varying function/(x) = e“ for a small with the comparatively sharp function <5 ; (x) is 
approximately/(x m ), where x m is the location of the maximum of /(x)<5 ( (x). 


Compare this to the approximation of Eq. (A1.31). The maximum of the integrand f(x)S,(x) is found by determining 
where its derivative vanishes. This occurs when 


2a + (i + a)e~*” - (1 - a)e= 0 


(A1.37) 


which has solutions 



(A1.38) 


an analytical form for x m being the motive for the choice oif(x). As Figure A1.2 demonstrates, Fj{a) as f(x m (a)) is a good 
approximation for small a. 

A1.2.5 Monte Carlo integration 

For some integrals of sufficient complexity, or which possess features that make their solution using the usual numerical 
methods foreboding or problematic, the use of randomly generated numbers offers a solution. The ability of random 
numbers or observations to make an estimation to quantities of interest has long been appreciated, perhaps one of the 
more intriguing examples being the Buffon needle problem [106], a captivating example of an "experimental" Monte 
Carlo calulation in which a needle of length L thrown onto a square grid: the needle will cross the grid lines, and the 
expectation value of the number of times n the needle crosses them is ( n ) = 4 L/n. Thus, the value of n can be empirically 
determined. 

First encounters with Monte Carlo, however, are often towards the evaluation of integrals. For low-dimensional inte¬ 
grals, numerical methods surpass Monte Carlo for accuracy, but for multidimensional integrals, or ones with particularly 
intensive integrands, Monte Carlo methods can be preferable [63, 105, 106]. The illustration here, however, will be for 
one-dimensional integrals easily compared to analytical results. 

First review expectation values and random numbers in a treatment quite similar to but simpler than Section (4.1). 
The expectation value of a randomly occurring quantity Xj that is observed N times is 



(A1.39) 


If the values of Xj can take on only one of n values Y k (n being the number of "bins" and k taking values of 1 to n, 
for example), and the number of times that Y k occurs in N events is Np k , with p k being the fraction of the times, the 
expectation can be re-expressed as 


<*> = X y jpj 


(A1.40) 
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For example, using dice throws, if the random event is the value of the die face, then n = 6. In the limit of large N, the 
fraction of throws showing a face kisp k = 1/6 for all k and Y k = k, so that (X) converges to 7/2. 2 Or, say, n could equal 
3, with P\ being the fraction of times the face is 1, p 2 being the fraction of times the face is a 2 or a 3, and p 3 the fraction 
of times it is a 4, 5, or 6. Thus, the expectation value can be evaluated using a variety of distribution functions of the 
random events and they should give the same result. 

Use these ideas to examine integrals. Let the integral of a function 0 < fix) < 1 defined over the range 0 < x < 1 be 
desired: there is no loss of generality for doing so, as x and f(x) can be so normalized, and the explanation becomes easier. 
That is 

T= f{x)dx (A1.41) 

Jo 

Let there also be a normalized probability distribution function p{x) (e.g., uniform, linear, and so on) such that pix) > 0 for 
all 0 < x < 1. Some desirable distributions are defined over all x, but by remapping the integration variable, such as y{x) = 
x/(l -x 2 ) 1 / 2 , then the range of y is from -oo to oo and, for example, the Gaussian distribution is p g \y(x)\ = exp(-y 2 )/\fn. 
The expectation value of some quantity O is then 


(&) = f Q Oix)p{x)dx « £ G{x k )p{x k ) = ^ £ 0, (A1.42) 

If the quantity in question is 0(x) = fix)/pix), then the expectation value is F, but this means that fix)/pix) is to be taken 
as the random event with a probability distribution of pipe). In other words. 


F« 



(A1.43) 


In programming, one random number distribution stands out, and that is the uniformly distributed random number r 
with the associated uniform distribution p u ir) = 1 , so that 


T u 




(A1.44) 


where r ; - is the ;th generated random number. This relation is by far the easiest to implement, but it is not always the 
most accurate. Alternatives are then a question of calculating other useful distributions from the uniform distribution 
(in addition to Section 32.2.2, see refs. [63, 105]). Let the random number h be associated with the distribution pix). 
Then a relation exists 


f Pix) dx = 1 


(A1.45) 


where p u (r) = 1 is used in the second integral. For choices of pix) which are simple and integrable, then hirf) = h } - can be 
found from a generated r y Why would this ever be done? Simply, because the more pix) looks like fix), the more accurate 
Eq. (A1.43) becomes. 

If pix) is not simply integrable, then it may nevertheless be obtained using the acceptance-rejection algorithm of Section 
(32.2.2), which can be contorted until, here, it takes the form of generating two random numbers r and r' and adding 1 
to a running sum if r < fir'). Such an approach is effectively evaluating an integral by bounding it in a unit area, taking 
a scattering of random locations in that area, and only counting up those points that lie below the/(x) curve. The ratio 
of points below/(x) to the total number of points is then the ratio of the area under the curve to the unit area, which is 
the integral. 

For an integrable case, consider the following example, where pix) = (1 + v)x v , where v is an arbitrary power. It is 
straightforward to show fg pix)dx = 1. Therefore, the inversion of Eq. (A1.45) gives 


hir) = 


r l/(V+l) 


(A1.46) 


2 This is not a particularly interesting example: it would be more interesting to use two dice and show the expectation value for the sum of the 
faces is closer to 6, but that is a longer example. The one die case is, however, an almost uniquely simple example, and that is the point. 




Summation, integration, and differentiation 577 



Run Index 


Figure A1.3 Comparison of the uniform distribution (Eq. (At.44)) and the power law distribution (Eq. (A1.47) for p(x) = 3X 2 ) for the estimation of the 
integral of f(x) = ( e“* - 1 - 1) with a = 5 in the code of Section A3.36. Also shown is the acceptance-rejection approach discussed in text and a gray 
line for the analytical result of T = 0.193216. 


Then, defining hfrf) = hj the approximation becomes 


F « 



(A1.47) 


A comparison of the uniform, the power law, and the acceptance rejection methods is demonstrated for the function 
f{x) = (e ax - V)/(e a - 1), for which F = a~ l - (e° - 1) _1 . For large a, this is a rapidly rising function and so choosing p(x) = 
3x 2 is a useful distribution. The results are shown in Figure A 1.3 for a = 5, in which Nj = 64 separate evaluations of F 
are made then sorted from lowest to highest and compared to the analytical result. The spread of the Nj estimates about 
the analytic value gives a feel for the accuracy of the method, and because p(x) mimics the shape of fix), it is seen to be 
the most accurate in general. The algorithm is implemented in Section A3.36. 


A1.3 Differentiation 

A1.3.1 Orthogonal coordinates 

A method to find the gradient operator V and the Laplacian V 2 in other orthogonal coordinate systems, particularly 
cylindrical and spherical, but also prolate spheroidal coordinate systems [60], starting with the Cartesian system, is given. 

A1.3.1.1 Radial coordinates ( r,9,cp ) 

The relation between cartesian (x,y,z) and spherical ( r,9,cp ) coordinates is 


x = r sin 0 cos cp (A1.48) 

y = r sin 9 sin <p (A1.49) 

z = r cos 6 (A1.50) 


The differential changes associated with them can be rendered in matrix notation to relate dr in cartesian coordinates 
( dr c ) to the same quantity in spherical coordinates (df s ) in the form dr c = R(r, 9, q>) ■ dr s . Explicitly, 


(dx\ 

(sin 9 cos cp 

r cos 9 cos cp 

—r sin 9 sin <?A 

(dr\ 

\dy\ = 

sin 9 sin cp 

r cos 9 sin cp 

r sin 9 cos cp 

d9 

UJ 

( cos 9 

—r sin 9 

o ) 

\dcp) 


Finding the spherical differential elements df s in terms of the cartesian df c then involves inverting R(r, 9, q>), the inversion 
of which requires finding its determinant. Although ominous, the inversion can be less painfully accomplished using a 
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cofactor expansion method [91] to show \R(r, 0, tp)\ = rsin 0. Thus, 

( dr\ l sin 0 cos tp sin 8 sin cp cos 6 A l dx\ 

dO = r _1 cos 0costp r^cosOsintp -r _1 sin0 Lm (A1.52) 

dtp) (-r _1 csc#sin<p r^cscOcostp 0 )\dz) 

Finding partials when shifting from one coordinate system to another can be read off the matrix elements. For example, 
d x = (d x r)d r + (d x 0)d g + (d x tp)d rp 

= sin0cos<2> d r + — cosOcostp d„ - - sm<i9 g (A1.53) 

r r sinO v 

where the coefficients of each d term are read off the corresponding row (e.g., the coefficient (d x r) of d r is read off the dr 
row). The same procedure yields 


d, ; = sin 0 sin tpd r + — cos 0 sin tpd„ + — cos cp g 
y r r smO v 

d z = cos 0d r - - sin 0d g 

In cartesian coordinates, the gradient and Laplacian operators are expressed as 

V = xd x + yd y + zd z 
V 2 = d 2 x +d* + d 2 


(A1.54) 

(A1.55) 


(A1.56) 

(A1.57) 


Finding their forms in spherical coordinates is aided with the introduction of the h a , h (l , and h r terms, where a general 
orthogonal coordinate system is specified by (a, 0, y). Define 


K = \l^ a x) 2 + (d a y) 2 + ( d a z) 2 

with analogous equations for h p and h y . A compact formula for the gradient is 

v = ^A + J_A + J_A 

h a da h p dp h y dy 

The Laplacian is a bit more involved, being 



(A1.58) 


(A1.59) 


(A1.60) 


but it is seen that the (h a h p /h y ) fractions cycle through the subscripts, and so the algorithm is easy to remember. 

Spherical coordinates are specified by a = r, 0 = 6 , and y = tp. It is straightforward to show, using Eqs (A1.48)-(A1.50), 
that 

h r = 1, h g = h v = — (A1.61) 


Therefore, 


dr 


. 1 d 

^ r sin 0 dtp 


(A1.62) 


and, with a bit more work, the Laplacian is 


(A1.63) 
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A1.3.1.2 Prolate spheroidal coordinates {rj,v,y) 

The relation between cartesian ( x,y,z ) and the (angular representation) prolate spheroidal (r), o, y) coordinates is 


x = L sinh(t/) sin(u) cos(y) (A1.64) 

y = L sinh(»;) sin(n) sin(y) (A1.6 5) 

Z = L cosh(t/) cos(a) (A1.66) 

where L sets the length scale. The h n factors are 

h ri = L (cosh (ji) 2 - cos (u) 2 ) = L (sinh {if) 2 + sin (a) 2 ) (A1.67) 

h v = L (cosh (ri) 2 - cos (a) 2 ) = L (sinh in) 2 + sin (a) 2 ) (A1.68) 

h y = L sinhft?) sin(a) (A1.69) 


Observe that h n = h v . For rotationally symmetric geometries such as the conical/Spindt-like and ellipsoidal/wire-like 
emitters of Figure 30.15, there is no y dependence, for which p = (x 2 +y 2 ) 1/2 and z as the cylindrical coordinates, and 
(t/, a) as the prolate spheroidal coordinates. The components of the Laplacian of Eq. (A1.60) are then constructed from 


Similarly, the gradient components are 


h n h a h y 

= L 3 (sinh (t/) 2 + sin (t>) 2 ) sinh (t/) 2 sin(o) 

(A1.70) 

' h,h„ \ 

= L sin(o)(sinh(t/)(J 2 + cosh(t?)f/) 

(A1.71) 

' KK \ 

X d ) 

= Lsinh(^)(sin(n)(J 2 + cos(v)d v ) 

(A1.72) 

' KK \ 

.Kv 

— L ( sinh Kf + sin (a) 2 \ d2 
\ sinhft?) sin(a) J 7 

(A1.73) 

e 

V 

.Id .Id .Id 

(A1.74) 


for which the simple substitutions of Eq. (A1.67) and its brethren are straightforward. 


A1.3.2 Finite differences 

The approximation of derivatives of a function y =f{x) given that/(x) is known at a finite number of discrete locations x jt 
or yj = f(Xj) is often desired. Finite difference methods can then be used to find solutions to differential equations [242], 
and are often closely related to methods used in approximation and interpolation [103, 243], A brief and slightly idiosyn¬ 
cratic account is given. The Taylor expansion of a function is well known to be given by (where primes represent 
derivatives with respect to argument) 

f(x) = f{0) + xf{ 0) + i* 2 /"! 0) + ■ ■ ■ 

= 0 ) < al75 > 

Further, let y = f{x) be specified on evenly spaced locations Xj = jAx, where j can take on positive and negative values 3 
so that yj = f{Xj). For notational ease, let 

AxWML*. o)=f (A1.76) 

so that/ 0 = f{x = 0), and so on. Nothing is inherently special about x = 0: the expansion point could be for any x 0 , but 
one can always redefine the coordinate system so that the expansion point is at the origin. Then the Taylor expansion 


3 The points Xj do not have to be evenly spaced, but the resulting formalism is much easier if they are; the extension to non-uniformly spaced 
points is an exercise left to those with the time to consider it. 
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formula of Eq. (A1.75) takes the form of 

y i = % 3 ^r (al77) 

This way of expressing a Taylor expansion makes it suitable for matrix methods. Let | y) and |/) be vectors whose kth entries 
are y k and/*, respectively, and let y be a matrix whose components are specified by M jn =j n /n\. Then the inversion 

\y) = Ml/> -*• l/> = M _1 |y> (A1.78) 

provides what is sought, as the approximations to the derivatives specified by |/) are provided by the equi-spaced points 
specified by |y). Observe that because the length N of both vectors is the same, that length specifies the order IV — 1 of 
the approximation. A few special cases are instructive. 


A1.3.2.1 First-order approximations 

The first-order approximations involve two points and assume that the underlying polynomial that passes through the 
yj is linear (i.e„ is a polynomial of order 1). The y i can be chosen anywhere, but accuracy generally increases if one of 
the points is at; = 0 and Ax is kept small. If the second point is at; = 1 the scheme is termed an upwind differencing scheme 
(UDS), and if;' = -1 then it is a downwind differencing scheme (DDS). The matrix equation for downwind is 



Unpacking the result, the top row asserts f° = f{x = 0) = y 0 , not surprisingly. The second 
the first derivative in the DDS to be (after invoking Eq. (A1.76)) 


(A1.79) 

gives the approximation to 


In exactly the same way, the UDS 


gives 



(A1.80) 


(A1.81) 


(A1.82) 


The accuracy of the approximation can be assessed albeit crudely by applying the algorithm to an (IV + l)th order 
polynomial, or /(x) = (x + l) 2 for the present case, comparing the known result to the approximation 

Z1 =2 ~^° = 2 + Ax (A1.83) 

Ax Ax 

so the approximation is said to be accurate to first order (the second term on the right-hand side is the error of the 
approximation). 


A1.3.2.2 Second-order approximations 

Adding another y ; - increases the accuracy of the interpolating polynomial, as a quadratic function can uniquely pass 
through three points. In the central difference scheme (CDS), the y ; - values after and before y 0 are used to give 



(A1.84) 
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so that the CDS approximations to the first and second derivative are 


(s)„“ =**■-»*> 


(fy) 

W)a 


(A1.85) 

(A1.86) 


Two other schemes exist that are higher-order versions of the UDS and DDS. Both can be found from expanding 



giving, for the first derivative, the approximations 


(A1.87) 


(A1.88) 

(A1.89) 


Observe that the second derivative has the same coefficients as the CDS scheme, the only difference being that the indices 
are shifted up or down by one unit, which is understandable because the scheme is accurate for quadratic polynomials 
(for which the second derivative is a constant). Testing the algorithms onf(x) = (x+ l) 3 gives 



P_ 

Ax 2 


3 ** ^ = 3 + A * 2 

2Ax 


2 Ax 

Vo-tyi +y 2 _ 


(A1.90) 

(A1.91) 

(A1.92) 


and similarly for the downwind version. Therefore, the first derivative is accurate to order Ax 2 and the second derivative 
to order Ax. Observe also that the error associated with the CDS scheme for the first derivative, where the expansion 
point is in the middle, is smaller than for the up and downwind SDS schemes, where the expansion point is on the edge. 
This is a general trend that the interior expansion points are more accurate. 

It is left as an exercise to show that the fourth-order scheme involves solving 



The expansion is a bit more involved to calculate, but doing so gives 


f°) 


12 y 0 

p 

l 

~V2 + 8Ti - 8y_! +y_ 2 

p 

~ 12 

-y 2 +16 y 1 - 30 y 0 + 16y_, -y_ 2 

p 


6 y 2 - 12y, + 12y_ x - 6 y_ 2 

In 


,Uy 2 - 48 y 1 + 72y 0 - 48y_, + 12y_ 2 , 


(A1.93) 


(A1.94) 


from which the finite differencing operators can be read off. The fourth-order CDS was shown, but the method is trivially 
extended to UDS and DDS by simply changing the range of; and inverting the resulting matrix. 
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For example, a test of the algorithm on the polynomial fix) = (x + l) 5 gives, using a compact representation of the 
derivatives (left-hand side) with their finite difference approximations (right-hand side). 


' 1 ' 

5 


1 

5 — 4Ax 4 

20 

fH 

20 

60 


60 + 30Ax 2 

k 120, 


120 , 


A1.4 Numerical solution of an ordinary differential equation 

Numerical methods to solve ordinary differential equations (ODEs), such as Runge-Kutta and others [236], are well 
known and often part of numerical programs. Nevertheless, there is always room at the bottom for a quick and 
dirty method: human cleverness takes time and computer stupidity is remarkably fast, so that when scheming is too 
time-consuming simple recourse to a finer discretization to increase accuracy can be effective. Poisson's equation in 
particular can be cast in a form that requires no memory overhead and is therefore compact. 

Consider, therefore, the numerical solution of the differential equation defined by 

—u(x) = v(x) (A1.96) 

dx n 

where the cases n = 1 are called first order, n = 2 called second order (the only two to be considered here), u(x) is to be 
found and v{x) is known (or at least known on the points Xj =jAx : j e [1, N}). If x is scaled so that the boundaries are at 
0 and 1, that is, u(x = 0) and u(x = 1) are known, then Ax = 1 /(N + 1). In that case, u 0 = u(x 0 = 0) and u N+y = u(x N+] = 1) 
are the boundary conditions. 


A1.4.1 First order 

Using the finite difference scheme of Eq. (A1.85), and doing so for N = 5 to keep the results simple but easily generalizable, 
results in a matrix equation of the form y|w) = 2Ax\v) + \u bc ), or explicitly as 



Note a few peculiar features about the form of this equation. First, the discretization element Ax has been brought to 
the right-hand side. Second, and this is important, u N+l is not a part of the boundary condition vector (the second vector 
on the right-hand side of Eq. (A1.97)): to make it so would be to over-constrain the problem and that in turn would 
cause the solution to fail in that the NxN differencing matrix would not have an inverse. As a consequence, the iVth 
row of the differencing matrix uses a DDS. This is only one choice: another is to let the first row contain a UDS and let 
u n+i specify the boundary condition. 

For smooth functions, such a cheap solution is surprisingly accurate. By way of demonstration, consider two examples. 
For the first, let c(x) = e~ x with boundary conditions u{ 0) = 0, so that the analytical solution is uix) = 1 - e~ x . Then even 
for N = 5 Eq. (A1.97) becomes 


4 -4 4 -3 1" 

4 0 0 0 0 

4 0 4 -3 1 

4 0 4 0 0 


0 


1 


'0.1534' 

0.2822 

0.3922 

0.4843 

0.5633, 


(A1.98) 


which is good to 1 %. Not bad. Observe further that the structure of the inverse differencing matrix has a peculiar repeating 
pattern that suggests a fast solution, although the presence of the DDS corrupts it a bit, thus suggesting the second 
example. 
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Figure A1.4 A comparison of the analytic form of u(x) = 1 +xcos(6x) to its determination from numerically solving d x u(x ) = v(x) using the code of 
Section A3.15.1 for first order (1) and Section A3.15.2 for second order (2). 


The solution to a generalization of Eq. (A1.98) may be found by using Gauss-Jordan elimination on the matrix M 
and simultaneously performing the same operations on the right-hand side vector [91], although it will be easier to deal 
with N even and so let N = 2n. The simplicity of M results in a clean algorithm that is not entirely unexpected. Using the 
notation Uj = the ;th element of | u), the solution is obtained by first performing the increasing; replacements for; from 
2 to n (remember, N = 2 n) 


u 2 «- 2Axv l + u 0 (A1.99) 

M 2j U 2j-2 + ^^ Xv 2j-l (A1.100) 

followed by the decreasing; replacements (again for; = 1 to n) 

u 2n—\ «- u 2n-2 + (3/4)Axc 2 „_, - (l/4)Axc 2 „ (A1.101) 

U 2n+l—2j *- U 2n+3—2j ~ 2 ^ XV 2n+2-2j (A1.102) 

The algorithm, rendered to code in Section A3.15.1 and applied to the trial function u(x) = xcos(6x)+ 1, is shown in 
Figure A 1.4 under the name "Numerical (1)". 

Observe that the algorithm of Eq. (A1.99) has elements and features very reminiscent of the CDS of Eq. (A1.85) and 
a whiff of the trapezoidal integration scheme of Section Al.2.1, but the intriguing partitioning of a downwind versus 
an upwind recurrent pattern demonstrates the differences brought to bear by using a matrix method to solve an ODE; 
observe further that it is the downwind differencing operator that connects the down and up portions of the Eq. (A1.99) 
algorithm. 

The application of finite difference methods to solving ODEs has more art to it than the broad brushstrokes here: there is 
much hidden here that is part of more formal treatments [242], as well as similarities to interpolation and approximation 
methods [103, 243], 


A1.4.2 Second order 

The algorithm for solving Eq. (A1.96) for the case n = 2 results in a matrix equation of the form W|m) = Ax 2 |e) + \ u bc ) or, 
for the case N = 5, explicitly as 



where the ability to use both boundary conditions results in much simpler matrix elements in row N = 5. As with the 
first-order case, the simplicity of M results in an even more elegant algorithm. Using the notation Uj = the;th element of 
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| u), then the solution is obtained by first performing (for; = 2 to N) 


followed by (for; = N - 1 to 1) 


**-< 2 u> + UH>jU 


(A1.104) 


(A1.105) 


followed by | u) -*■ —(1/2)| u). The algorithm, detailed in Section A3.15.2 and applied to the trial function u(x) = xcos(6x) + 
1, is shown in Figure A1.4 under the name "Numerical (2)". Observe that very slight differences are discernible between 
the two numerical methods, as is to be expected, but also note how small N may be to obtain reasonable results compared 
to the analytical values. 



APPENDIX2 

Functions 


Very important functions can be performed very wastefully and often are...It is far easier to cut the function than the waste. 

- John Kenneth Galbraith 1 

A2.1 Trigonometric functions 

Relations between the sine and cosine functions when their arguments are the sum or difference of two angles are 
repeatedly employed: 

sin(# ±<p) = sin 9 cos <p ± cos 9 sin 4> 

cos(#±$) = cos#cos$± sin#sin$ (A2.1) 

When 6 = 4) then from cos 2 9 + sin 2 # = 1, it follows that 

sin(20) = 2 cos 9 sin 9 

cos(20) = 1 - 2sin 2 # (A2.2) 


A2.2 Gamma function 

The gamma function can be defined by the relation T{p + 1) = pT(p) or by Euler's integral formula 

T(p) = J x p ~ l e~ x dx (A2.3) 

For p = n = an integer, it is related to the factorial function by Tin + 1) = n\. For large values of p, Stirling's approximation 
is useful and is the leading part of the relation 


The function has poles at negative integers, but for negative non-integer values, the relation 
np)T(\-p)=^— 


is useful, and shows r(l/2) = \Jh. 

A2.3 Riemann zeta function 

There are many defining equations (both series and integral) that provide a means to numerically evaluate Riemann's 
zeta function. This section shall instead focus on representations useful in calculations involving particles, and they are 

1 John Kenneth Galbraith, The Affluent Society. Boston: Houghton Mifflin, 1998, p. 178. 


Introduction to the Physics of Electron Emission, First Edition. Kevin L. Jensen. 
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given by 



(A2.6) 



(A2.7) 


Using a series expansion of the denominator of the form l/(e* - 1) = <?each term in the series can be integrated 

to give the more familiar series representation of f(p) as 


m = 


(A2.8) 


For integer w, f(2 ?j) is easily evaluated by noting the relationship between it and the Fourier transform of fix) = x 2 
over the interval \-k,jz\, for which the Fourier cosine coefficients are a 0 = 2x 2 /3 and a n = 4(-l)"« -2 . Forx = n, cos(mx) = 
(-1)”, then/(x) = a 0 /2 + a n cos (nx) from which 



(A2.9) 


and so f(2) = n 2 /6. Similar reasoning using/(x) = x 2 " shows f(4) = x 4 /90, f(6) = x 6 /945, and f(8) = x 8 /9450. 

The value of f(3) = 1.2020569 is known as Apery's constant. 

A method for the numerical evaluation of f (p) in MATLAB makes use of the integral call. An example is shown for 
the evaluation of f(3/2) and f(3), with the output compared to the exact values. 

% - 

% Riemann's Zeta function zeta(p) 

% - 

% zeta (3) and zeta (1.5) are calculated and 
% compared to exact values, but the method is valid 
% for p arbitrary, gamma(p) is the Gamma function 
% for which gamma(n+1) = n! and gamma(1/2)=sgrt (pi) 

% zeta30ext and zetal5ext are exact values 


fx = @(x,p) (x/(p-l) ) ./(exp(x) +1.0); 

zeta30ext = 1.20205690315959428539973816151; 
zetal5ext = 2.61237534868548834334856756792; 

P = 3.0; 

zeta30 = integral(@(x)fx(x,p),0.0, Inf)/.. . 

((1.0 - 2.0"(1-p))‘gamma (p)) ; 
zerror30 = 100*(zeta30ext - zeta30)/zeta30ext; 

P = 1-5; 

zetal5 = integral(@(x)fx(x,p),0.0, Inf)/.. . 

((1.0 - 2.0*(1-p))*gamma (p)) ; 
zerrorl5 = 100*(zeta!5ext - zetal5)/zetal5ext; 


The output is -3.6944 x 10 14 and -1.6999 x 10 14 for zerror30 and zerrorl 5, respectively. 
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A2.4 Error function 

The error function Erf(x ) is the integral over a Gaussian, or 


For large argument. 


Erf(x) = 






For small argument, a series expansion is 


Erf(x) = e~* 


“ x 2j+l 

hr (j + \) 


(A2.10) 


(A2.ll) 


(A2.12) 


A2.5 Legendre polynomials 


Consider the Laplacian in spherical coordinates as given in Eq. (A1.60) for problems with rotational symmetry (no <p 
dependence). For the equation W 2 y/(r, 0) = 0, then y/(r, 9) can be represented at 0) = as in the treatment 

of the hydrogen atom of Section 2.2.2. The eigenvalue equation for P z (cosP) is 


1 d ( dP,\ 

— (sinP — ) +/(/+ l)P,(cos 0) = 0 
sin 0 dO \ d9 ) 


Letting x = cos 0, then Rodrigues' formula for the generation of Pfx) is 

(-iy d 1 

2 l l\ dx l 




for which the first few are 

P 0 (x) = 1 
P l (x) = x 

P 2 W = \ox 2 - 1 ) 

P 3 (x) = |(5x 3 - 3x) 

A recursion relation to get to higher Pfx) can be used which makes use of lower / terms, or 
(/ + 1)P Z+1 - (21 + 1 )xP, + lP t _ i = 0 

The normalization is 


(A2.13) 


(A2.15) 

(A2.16) 

(A2.17) 

(A2.18) 


y p„wp,w^= 


2/+1 nJ \ ■ i 

Legendre's differential equation also admits non-polynomial solutions called Legendre polynomials of the second kind, 
and are designated by Q n (x). The first few of them are 




(A2.21) 
(A2.22) 
(A2.23) 

If Q n (x) is encountered with x > 1, as in Section 30.7.2, then the replacement (1 —x) -* (x- 1) in the ln(- ■ ■) function 
is made. 
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A2.6 Airy functions 


An extensive literature on Airy functions is available, the treatment here is after ref. [103], Airy functions are solutions 
to the differential equation 

d 2 y/ -xy/ = 0 (A2.24) 


Two real solutions Ai(±x) and Bi(±x) exist, where x > 0. For positive argument, Bi(x) and Ai(x) are exponentially increas¬ 
ing and decreasing, respectively. For negative argument Ai(-x) and Bi(-x) are oscillatory, as in Figure A2.1 and Table A2.1. 
The combinations of Airy functions highlight the additional asymptotic properties shown in Figure A2.2. From the com¬ 
binations of Zi(c,x) functions (Eqs 18.22 and (18.24)) in the form Zi(c,x) Zi(— c,x) it follows 

GAx)G Ax) 

Zi( 1, x) Zi(— 1, x) = Ai(x)Bi(x) = 1 -1 (A2.2 5) 

4xy/x 

, , |G,(x)| 2 

Zi(i, x) Zi(-z,x) = Ai(-x) 2 + Bi(-x) 2 = ' (A2.26) 

4 icyfx 

where x > 0 and the G functions are of order unity. The Wronskian of the Airy functions is defined by 

Ai(x)d x Bi(x) - Bi{x)d x Ai(x) = (A2.27) 


with the same equation holding for x -* —x. At the origin x = 0 the Airy functions are (where a prime denotes derivative 
with respect to argument) 


Ai(0) = 
Ai'(0) = 
Bi(0) = 
Bi'(0) = 

Series expansions are, for positive argument, 

Ai(x) = 


3 2 / 3 r(2/3) 

1 

3 1 / 3 r (i/3) 
1 

3 1 / 6 r (2/3) " 

31/6 

F(I73) =0 ^ 


= -0.258819 


2 \[jzx y I* 






(A2.28) 

(A2.29) 

(A2.30) 

(A2.31) 

(A2.32) 

(A2.33) 



Figure A2.1 The Airy functions Ai(x) (red) and Bi(x) (blue) (solid lines) with their derivatives Ai'(x) and Bi'(x) (dashed lines). 
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Table A2.1 Table of Airy function values. 


X 

Ai(x) 

Ai'(x) 

Bi(x) 

Bi'(x) 

-4.00 

-0.070266 

-0.790629 

0.392235 

-0.116671 

-3.75 

-0.251613 

-0.632454 

0.317185 

-0.467801 

-3.50 

-0.375534 

-0.343443 

0.168940 

-0.693116 

-3.25 

-0.419013 

-0.002454 

-0.016034 

-0.759759 

-3.00 

-0.378814 

0.314584 

-0.198290 

-0.675611 

-2.75 

-0.268491 

0.551338 

-0.344376 

-0.478387 

-2.50 

-0.112325 

0.678853 

-0.432422 

-0.220420 

-2.25 

0.061599 

0.695016 

-0.453921 

0.045904 

-2.00 

0.227407 

0.618259 

-0.412303 

0.278795 

-1.75 

0.365483 

0.478652 

-0.319503 

0.452495 

-1.50 

0.464257 

0.309187 

-0.191785 

0.557908 

-1.25 

0.520045 

0.139080 

-0.045867 

0.599814 

-1.00 

0.535561 

-0.010161 

0.103997 

0.592376 

-0.75 

0.517773 

-0.125991 

0.247780 

0.554475 

-0.50 

0.475728 

-0.204082 

0.380353 

0.505934 

-0.25 

0.418725 

-0.246389 

0.501400 

0.465151 

0.00 

0.355028 

-0.258819 

0.614927 

0.448288 

0.25 

0.291164 

-0.249062 

0.728747 

0.469861 

0.50 

0.231694 

-0.224911 

0.854277 

0.544573 

0.75 

0.179336 

-0.193175 

1.006931 

0.690300 

1.00 

0.135292 

-0.159147 

1.207424 

0.932436 

1.25 

0.099645 

-0.126487 

1.484388 

1.310203 

1.50 

0.071749 

-0.097382 

1.878942 

1.886212 

1.75 

0.050570 

-0.072854 

2.452271 

2.761582 

2.00 

0.034924 

-0.053090 

3.298095 

4.100682 

2.25 

0.023655 

-0.037759 

4.563206 

6.172558 

2.50 

0.015726 

-0.026251 

6.481661 

9.421423 

2.75 

0.010269 

-0.017864 

9.432379 

14.588170 

3.00 

0.006591 

-0.011913 

14.037329 

22.922215 

3.25 

0.004160 

-0.007793 

21.330905 

36.554851 

3.50 

0.002584 

-0.005004 

33.055507 

59.164320 

3.75 

0.001580 

-0.003158 

52.183238 

97.173147 

4.00 

0.000952 

-0.001959 

83.847071 

161.92668 


where 2 f = 2x 3/2 /3 and the b k are defined by 


b k 


r ( 3 *+f) 

54*r(jk+ i)r(* + 


(A2.34) 


The series expansions for negative argument, as well as the derivatives, can be inferred from the change f -* ~K as a 
consequence of x -*■ —x and the behavior of the Zi(c,x) functions of Eq. (18.24), but as the b k increase as k\ for large k, 
usage of the series requires caution in tunneling calculations in the form of truncating the series [85]. 

Alternately, using Taylor expansions and Eq. (A2.24), the values of Ai(x) and Bi(x) can be found from tabulated values 
Ai (xp = Ai j and so on by 

Ai(x- + S) « (1 + ix<5 2 + -<5 3 + —x?<5 4 W 
1 \ 2 1 6 24 1 ) 1 

+<5(l + ^XjS 2 + y^«5 3 )Ai' (A2.35) 

with the same equation for Ai -* Bi, as well as with x -* —x. As with the series expansions, care with arguments and the 
limitations of the expansion should be kept in mind. 


2 The notation follows ref. [103]: f should not be confused with the zeta function of Section A2.3. 
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A2.7 Lorentzian functions 

Lorentzian functions of the form 

are useful representations of the Dirac delta function in the limit n -* oo, that is, 

Jim J 8 n (x)g(x)dx = J S(x)g(x)dx = g(0) 

This can be demonstrated as follows [400], Consider the following three integrals for a -*■ 
J S n (x)g(x)dx <9 m J S n (x)dx = g m ( i — — arctan(an)^ 
J 8„(x)g(x)dx <9 m J 5 n (.x)dx = g m (| - i arctan(a«)) 
8„{x)g(x)dx = g{ 0)- arctan(an) 




(A2.37) 

, where g m > \g(x)\ for all x: 

(A2.38) 

(A2.39) 

(A2.40) 


For the case that a = as n oo, a -* 0, but arctanfi/n) -»• n/2. But this demonstrates that Eqs (A2.38) and (A2.39) 

both vanish, whereas Eq. (A2.40) survives and equals g( 0). In other words, 8 n (x) mimics a Dirac delta function. 

Alternately, because 8 n {x) is symmetric in x, integrals of odd powers of x with it vanish. If g{x) can be well approximated 
by a Taylor expansion to first order, or g{x) « ^(0) + xg’(O'), over the range where 8 n {x) is significant, then 


j 8 n (x)g(x)dx « J 8 n (x){g(0) + xg'(0)}dx = 


m 


(A2.41) 
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Algorithms 


And you may ask yourself 
Am I right?.. .Am I wrong? 
And you may say to yourself 


A3.1 Permutation algorithm 

An algorithm for generating permutations for n = 6 using the matrix methods of Eq. (4.25) by iteratively populating the 
matrix C is given. 

For each n loop, C is overwritten and enlarged (i.e., goes from C" -1 ->■ C") by (i) inserting C" on the first block-row in 
Eq. (4.25), (ii) filling the upper triangular part of C”, (iii) filling the lower triangular part of C", and finally (tv) replacing 
the diagonals of C " with n. The algorithm is 

Require: C" is matrix of integers with N rows = n\ and N :olumns = n 



Require: On input, C C 2 = 


for n = 3 to 6 do 
tz, = (n - 1)! 

for j = n to 2 do 
for k = 1 to n l do 

C(k,f) = C(k,j- 1) 

end for 
end for 

for i = 2 to n - 1 do 
for; = 1 to n do 
for k = 1 to n t do 

C((i -l)n,+ k,j + 1) = C{k,j + 1) 

end for 
end for 
end for 

for i = 2 to n do 
for; = 1 to i do 
for k = 1 to n l do 

C{(i - l)n, + k,j) = C(k,j + 1) 

end for 
end for 
end for 

for i = 1 to n do 
for k = 1 to n, do 

1 David Byrne, Brian Eno, Chris Frantz, Jerry Harrison, and Tina Weymouth, lyrics to Once in a Lifetime, 1981. 
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C((z - 1 )k, + k, i) = n 

end for 
end for 
end for 

Concatenate the columns and sort the resulting n\x 1 array. 


A3.2 Birthday algorithm 

The function W(k ) defined in Eq. (4.27) for a given n is evaluated exactly for k, and then compared to its approx¬ 
imate form of Eq. (4.28). The approximate form, after being shown to be quite good, uses the method of bisection 
(Section A3.7.1) to find the value of k a for which W(k a , n) = W 0 . 

%■% Birthday Problem 

t Find k such that W(k) = n! / (n-k)! n~k = 1/2 


if exist( 'n','var') 
if exist( 'k','var') 
if exist( 'Wo','var') 
if exist(' iout ',' var') 


1; else n 
1; else k 
1; else Wo 

1; else iout 


365; end 

52; end 
0.5; end 
10; end 


W = @(x,n) exp(-x).* (1 


(x/n) )/(x 


0.5) ; 


Wk = zeros (l,k); Wk(l) = 1; kx = (l:k); 

for j = 2:k; 

Wk(j) = (n-j+1)*Wk(j-1)/n; 


Waprx = W(kx,n); 

plot (kx,Wk,'o',kx,Waprx); 

% Bisection search for N(k) = Wo 
% Require Wk(kmax) < Wo and Wk(kmin) > Wo 
kmax = k;kmin = 1; 
for j = 1:24; 

ka = 0.5*(kmax + kmin); 
za = W(ka,n); 
if za < Wo; 

kmax = ka ; 

kmin = ka; 


if iout > 0; 

Mout = [kx',Wk',Waprx']; 

dlmwrite('BD.txt',Mout,'delimiter',... 

'\t','precision',6); 


end 
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A3.3 Least squares fitting of data 
A3.3.1 Theory 

A best-fit line through data {x^y,} is often needed to identify a parameter, such as the extraction of work function ® 
from thermionic emission data. A method to do so is to minimize the sum of the square of the deviations between the 
experimental data {y,} and the fit /({x,}). This process is known as least squares fitting [709], 

Consider perhaps the most common case, where/(x) = A + Bx. Define the sum of the square of the differences between 
y f and /(x ; ) as /, or 

N 

T-fita-A-Rtf (A3.1) 

The error is minimized by finding A and B such that that d A % = d B % = 0. This results in two equations: 

N 

^(y i -A-Bx t ) = 0 
J j (y i -A-Bx i )x i = 0 

which can be solved for A and B using conventional means. Here it is profitable to reconsider the problem in terms of 
matrix and vector manipulations to harness the full power of computational programming languages that are widely 
available and which are, like FORTRAN or MATLAB, well-suited for dealing with matrices. 

Using a bra-ket notation, let |x) be the column vector of the independent variable, and | y) the column vector of the 
dependent variable. It follows that (x| and {y\ are their transposes (row vectors). Similarly, let |1) be a column vector 
such that all the entries are 1. The following relations then hold 


(A3.2) 
(A3.3) 


J j l=(l\l) = N 

(A3.4) 

£x, = (l|x) = (x|l) = Nx 0 

(A3.5) 

N 


£xf =<x|x) 

(A3.6) 


The requirement of Eqs (A3.2) and (A3.3) can be concisely represented as 

/(1|1) <l|x>\ /a\ = /<l|y>\ 
\<x|l> (x\x)J\bJ y(x|y )J 


(A3.7) 


For a small number of data points N, the evaluation of each matrix element, and the inversion of the 2x2 matrix, is no 
difficulty, but for large N, the summations can become large as well, causing the determinant of the 2x2 matrix to be 
the difference of two large and possibly nearly equal numbers. It is profitable to introduce, therefore, the vector 


i^ = W- |1) ^ = | x)-x 0 | 1) 

in terms of which the equations for A and B are compactly written as 


N(x|x> 


. Qc\y) 

' <*!*> 


(A3.8) 


(A3.9) 
(A3.10) 
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where the factor in the curly brackets in A is seen to be an n x n matrix, and the quantity {x\x) is recognized as proportional 
to the determinant of the 2x2 matrix of Eq. (A3.7) as shown by 

(11 l)(x|x) - (11x)(x11) = JV(x|x) - N 2 x 2 0 

= N((x\ -x„(l|)(|x> -x„|l» 

= JV(x|x> 

where the equality is a consequence of Eq. (A3.5). The forms of Eqs (A3.9) and (A3.10) are ideally suited to programming 
languages that can vector multiply. 

The extension to equations that are exponential rather than linear is as follows. Lety « exp(A + Bx): the only change 
in Eqs (A3.9) and (A3.10) is the replacement |y) ->• |ln(y)) (i.e„ each y t is replaced by ln(y ; )). The methodology can also 
be extended to y = Af(x) + Bg{x) (see ref. [709]). 

A3.3.2 Matrix method for linear fit 

The implementation of Eqs (A3.9) and (A3.10) in MATLAB is given the following routine, which is designed to fit the 
temperature versus current density data to an equation of the form ln(J/T 2 ) « A + B/k B T. The least squares parameter 
B is then an approximation to the negative of the work function, or —<t>. 

% - 

% Routine Least Squares Fit exponential 

% - 

% Finds best A and B for log(y) = A + B*x 
% - 

load T.txt; 
load J.txt; 

T = T' ; J = J' ; 

x = 11604.5./T; y = J./(T.~2); 

n = length(T); 

% Bx = <x | , Kx = \x> , and so on 
BI = ones(l,n); 

KI = ones(n,1); 

Bx = x; 

Kx = X' ; 

Bxb = Bx - mean(x); 

Kxb = Bxb'; 

Bf = log(y); 

Kf = Bf'; 

% Best fit for y = exp (A + B*x) 

A = Bx*(Kx*BI-KI*Bx)*Kf/(n*(Bxb*Kxb)); 

B = (Bxb*Kf)/(Bxb*Kxb); 

% Compare raw data to best fit ya (xa) 

xa = 1900 + 450* (0:100)/100; 

ya = (xa.~2).*exp(A + B*11604.5./xa); 

plot(T,J,'o',xa,ya,'r') 

xlabel('Temperature^[Kelvin]','FontSize',18); 
ylabel('CurrentJ3ensity„[A/cnr2] ' , 'FontSize',18); 
title('Tableau, JOushman','FontSize',18) 
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Table A3.1 7(7) data. Data files T.txt and j.txt containing 
the temperature [K] and current density [A/cm 2 ] of Table I of 
ref. [67], where the emission area is taken to be 0.1825 cm 2 
from a tungsten filament. 


T.txt J.txt 


1 1935.5 

2 1986.5 

3 2036.0 

4 2077.5 

5 2086.5 

6 2102.0 

7 2134.5 

8 2134.5 

9 2158.0 

10 2182.0 

11 2204.0 

12 2231.0 

13 2235.0 

14 2271.5 

15 2280.0 

16 2306.0 


0.00051178 

0.0010811 

0.0021737 

0.0037173 

0.0041951 

0.0051304 

0.0074630 

0.0077753 

0.010422 

0.013907 

0.017912 

0.024137 

0.025205 

0.037671 

0.040515 

0.053655 


A3.3.3 Input data 

The input data required in the MATLAB code is contained in the files T. txt and J. txt, each of which contains a single 
column of numbers. The numbers are given in Table A3.1. 


A3.4 Monty hall algorithm 

The Monty Hall "measurement" of Eq. (8.37) can be applied sequentially (same contestant over N time steps) or thought 
of as the same contest applied to N different contestants all at once. Either way, the running average of the number 
of wins for; games models a progression over time. In MATLAB, an algorithm to implement Eq. (8.37) and record the 
number of wins for the STAY (do not switch curtains when offered) algorithm versus the SWITCH (switch curtains when 
offered) and graphically represented as in Figure 8.2 is given. 

% - 

% Routine Monty 

% - 

% Finds time evolution of running average of wins for 
% the STAY vs. SWITCH strategy in the Monty Hall Pb. 

% - 

N = 1000; 

prize = zeros(1,N); %initialize prize vector 
picks = zeros(1,N); %initialize picks vector 

%Hide the prizes, and let the contestants pick curtains 
for j = 1:N; 

prize (j) = floor(3*rand())+1; 
picks (j) = floor(3*rand())+1; 

end 

%initialize the counters 

igame = (1:N); %game counter (time) 
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istay = zeros(1,N );Unitialize wins for staying vector 
iswch = zeros (1,N )-,%initialize wins for switching vector 
nstay = istay; %running average of istay 

nswch = iswch; %running average of iswch 

for j = 1:N; 

if prize(j) == picks(j); 

%it does not matter what the host does: she only 
%wins if her initial pick was right. 
istay(j) = 1; 

else 

%whatever curtain host shows her, she switches 
% to other one: she only wins if her initial 
% pick was not right. 
iswch(j) = 1; 

end; 

nstay(j) = sum(istay(1:j))/j; %calc running average 
nswch(j) = sum(iswch(1:j))/j; %calc running average 

end; 

nstay = 100*nstay; %Convert to a percent 

nswch = 100*nswch; %Convert to a percent 

plot(igame,nstay,'r',igame,nswch,'b') ; 

fid = fopen('Curtain.txt','w'); 

fprintf (fid, ' %s„\t„%s„\t„%s„\n', 'Game' , . . . 

'Stay', 'Switch'); 

for j = 1:N; 

fprintf(fid,'%12.8f w \t^%12.8f^\t„%12.8f„\n', ... 
igame(j),nstay(j),nswch(j)); 

end 

fclose(fid); 

disp('FINISHED: J3eneration„of_. . . 

~~~~~~~~~~~~~~ outputsile„<Curtain. txt>' ) 


A3.5 Wave function and density algorithm 

Using the notation in Section 9.2.3.1, two calculations are performed. First, the time evolution of a wave packet incident 
on a step function barrier is plotted over time. Second, the time-independent density profile near the step function barrier 
is plotted. The output is collected for various parameters and shown in Figures 9.9 and 9.10. 

%%Program PSIplot 

% Set up and solve the time dependent problem of a 
t wave packet incident on a potential step, and the 
% time independent problem % of the density of 
% electrons near a potential step using coordinates 
% such that Ek = k~2 and k =(pi/4)*(j/N) for 
% simplicity. 

% - 


tt Functions 
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% - 

%wave vector on barrier side 
kappa = @(x,c) sqrt(-c. , '2 + x.*2); 

%reflection probability 

rkfun = @(x,y) (x - y)./(x + y); 

%transmission probability 
tkfun = @(x,y) 2*x./(x + y) ; 

% - 

%%Parameters 

% - 

N = 4 8;Nx = 100;Nt = 24; 

L = 40.0;Lt = 40.0; 

C = (1:N);C = (1.0 - COS(pi*C/20))./C; 

k = 0.25*pi*(1:N)/N; E = k.*2; 
ko = 1.750*max(k); 

% - 

%%Reflection, Transmission, and Wave Vectors 

% - 

kp = kappa(k,ko); 
rk = rkfun(k,kp); 
tk = tkfun(k,kp); 

% - 

%% Figure 1: Time Dependent wave function 

% - 

%position vector 

xx = -L + 1.5*L*((1:Nx)-1)/(Nx-1); 

%time vector 

tt = - Lt + 2*Lt*((l:Nt)-1)/(Nt-1); 

%Initialize Psi Matrix 
Psi = zeros(Nt,Nx); 
for it = 1:Nt; 

for ix = 1:Nx; 

for jn = 1:N; 

if xx(ix) < 0.0; 

Psi(it,ix) = Psi(it,ix) + C(jn)*... 

(exp(li*k(jn)*xx(ix)) + rk(jn)*... 
exp(-li*k(jn)*xx(ix)))... 

*exp(-li*E(jn)*tt(it)); 

Psi(it,ix) = Psi(it,ix) + C(jn)*tk(jn)... 
*exp(li*kp(jn)*xx(ix))*... 
exp(~li*E(jn)*tt(it)); 

end 

end 

end 
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Psi2 = (abs(Psi)).~2; %Density 

figure(1); 

waterfall(xx,tt,Psi2); 

axis tight square; view(0,35);grid off;axis off 

% - 

%% Figure 2: Time-independent density 

% - 

%initialize density 
dens = zeros(l,Nx); 

%Position vector 

xx = -3*pi + 5*pi*((1:Nx)-1)/(Nx-1); 

%mimic supply function 
C = ( (N+l) ~2 - (1 :N) .''2) / (N+l) A 2; 

for ix = 1:Nx; 

for jn = 1:N; 

if xx(ix) < 0.0; 

dens(ix) = dens(ix) + abs(C(jn)*... 

(exp(li*k(jn)*xx(ix)) + ... 
rk(jn) *exp(-li*k(jn) *xx(ix)))) .''2; 

else 

dens(ix) = dens(ix) + ... 

abs(C(jn)*tk(jn)*exp(li*kp(jn)*xx(ix))) . ~2; 

end 

end 

end 

dens = dens/dens(1); %normalize to boundary value 

figure (2) ; 

xzero = [0,0]; yzero = [0, 1.2]; 

plot(xx/pi,dens,xzero,yzero,-r') ; axis([-3 2 0 1.2]); 

A3.6 Hydrogen atom algorithms 

An acceptance-rejection (AR) algorithm is used to judge whether or not an electron is "measured" at a coordinate ( x,y , z) 
for the case y = 0 (i.e., the probability plot is for a slide in the x-z plane). The AR algorithm works by generating a random 
number r n and comparing it to the probability evaluated from the wave function y/ nml (x,y, z): if r n is greater, the electon is 
"not found" and if smaller the electron is "found". Although the position can be chosen randomly, the algorithm instead 
considers the three-dimensional space to be composed of n 3 small volumes and uses the AR technique to determine 
whether or not an electron is in that small volume. Such an approach makes the integration along a coordinate, as in 
Eq. (9.52), more simply represented as a summation over the occupancy of the n cells along that dimension. 

%%Program HydrogenSlice: calculation of probability 
% factors associated with the hydrogen atom but along 
% a plane 

% formula for n = 3, 1=2 hydrogenic wave function 
RY = @(x) (x.“2 - 9*x + 13.5),*exp(-x/3); 

% no need to put in phase exp (i*phi), as that factor 
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t will cancel when the absolute value is taken 

% default variables 

if exist ('n'var') == 1; else n = 1000 ; end 

if exist ('xmax'var') == 1; else xmax = 20 ; end 

x = xmax*(2*(1:n) - n - l)/(n-l); z=x; 

% Create the Psi"2 = rho matrix 
% calculate for y = 0 (in the x-z plane) 
rho = zeros(n,n); %rho = iPsil^ 
for i = 1: n; 

for k = 1: n; 

r = sqrt (x(i)*2 + z(k)~2); 
rho (i, k) = RY (r) ,~2; 

end 

% Scatter plot of probability distribution 
% in the x-z plane 

xt = zeros (l,n~2); zt = xt; 
rho = rho /max(max (rho)); 
nt = 0; 
for i = 1:n; 

for k = 1: n; 

if rand() < rho(i,k); 
nt = nt+1; 
xt (nt) = x (i) ; 
zt (nt) = z (k) ; 

end 

end 

end 

xt = xt(1:nt);zt = zt(1:nt); %trim the vectors 

figure(1); 

plot (xt,zt,'rs','MarkerSize',2); 
axis ( (x(1) x(n) x(l) x(n)]); 
axis square; 

The second code file performs the actual summation over the y coordinate, and therefore is to be compared to 
Eq. (9.52). Whether n = 3, / = 2 or n = 2, l = 1 is considered depends on which formula is commented out. 

%%Program HydrogenPsi: calculation of probability 
% factors associated with the hydrogen atom 

choose the orbital 

if exist ('n','var') == 1; else n = 2 ;end 
if n == 3; 

% formula for n = 3, 1 = 2 hydrogenic wave function 
RY = @(r,z) exp (-r/3) .* (3*z . '2 - r.~2); %dzz orbital 

else 

% formula for n = 2, 1=1 hydrogenic wave function 
RY = @(r,z) exp (-r/3).*z; %pz orbital 
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end 

% no need to put in phase exp (i*phi), as that factor 
% will not remain when the absolute value is taken 
% default variables 

if exist ('Nx'var') == 1; else Nx = 64; end 
if exist ('xmax'var') == 1; else xmax = 15 ; end 

x = xmax* (2* (1:Nx) - Nx - 1)/(Nx-1); y=x; z=y; 

%% Create the Psi'2 = rho matrix 
rho = zeros (Nx,Nx,Nx); %rho = \Psi\~2 
for i = 1:Nx; 

for j = 1:Nx; 

for k = 1:Nx; 

r = sqrt (x(i)*2 + y(j)~2 + z(k)~2); 
rho (i, j , k) = RY(r,z (k)) . ~2; 

end 

% Integrate over x then contour plot dnsx 
rhoxz = zeros (Nx,Nx); 
for j = 1 : Nx; 

for k = 1:Nx; 

for i = 1:Nx; 

rhoxz(k,j) = rhoxz(k,j) + rho(i,j,k); 

rhoxz = rhoxz /max(max (rhoxz)); %Normalization 

figure (2) ; 

contour (x,y,rhoxz,10); axis square; title ('rho(x,z)'); 
grid off; 

%% Scatter plot of probability distribution using 

% acceptance / rejection algorithm 

xt = zeros (l,Nx~3); yt = xt; zt = xt; 

rho = rho/max(max(max(rho))); 

nt = 0; 

for i = 1:Nx; 

for j = 1:Nx; 

for k = 1:Nx; 

if rand () < rho(i,j,k); 
nt = nt+1; 
xt (nt) = x (i) ; 
yt(nt) = y(j); 
zt (nt) = z (k) ; 

end 

end 

end 

end 

xt = xt(l:nt);yt = yt(1:nt); zt = zt(l:nt); 
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figure(3); 

plot3(xt,yt,zt,'ro','MarkerSize',2); 

axis([-xmax xmax -xmax xmax -xmax xmax]);view(40,20) 
axis square;grid on; 


A3.7 Root-finding Methods 

A3.7.1 Method of bisection 

The zeros of the function H n (X), defined in Eq. (15.13), are determined using the method of bisection. Initially X min = 0 
and X max set large enough that H n (X max ) > 0. At each iteration, the midpoint X h value is found and H n (X h ) evaluated. 
Depending on whether the result is greater than or less than zero, the maximum or the minimum X value, respectively, 
is set to the midpoint value. Because X min < X h < X max , as the separation X max - X min shrinks the value of X m is converged 
upon. After X m (n) is found, S/S m is evaluated based onEq. (15.8) and either plotted or output to a datafile Baroody.txt. 
In the code, X is lam and H n (X) of Eq. (15.13) is Hnlam. Output is shown in Figures 15.2 and 15.3. The calculated zeros 
X m (n) are shown in Figure 15.8. 

%% Secondary Emission Baroody Function 

% - 

t Input Terms: 

% n = power of E in R(E) = E'n / A*n 

t lmax = initial guess of how large lam can be 

% lmin = initial guess of how small lam can be 

% N = number of bisections to zoom in on lam 

% xmax = size of plotting region E/Em for output 

% iout = 0 - plot data; 1 - output data 


%%Functions 

% - 

fba = @(x,a,p,n) exp(a.*(x.*n - l).*p.~n); 
dfba = @(x,a,n) (1.0 + a*n*(x.~n - 1.0)).*... 

exp (a . * (x. ~n - 1) ) ; 


%%Parameters 


%- 


if exist('n','var') == 
if exist('lmax'var') 
if exist('lmin'var') 
if exist('N','var') 
if exist('Nx','var') 
if exist('xmax','var') 
if exist('iout','var') 


1; else n = 1.35;end 
== 1; else lmax = 10.0;end 

== 1; else lmin = 0.0;end 

== 1; else N = 16;end 

== 1; else Nx = 101;end 

== 1; else xmax = 6.0;end 

== 1; else iout = 1;end 


%Step 1: Test evaluation: For lam = 2 and p = 1/2, 
t deldelm = 0.8231054 

% - 

P = 0.5; 

deldelm = p.*integral(@(x)fba(x,2.0,p,n),0.0,1.0)./... 

integral (@ (x) fba (x, 2.0,1.0 , n) , 0.0,1.0) ; 

1-Step 2; Find lambda (n) using method of bisection 
% such that Hn(lambda(n)) = 0 

% - 
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for j = 1:N; 

lam = 0.5*(lmax+lmin); 

Hnlam = integral(@(x) dfba(x,lam,n),0.0,1.0); 
if Hnlam >0; t 
lmin = lam; 

else 

lmax = lam; 

end 

end 

%Taylor Correction: y = yO + (x-xO)*(yl-yO)/(xl-xO) 

% Solve for x(y=0) 

Hnlaml = integral(@(x) dfba(x,lmax,n),0.0,1.0); 

HnlamO = integral(@(x) dfba(x,lmin,n),0.0,1.0) ; 
lam = lmin - (HnlamO/(Hnlaml-HnlamO))*(lmax - lmin); 
iCheck: does Hnlam = 0? 

Hnlam = integral(@(x) dfba(x,lam,n),0.0,1.0); 

%Step 3: Plot the function (del / delm) 
pn = (0:Nx-l)*xmax/(Nx-1); 

yn = zeros(l,Nx); 
for j = 1:Nx; 

yn(j) = pn(j).*... 

integral(@(x)fba(x,lam,pn(j),n),0.0,1.0) . /. . . 
integral(@(x)fba(x,lam,1.0,n),0.0,1.0); 

end 

if iout > 0; 

format long 

fid = fopen('Baroody.txt','w'); 

fprintf (fid, ' %s tJ \t„%s t An' , 'p' , ' del/delm' ) ; 

for j = 1:Nx; 

fprintf(fid,'%12.8f„\t„%12.8f„\n' ,pn(j),yn(j)); 

end 

fclose(fid); 
format short 

else 

plot(pn,yn); 

end 

A3.7.2 Method of iteration 

When an equation to be solved can be put into the form of x = f(x) (an example being finding the maximum field which 
satisfies Eq. (17.90)), then iterative methods can under certain conditions allow for a solution. Graphically, the method 
is simple: the solution is the intersection of the two curves jq = x and y 2 = f(x). The method involves taking a reasonable 
estimate of x, calculating/(x), and feeding that solution back into/(x) as its argument ("iterating") until convergence is 
achieved. If one were to plot the iteration points, it would be seen that they bounce between the y, and y 2 lines until 
they converge on the intersection point. 
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Figure A3.1 A graphical representation of the iteration process used to find a solution toe*-x-2 = 0 using iteration of x=f(x) with/(x) = e* - 2. 
Open circles represent /(x^) and black circles represent x y The purple arrow indicates x ,. 


EXAMPLE: Find a root of e* - x - 2 = 0 using iteration. 

SOLUTION: Isolating x, then x = /(x) implies f(x) = e* — 2. Starting with x l = 1 and letting x ]+1 =f(Xj), then x x 
through x 10 are given by (1.0000, 0.7183, 0.0509, -0.9478, -1.6124, -1.8006, -1.8348, -1.8404, -1.8412, -1.8414), 
after which the last significant digit no longer changes. 


A plot of the iterated points of the previous example, shown in Figure A3.1, shows a bouncing between the lines y = x 
and y = f(x), with the points coming progressively closer to the unique point where x = /(x). The iterations appear to be 
"attracted" to the lower root, instead of the upper root near x = 1. The iterative process is not as rapid as, for example, 
Newton's method, where derivatives of the function are used, but then again, in the iterative method, the calculations 
of derivatives are not needed. 

It might be thought that to find the other root, x x should be chosen to the right of it, say atx, = 1.2, but that would be 
catastrophic: /(x,) would be larger, not smaller, and subsequent iterations would grow out of control. It is seen that for 
the iteration process to work, f\x x ) < 1, where the prime indicates derivative with respect to argument. Knowing that, 
a method to find the upper root by iteration rewrites the equation to be solved: isolating the x in e* gives the equation 
x = ln(x+ 2) so that/(x) = ln(x+ 2). Then 10 or so iterations show that x^, « 1.1462. 

So far, only the case of positive /'(x^) has been considered: an interesting behavior for negative /'(x^) occurs. Consider 
the example x= cos(x), with a solution at x^ = 0.7391. The example is interesting in that |/'(x)| < 1, and so iteration 
will converge on x x regardless of the x, chosen. A plot of the iteration sequence, however, shows the iterations spiraling 
towards the solution, as in Figure A3.2. 

A3.7.3 Final correction 

Although bisection is rugged and iteration elegant, both are moderate accuracy approaches. When greater accuracy is 
required, the algorithms must be run longer for convergence to an acceptable error level, but if the functions are smooth, 
short cuts to the final answer are possible. 

An example is the Taylor correction employed in the algorithm of Section A3.7.1. After a number of iterations, the 
maximum and minimum values are so close that H n (2) appears linear for A min < A < A max . For a general function/(x) that 
is approximately linear between x 0 <x <x lr then the Taylor expansion is to leading order 

f(x) « /(x 0 ) + (x - x 0 )/'(x 0 ) (A3.11) 

where the prime on/' indicates derivative with respect to argument. Solving for x. 
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Figure A3.2 A graphical representation of the iteration process used to find a solution to x = cos(x). Open circles represent/fej) and black circles 
represent Xj. The purple arrow indicates x x . 


In numerical treatments, when x, and x 0 are so close that/(x) is essentially linear between them, a more convenient 
approximation is to use finite difference approximations to the derivative to obtain 


x « x 0 - (x, - x 0 ) 


( ™ ) 


(A3.13) 


which has the added advantage of relying only on the evaluation of/(x) rather than its derivative. An iteration or two 
of this correction, with the evaluated x replacing x 0 , hones in on the root nicely. 

An even more rapid honing in on the root is possible if /(x) is approximately no worse than quadratic between x 0 and 
x,. Let x h = (x, + x 0 )/2 be the halfway point, and define 


fo =/(*o) 

(A3.14) 

d, =-3/(x 0 ) + 4/(x,)-/(x 1 ) 

(A3.15) 

d 2 = 4/(x 0 ) - 8f(x h ) + 4f(x y ) 

(A3.16) 


where the forms of d y and d 2 are familiar from upwind/downwind approximations to derivatives and second-order deriva¬ 
tives in finite differencing schemes, with d 1 « 2A/'(x 0 ) and d 2 « 4A 2 /"(x 0 ) with A = x, — x 0 . The quadratic approximation 
to the root is then 


X — X 0 — (Xj — x 0 ) 


d i + y/ d i~ 2 fo d 2 


The recommended approach to iteration, therefore, is to use it to get to the neighborhood of the root, and then, if the 
estimated root is close enough that /(x) can be approximated by a linear or quadratic function in that neighborhood, use 
the Taylor correction method a few times to obtain the final approximation. 


A3.8 Thermal-field algorithm 

The behavior of the current density integrand D(E x )f(E x ), where D is the transmission probability and / is the supply 
function, is investigated using a form of the Gamow factor 9(E X ) that is linear in energy and found using a Taylor expansion 
near E « fi, as in the development of the Fowler-Nordheim equation (see Chapter 13). The algorithm is used to evaluate 
the numerical data for the figures of Section 17.2. The code evaluates 0(E), which is called theta. It is evaluated on a 
mesh grid of equally spaced E x and F evaluation points. After each curve D(E x )f(E x ) is evaluated, the curve is normalized 
to its maximum value, so as to mimic the behavior of Figure 17.4. If the spacing A F is such that not too many curves are 
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evaluated, a second plot in waterfall format is generated. Figures based on this algorithm are shown in Figure 17.7 and 
17.8. Observe that the term "kb" is equivalent to l/k B . A convenient numerical evaluation of the Gamow factor 0(E) and 
P P (E) for the image charge potential may be obtained using Eq. (17.92). 

%% Gadzuk and Plummer-like Normal Energy Distribution 
t using a linearized Gamow Factor Theta(E) 


f- 

%%Functions: 

% 

Supply function and Nordheim functions 
using Forbes & Deane approximation 

fok = @(x,b,m) 

tx = @(x) 

1.0./(1.0 + exp (b.* (x - m) ) ) ; 

1.0 + (1.0/3)*(x.~2).*( log(x) - 3.0); 

1.0 + (1.0/9)*(x.*2) .*(-log(x) + 1.0); 

%%Parameters 


if exist ('Chem','var') == 1; else Chem = 7.0 ; end 
if exist ('Phi','var') == 1; else Phi = 4.5 ; end 


if exist('Temp',' 
if exist('DelE',' 
if exist('DelF ; 
if exist('Emin ; 
if exist('Emax' 
if exist('Fmin' 
if exist('Fmax ; 


var') 
var') 


1; else Temp = 
1; else DelE = 
1; else DelF = 
1; else Emin = 
1; else Emax = 
1; else Fmin = 
1; else Fmax = 


1570. 
0.02 
0.20 
6.00 
10.0 
2.40 


end 

end 

end 

end 

end 

end 


%Fundamental Constants and needed terms 
Q = 0.3599911;m = 5.6856297;hbar = 0.6582119; 
kb = 1.0/11604.5; 


%Create grid over which values of D(E)fo(E) are 
% evaluated as a function of energy Ex and field Fo 
[Ex, Fo] = meshgrid(Emin:DelE:Emax , Fmin:DelF:Fmax); 

%-Energy Slope Factors 
betaT = 1.0/(kb*Temp); 

betaF = (2.0./(hbar*Fo)).*sqrt(2*m*Phi).*... 
tx(sqrt(4*Q*Fo)./Phi); 

%Linearized Gamow Factor 

Ec = Chem + (2*Phi/3).*vx(sqrt(4*Q*Fo)./Phi)./... 

tx(sqrt(4*Q*Fo)./Phi); 
theta = betaF.*(Ec - Ex); 


%Grid values - calculated then normalized to maximum 
dJdE = fok(Ex,betaT,Chem) ./(1.0 + exp(theta)); 
maxJ = max(dJdE'); 

Ny = length(maxJ); 
for j = 1:Ny; 

dJdE(j,:) = dJdE(j,:)/maxJ(j ) ; 

end 
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%Plotting Section (do not make waterfall plots 
t if too many lines) 

figure(1); 

surf(Ex,Fo,dJdE);axis tight square; 
shading flat;view(2) 

xlabel('Energy^[eV] ' , 'FontSize',16); 
ylabel('Fields[eV/nm]','FontSize',16) 

if length(Fo(:,1)) < 50; 

figure (2) ; 

waterfall(Ex,Fo,dJdE);axis tight square; 
grid off; axis off; 

A3.9 Gamow factor algorithm 

An algorithm for the evaluation of the Gamow factor 0(E) for the image charge barrier is given. See Sections 11.2, 11.3, 
and 17.4, and Eq. (17.78). 

tt Evaluation of the Gamow Factor for Image charge 
% using full integral function 
% - 

%%Functions: Supply function and Nordheim functions 

t using Forbes & Deane approximation 

% - 

REx = @(x,a) 0.25*(sin(2*x) ."2) ./sqrt(a+(sin(x) ) ."2) ; 
vy = @(x) 1.0 - (x. '■2) *(1.0 - log(x) /3) ; 
ty = @(x) 1.0 + (x.~2)*(1.0 - log(x))/9; 


%%Parameters 

% - 

if exist(' Chem','var') 
if exist(' Phivar') 
if exist(' Nx','var') 
if exist(' Emin','var') 
if exist(' Fo','var' ) 


1; else Chem 
1; else Phi 
1; else Nx 
1; else Emin 
1; else Fo 


7.0 ; end 
4.5 ; end 
21 ;end 
Chem ; end 
8.0 ;end 


%Fundamental Constants and needed terms 
Q = 0.3599911;m=5.6856297;hbar=0.6582119;kb=11604.5; 


aa = 4*Q*Fo; 

Ex = Emin + (Phi - sqrt(4*Q*Fo))*(0:Nx)/Nx; 

Bo = (4.0/(3*hbar) ) *sqrt (2*m*Phi / '3) ; 

Co = (2.0/hbar)*sqrt(2*m*Phi); 

xx = Chem + Phi - Ex; 

LE = sqrt(xx.~2 - aa)./Fo; 

yE = aa./(2*(xx + sqrt(xx.~2 - aa)).*sqrt(xx."2 - aa)); 
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Nx = length(Ex); 

FREx = zeros(size(Ex)); FNx = FREx; FQx = FREx; 
ys = sqrt(aa)/Phi; 

Thetao = (Bo*vy(ys)/Fo) + (Co*ty(ys)/Fo)*Chem; 

BetaF = Co*ty(ys)/Fo; 

BetaQ = (pi/hbar)*sqrt(2*m)*(Q/Fo~3)*0.25; 
for i = 1:Nx; 

FREx(i) = integral(@(x)REx(x,yE(i)),0,0.5*pi); 
FNx(i) = Thetao - BetaF*Ex(i); 

FQx(i) = BetaQ*(Chem + Phi - sqrt(4*Q*Fo) - Ex(i)); 

end 

FREx = (4*sqrt(2*m)/hbar)*FREx.*sqrt(Fo.*LE.~3) ; 

FREx = real(FREx); 
figure(1); 

plot(Ex,FREx,'o',Ex,FNx,'r',Ex,FQx,'b'); 
axis tight square; 

xlabel('Energy^[eV] ','FontSize',16); 
ylabel('Theta(E)','FontSize',16) 

fid = fopen('ThetaE.txt','w'); 

fprintf (fid, ' %s„\t„%s„\t„%s„\t„%s\n' , ' E„ [eV] ' , . . . 

'Theta(E)','ThetaFN','ThetaQuad'); 
for j = 1:Nx; 

fprintf(fid,... 

'%12.8f^\t„%12.8f^\t^%12. 8f„...\t„%12.8f„\n', . . . 

Ex(j),FREx(j),FNx(j),FQx(j)); 

fclose(fid); 

disp(' FINISHED: jSeneration^of_data„file„<ThetaE.txt>') 


A3.10 Triangular barrier D(E) 

An algorithm for the evaluation of the Gamow factor 0{E) for the triangular barrier (the Q -* 0 of the image charge 
barrier) is given. See Section 18.2.3. 

%% Evaluation of the Transmission probability D(k) 

% for a triangular barrier 

t Output: col 1 = E, col 2 = D(E), col 3 = Step D(E) 

% - 

% Airy Function combination formulae: 

% ABx = Ai~2 + Bi~2; ABp = Ai'~2 + Bi'-2 
ABx = @(x) airy(0,x).~2 + airy(2,x).*2; 

ABp = @(x) airy(l,x).~2 + airy(3,x).'2; 

%Fundamental Constants and needed terms 
m=5.6856297;hbar=0.6582119; 
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tt Parameters 

% - 

if exist(' N' var') == 1; else N =50 ; end 

if exist(' F' var') == 1; else F =6.0 ; end 

if exist('mu'var') == 1; else mu =7.0 ; end 

if exist('Phivar') == 1; else Phi = 4.5 ; end 

if exist('Emax','var ' ) == 1; else Emax = 4.0 ;end 


tt Vectors: Ex is energy relative to mu + Phi; 
t ko = k at barrier maximum 

t kxp and kxm are k above and below ko 

Ex = (1:N)* Emax/N; 

f = 2*m*F/hbar~2;f 1 = f" (1.0/3) ; f2 = fM2.0/3); 

ko = sqrt(2*m*(mu+Phi))/hbar; 

kxp = sqrt(2*m*(mu+Phi+2*Ex))/hbar; 

kxm = sqrt(2*m*(mu+Phi-Ex))/hbar; 

zp = (kxp.*2 - ko~2)/f2; %For above barrier 

zm = (ko , '2 - kxm.~2)/f2; %For below barrier 

Dkp = 4 *fl*kxp./(pi*((kxp.~2) .*ABx(-zp) +... 

f2*ABp(-zp) + (2 * fl*kxp/pi))); 

Dkm = 4*fi*kxm./(pi*((kxm.*2).*ABx( zm) +... 
f2*ABp( zm) + (2 * fl*kxm/pi))); 

kxm = kxm(N:-1:1); %reverse order the kxm 
Dkm = Dkm(N:-1:1); %and reverse order the Dkm 

kx = [kxm,kxp]; Dk = [Dkm,Dkp]; %stitch vectors together 
kappa = sqrt(max(kx .'2 - ko."2,0.0)); 

Dkstep = (4*kx.*kappa./(kx+kappa) . "2); %zero field case 

Ex = ((hbar*kx) .~2) /(2*m); 

plot(Ex,Dk,Ex,Dkstep); 

%% Clear junk and output data 

clear kxm kxp Dkm Dkp f fl f2 ko zp zm m hbar 

Hout = [Ex',Dk', Dkstep']; 
filenam= ['Airy',num2str(F),'.txt']; 

dlmwrite(filenam,Hout,'delimiter','\t','precision',6); 
disp(['FINISHED:„Generation„of^dataffile:_' ,filenam]) 

A3.11 Evaluation of H c (ti) 

An algorithm for the evaluation of H c (u) in Figure 18.8 and Table 18.1 is given. 

%% Evaluation of the He (u) function 
% using TDMA method of solving tridiagonal matrix 
% - 

Uxn = @(x,n,y) 12*(-2*y*(n+1.0) + 3*x.*(x+1.0)); 

Dxn = @(x,n,y) 12*( 2*y*(n+1.0) + 3*x.*(x-1.0)); 
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Cxn = @(x) 5.0 - 72 *x ."2; 

bo = @(x) 0.25*((x+3.0)*(x-2+1.0) +... 

2*sqrt(2)*(x-1.0)*(x*2-1.0)); 

%%Parameters 

% - 

if exist(' N' var') == 1; else N = 25 ;end 

if exist('cvar') == 1; else c = 1.0 ;end 


% Position vector: uj 

t up, center, and down diagonals: vUj, vCj, and vDj 
t Boundary conditions: Hc(0) = bo (c); Hc(l) = Hcone 

t Hcone is related to the Airy functions 
t evaluated at u = 1: they can be evaluated 
t using MATLAB routines as follows 


if c == li; 
Zione = 

elseif c == 
Zione = 
elseif c == 

Zione = 

else 


airy (0, c~2) 
-li; 

airy (0, c~2) 
-1.0; 

airy(0,c~2) 


li*airy(2, 0 * 2 ); 
li*airy(2, 0 * 2 ); 


Zione = airy(2,c"2); 

end 

Hcone = 2*sqrt(pi) *Zione* exp (-2*c/3); 


% set up position and boundary vector and 
% tridiagonal matrix; Hx corresponds to (N+l)*Hc(u) 
% here to make implementation easier. 

% We will rescale u later to fix this 

uj = (1 :N) ; Hx = uj ; 

vDj = Dxn(uj,N,c); 

vCj = Cxn (uj ) ; 

vUj = Uxn (uj , N, c) ; 

vBj = zeros(1,N); 

vBj(1) = -Dxn(1,N,c)*bo(c); 

vBj(N) = -Uxn(N,N,c)*Hcone; 


% Start tridiagonal matrix solution algorithm without 
% make sure vUj is not zero (it never is here) 

% and exploiting fact that vBj=0 except at j=l and N 

% Downward elimination part: 

vUj(1) = vUj(1)/vCj(1); 

vBj (1) = vBj (1) /vCj (1) ; 

for i = 2:N-1; 

temp = vCj(i) - vDj(i)*vUj(i-1); 

vUj(i) = vUj(i)/temp; 

vBj (i) = - vDj (i) * vBj (i-1)/temp; 

end 

vBj (N) = (vBj (N) - vDj (N) *vBj (N-l) ) / . . . 

(vCj (N) - vDj (N) *vUj (N-l) ) ; 


www.ebook3000.com 




Appendix 3 


t Upward part 
Hx (N) = vBj (N) ; 

for i = N-l:-1:1; 

Hx (i) = vBj (i) - vUj (i) *Hx (i + 1) ; 

end 

uj = uj/(N+l); %time to rescale u so 0 < u < 1 

t All finished 

% - 

%% Compare to MATLAB routines to see accuracy 

Hexact = zeros(1,N); 
zj = uj (-2.0/3) ; 
for i = 1:N; 

if c == li; 

Hexact(i) = airy(0,-1.0/uj(i).*■(2.0/3)) .. . 

- li*airy (2 , -1.0/uj (i) .'"(2.0/3)) ; 
elseif c == -li; 

Hexact(i) = airy(0,-1.0/uj(i).*(2.0/3))... 

+ li*airy (2 , -1.0/uj (i) .'"(2.0/3)) ; 

elseif c == -1.0; 

Hexact (i) = airy(0,1.0/uj(i)(2.0/3}); 

else 

Hexact(i) = airy(2,1.0/uj (i) . A (2.0/3)); 

end 

Hexact = Hexact * 2*sqrt(pi)./... 

( (uj .'"■(1.0/6) ) . *exp (2*c . / (3*uj ) ) ) ; 

Hexact = [bo(c),Hexact,Hcone]; tadd on boundary terms 

Hx = [bo(c),Hx,Hcone]; %add on boundary terms 

uj = [0,uj,1.0]; %add on boundary terms 

Hout = [uj',real(Hexact'),imag(Hexact'), ... 

real(Hx'),imag(Hx')]; 

dlmwrite('Hx.txt',Hout,'delimiter','\t','precision',5) 

A3.12 Transfer matrix algorithm 

There are features of the implementation of the transfer matrix algorithm (TMA) evaluation that require particular 
attention: 

• It is assumed that at the left and right boundaries, V(x) = 0. If the right boundary were at a different potential, then 
D(k) -> (k r /k)\t(k)\ 2 , where hk r = y/2m{E - V r and V r is the potential of the right-hand boundary. 

• V(j) corresponds to the potential to the right of x(j). 

• The quantity rm is the ratio of the effective mass of the electron in bulk material with the rest mass, or r = m*/m. 
For example, for GaAs, r = 0.067 but for metals, r = 1.0. Typical semiconductor values are often around 0.2, and such 
values make more visually interesting D{Jc). 

• x(l) is a point prior to the onset of the potential barrier. x(N v ) is the point where the barrier ends. Thus, F(l) = V 
(N v ) = 0.0. 
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• The very odd choice of k max «- 3 AA\6k max {\ : N)/{Nn) (where the notation (1 : N) creates a vector of integers from 1 
to N) is simply to make the values of k not rational because (3.1416/^) « 1 is not rational (cannot be expressed as a 
ratio of integers). As a result, kj would only be 0 by an amazing coincidence: if it were 0, the calculation of S M would 
fail. The prevention of the kj = 0 circumstance is better avoided through the use of an IF-THEN construct to explicitly 
exclude the kj = 0 occurrence, but the simple "cheat" here is easier, results in faster code, and for the most part always 
works. 

• In the code, k (i) refers to the incident momentum values k and is the argument of D(k). ko (j) refers to kj and is 

defined by k 0 (j) = - Vj)/h. The notation could be better, but is not. 

• The output to a file called DkTMA. txt is in the form of two columns spaced by a tab character using MATLAB-specific 
calls. The first column is the values of k. The second column is the values of D(k). 

A3.12.1 Transfer matrix evaluation 

Implemented in MATLAB, the calculation of Eq. (19.3) may be accomplished as follows (where li = ■sf—l = i): 

function [Spqx] = FUNSmat(k,kp,x) 

% Calculation of Transfer matrix component 

Sx = zeros(2); 

Sx(l,l) = (k+kp).*exp( li*(kp-k).*x); 

Sx(l,2) = (k-kp).*exp(-li*(k+kp).*x); 

Sx(2,l) = (k-kp).*exp( li*(k+kp).*x); 

Sx(2,2) = (k+kp).*exp(-li*(kp-k).*x); 

Spqx = Sx./(2*k); 


A3.12.2 Matrix multiplication algorithm 

Once the matrices defined by Eq. (19.3) are available, from which is constructed as in Eq. (19.1), then the multipli¬ 
cation of the matrices in the proper order is needed. Implemented in MATLAB, the procedure may be accomplished as 
follows: 

%% Program QBarrier 

% Transmission probability calculated using a 
% transfer matrix approach with plane waves 

% - 

% Requires: FUNSmat.m 

% Can use: VxRTD, VxQUAD, or equivalent 


%Fundamental Constants and needed terms 
m=5.6856297;hbar=0.6582119; 

%% Parameters 


if exist ('ioutvar') 
if exist ('rm','var') 
if exist ('N','var') 
if exist ('Vx','var') 

Nv = length (Vx)-1; 

else 


1; else iout = 0 ; end 
1; else rm =0.3 ; end 
1; else N = 100 ; end 


% Default case if no Vx, xj available: RTD-like 
% k(j) is the value of kappa to right of x(j) 

% Potential structure beings at x(2) 
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Nv = 4 ; 

Vx = 0.2* [ 0.0, 1.0, 0.0, 1.0, 0.0]; 
xj = [-1.0, 0.0, 2.0, 6.0, 8.0]; 

end 

ko = sqrt (2*rm*m*Vx)/hbar; 

if exist ('kmax','var') == 1; else kmax=2.0*max(ko) ; end 
if exist ('xjvar') == 1; else xj =(0:Nv)*8.0/Nv ;end 

%% Solve for Dk 

% next line puts k on non-rational basis 
k = 3.1416*kmax*(1:N)/(N*pi); 

Dk = zeros (1,N); 
for i = 1:N; 

Sx = [1,0;0,1]; 
for j = Nv:-1:1; 

kj = sqrt (k (i)~2 - ko ( j)~2); 
kjp = sqrt (k (i)~2 - ko (j+1) "2); 

Sx = FUNSmat (kj ,kjp,xj (j+1))*Sx; 

end 

Dk(i) = 1.0./ (abs (Sx(l,1))~2) ; 

if iout > 0; 

Hout = [k',Dk']; 

filenam= ['DkTMA.txt']; 

dlmwrite (filenam,Hout,'delimiter',... 

'\t','precision',6); 

end; 


A3.12.3 Quadratic barrier V(x) algorithm 

The following algorithm, implemented in MATLAB, creates Vj and Xj for use by the algorithms of Sections A3.12.1 and 
A3.12.2 for the case of a quadratic barrier potential profile. Setting N b = 2 results in a single rectangular barrier with 
the same height and width as the N h = 2 case of Section A3.12.4. The vectors x p and V p are for visualization purposes 
(they serve no function in the S„ matrix). 

%% Program VxQuad 

% Potential Barrier associated with Nb-barrier RTDs 


tt Input terms 
if exist('iout','var') 
if exist('L','var') 
if exist('Vo','var' ) = = 
if exist('rm','var' ) = = 
if exist( 'Nb' ,'var') == 


== 1; else iout = 0 
== 1; else L = 5.0 

== 1; else Vo = 0.3 

== 1; else rm = 0.25 

== 1; else Nb = 5 


; end 
; end 

; end 
; end 


%% Set up 

Nb = max(Nb,2); %Avoids Nx < 2 catastropies 
xj = (1:Nb); xj = L*(2*xj - Nb - 1)/(Nb-1); 
Vx = Vo*(1.0 - (xj/L).~2); 

Nv = length(Vx); 
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Plot the potential 
Nv = length (Vx); 
for j = 1 : Nv - 1; 

xp(2*j-1) = xj (j) ; 

Vp(2*j-1) = Vx(j); 

X P (2 * j ) = xj (j+1) ; 

Vp (2*j) = Vx(j) ; 

xp(2*Nv-l) = xj(Nv); 

Vp(2*Nv-l) = Vx(Nv); 

xp (2*Nv) = xj (Nv) - xj (1) ; 

Vp(2*Nv) = Vx(Nv); 

plot (xp,Vp,' -jo' ,'MarkerSize',8); 

hold on; 

plot (xj,Vx,'r„.','MarkerSize',30); 
hold off; 


if iout > 0; 

Hout = [xp',Vp']; 
filenam= ['Q_Vxp.txt']; 
dlmwrite(filenam,Hout,'delimiter','\t', 
'precision',6); 

Hout = [xj',Vx']; 
filenam= ['Q_Vx.txt']; 

dlmwrite(filenam,Hout,'delimiter','\t', 
'precision',6); 
clear Hout filenam 

end 


A3.12.4 Resonant tunneling diode/superlattice V(x) algorithm 

The following algorithm, implemented in MATLAB, creates Vj and Xj for use by the algorithm of Section A3.12.1 for 
the case of a resonant tunneling diode (RTD) potential profile. Setting N b = 2 results in a single rectangular barrier with 
the same height and width as the N b = 2 case of Section A3.12.3. Setting N b = 3 results in the prototypical RTD [183], 
The vectors x p and V p are for visualization purposes (they serve no function in the S„ matrix). 

%% Program VxRTD 

% Potential Barrier associated with Nb-barrier RTDs 


%% Input terms 
if exist('Lbar','var') 
if exist('Lwel','var') 
if exist( 'Vo' ,'var') 
if exist('rm','var') 
if exist('Nb','var') 
if exist(' iout ','var') 


== 1; else Lbar = 5.0 

== 1; else Lwel = 5.0 

== 1; else Vo = 0.3 

== 1; else rm = 0.25 

== 1; else Nb =1 

== 1; else iout = 0 


end 

end 

end 

end 

end 

end 


Set up first Barrier-Well-Barrier 
xj(1) = -Lwel; 

Vx(1) = 0.0; 
xj (2) = 0.0; 
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Vx(2) = Vo; 

xj(3) = xj(2) + Lbar; 

Vx(3) = 0.0; 

Nv = length (Vx); 

%% Add more 

xjo = xj - xj(1); %xjo is unit RTD structure (position) 
Vxo = Vx; %Vxo is the unit RTD structure (potential) 
for j = 2:Nb; 

Nv = length (Vx); 

xj = [xj,xj (Nv) + xjo]; 

Vx = [Vx,Vxo]; 

%% Plot the potential 
Nv = length (Vx); 
for j = 1:Nv-1; 

xp(2*j-1) = xj(j); 

Vp(2*j-1) = Vx(j); 
xp(2*j) = xj(j+1); 

Vp(2*j) = Vx(j); 

xp(2*Nv-l) = xj(Nv); 

Vp(2*Nv-l) = Vx(Nv); 

xp (2*Nv) = xj (Nv) - xj (1) ; 

Vp(2*Nv) = Vx(Nv); 

plot (xp,Vp,'-^o','MarkerSize',8); 

hold on; 

plot (xj,Vx,' r_.' , 'MarkerSize' ,30); 
hold off; 

if iout > 0; 

Hout = [xp',Vp']; 

filenam= ['RTD_Vxp.txt']; 

dlmwrite (filenam,Hout,'delimiter',... 

'\t','precision',6); 

Hout = [xj',Vx']; 

filenam= ['RTD_Vx.txt']; 

dlmwrite (filenam,Hout,'delimiter',... 

'\t','precision',6); 
clear Hout filenam 

end 


A3.12.5 Screened ("Yukawa") coulomb potentials 

The following algorithm, implemented in MATLAB, compares the screened Coulomb potentials for the copper-like and 
silicon-like parameters of Section 21.7. For Cu, k is given by the Thomas-Fermi formula of Eq. (21.85). For Si, k is given 
by the Debye formula of Eq. (21.88). The values of R are twice the covalent radius of the atom, and the lattice is assumed 
to be cubic. 
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Program Vxions 

% Thomas-Fermi and Debye shielded Coulomb Barriers 


%Fundamental Constants and needed terms: units 
% are in eV, nm, fs, and g = 1 

m=5.6856299; hbar=0.65821193; Qo = 0.35999112; 
kb = 1.0/11604.519; ao = 0.052917721; 


% Vxkap = V(x) / (4Q/R) and is dimensionless 
Vxkap = @(x,c) -exp (-c*abs(x))./abs(x); 


if exist( 'Nx','var') 
if exist( 'N','var') 
if exist( 'Vmax','var') 
if exist( 'Vmin','var') 
if exist('Vmin2var') 
if exist( 'xmax','var') 


1; else Nx = 

1; else N = 

1; else Vmax 
1; else Vmin 
1; else Vmin2 = 
1; else xmax 


64 

16 

1.0 

- 8.0 

Vmin 

1.2 


end 

end 

end 

end 

end 

end 


%% kappa factors 


t Thomas Fermi Screening Factor (copper) 

RCu = 2*0.138; ChemCu = 7.0; 
kf = sqrt (2*m*ChemCu)/hbar; 

kapCu = sqrt (4*kf/(pi*ao)); alfcu = kapCu*RCu; 


%Debye Screening factor (silicon) 

RSi = 2*0.111; Ks = 11.9; T = 300; 

Q = (Ks-1.0)*Qo/(Ks + 1.0); 
rho = 1.0el8 /(le7)*3; 

kapSi = sqrt (16*pi*Q*rho/(kb*T)); alfsi = kapSi*RSi; 


%% Single Ion 

x = -xmax + 2*xmax*((1:Nx)-1)/(Nx-1); 


VSio = Vxkap(x,alfsi); 
VCuo = Vxkap(x,alfcu); 


%% cubic array of ions (2N+1)~3 atoms 
VSi = zeros(l,Nx); VCu = zeros(l,Nx); 
for ix = 1:Nx; 

for j = -N:N; 

for k = -N:N; 

for 1 = -N:N; 

rs = sqrt ( (j - x(ix)).' 1 2 + k.'~2 + l.~2); 
VSi(ix) = VSi(ix) + Vxkap(rs,alfsi); 

VCu(ix) = VCu(ix) + Vxkap(rs,alfcu); 






VSi = VSi - max (VSi); 
VCu = VCu - max (VCu); 
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figure(1); plot(x,VSio,x,VCuo); 
axis([x(l) x(Nx) Vmin Vmax]) 
figure(2); plot(x,VSi,x,VCu); 
axis([x(l) x(Nx) Vmin Vmax]) 

Vout = [x',VSio',VSi',VCuo',VCu']; 
dlmwrite('VxionsOut.txt',Vout,'delimiter',... 
'\t','precision',6) 


A3.13 Semiconductors and doping density 

The following algorithm, implemented in MATLAB, finds the temperature dependence of the chemical potential p(T) 
and the carrier concentration (taken to be electrons) p e (T) under the assumption that the acceptor doping concentration 
N a = 0. The algorithm makes use of the approximation of Bq. (22.22). The algorithm is the basis of Figures 22.5, 22.6, 
and 22.8. It relies on a bisection search algorithm to find p(T), knowing that —E g /2 < p(T) < p 0 . 

%% PROGRAM Doping.m 

t Evaluation of the Fermi level for a 
t given donor doping level 
t Units are eV, nm, fs, g = 1 


% Fundamental constants: 

% [kb] = eV/K; [mo] = eV/c'-2; [hbar] = eV-fs 
kb = 1.0/11604.5192; mo = 5.685629853; 
hbar = 0.6582119282; 

Nc = sqrt (2*mo"3)/((pi"2) *(hbar~3) ) ; 

% FD formula 

fxe = @(x) 1.0./(1.0 + 2*exp(x)); 


if exist('Nd','var') 
if exist('Nt','var') 
if exist('Nb','var') 
if exist('Tmin','var') 
if exist('Tmax','var') 
if exist('Eg','var') 
if exist('Ed','var') 
if exist('COmn','var') 
if exist('COmx','var') 


1; else Nd 
1; else Nt 
1; else Nb 
1; else Tmin 
1; else Tmax 
1; else Eg 
1; else Ed 
1; else COmn 
1; else COmx 


l.e-6 

51 

24 

6.0 

1006.0 

1.17 

-0.045 

-Eg/2 

10.5 


end 

end 

end 

end 

end 

end 

end 

end 


T = Tmin + (Tmax - Tmin)*((1:Nt)-1)/(Nt-1); 
Mu = zeros(size(T)); rho = Mu; 

MRX = zeros(Nt,5); 


for j = 1:Nt; 

beta =1.0/(kb*T(j)); 

cNv = 0.5*sqrt(pi)*(Nc/beta~l.5); 

Cmax = COmx; 
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Cmin = COmn; 
for k = 1:Nb; 

Chem = 0.5*(Cmax+Cmin); 

be = beta* Chem; 

bh = -beta* (Chem + Eg); 

rhoe = cNv * exp ( beta*Chem); 

rhoh = cNv * exp(-beta* (Chem+Eg)); 

Ndp = Nd*fxe (beta* (Chem - Ed)); 

if((rhoh - rhoe) + Ndp < 0.0); 

Cmax = Chem; 

else 

Cmin = Chem; 

end 

end 

Mu(j) = 0.5*(Cmax+Cmin); 

rho(j) = cNv * exp(beta*Mu (j))/Nd; 

end 

MRX(:,1) = T; MRX(:,2) = Mu; MRX(:,4) = rho; 


for j = 1:Nt; 

beta = 1.0/ (kb*T (j ) ); 
cNv = (Nc/beta~l.5); 

Cmax = COmx; 

Cmin = COmn; 
for k = 1:Nb; 

Chem = 0.5*(Cmax+Cmin); 

be = beta*Chem; 

bh = -beta*(Chem + Eg); 

rhoe = cNv * FUNfp( beta*Chem); 

rhoh = cNv * FUNfp(-beta*(Chem+Eg)); 

Ndp = Nd*fxe(beta*(Chem - Ed)); 

if((rhoh - rhoe) + Ndp < 0.0); 

Cmax = Chem; 

else 

Cmin = Chem; 

end 

Mu(j) = 0.5*(Cmax+Cmin); 

rho(j) = cNv * exp(beta*Mu(j))/Nd; 

MRX(:,3) = Mu; MRX(:,5) = rho ; 


Edx = Ed*ones(l,Nt) ; Egx = -0.5*Eg*ones(1,Nt) ; 
figure(1);plot(T,Mu,T,Edx,T,Egx); 
xlabel('Temp„[K]','FontSize',18); 
ylabel('\mu„[eV]','FontSize',18); 

Txt = ['log(Nd), num2str(loglO(Nd))] ; 
title(Txt,'FontSize',18); 

Edx = ones(l,Nt); 
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figure(2);plot (T,rho,T,Edx); 
xlabel ('Temp„[K] ','FontSize' ,18) ; 
ylabel ('\rho„/N_d','FontSize',18); 

Txt = [' log (Nd), num2str ( loglO (Nd))]; 
title (Txt,'FontSize',18); 
axis ([0 Tmax 0 3]) 

filnam = ['DopeNd' int2str (-int8 (loglO (Nd))) '.txt'] 

dlmwrite (filnam,MRX,'delimiter', '\t' ,'precision',8) 


A3.14 Band bending: accumulation layer 

The following algorithm, implemented in MATLAB, finds <p(x) in an accumulation layer at the surface of a semiconductor 
exposed to a vacuum field F vac using silicon-like parameters and using the approximation of Eq. (24.32). Quantities such 
as Ff, evaluated in the examples of Section 24.3, are also obtained. Output is shown in the graphs of Figure 24.6. 

%% Evaluation of Phi (x) for an accumulation region 
% Silicon-like parameters for semiconductor surface 
% under a vacuum field 

% - 


%%Functions: Analytic F (phi) 

% - 

Fphi = @(x) sqrt(exp(x) - 1 - x); 


%%Parameters 

% - 

if exist(' Chemo','var') == 1; 
if exist(' Nx','var') == 1; 

if exist ('T','var') == 1; 

if exist(' Ks','var' ) == 1; 

if exist (' Phis ',' var ' ) == 1 ; 

if exist ('idebug','var')== 1; 


else Chemo 
else Nx 
else T 
else Ks 
else Phis 
else idebug = 


-0.20 ;end 


40 

300.0 

11.9 

0.020 

0 




; end 


%Fundamental Constants and needed terms 

Q = 0.3599911;m=5.6856297;hbar=0.6582119;kb=l.0/11604.5; 


Nc = sqrt (2*m~3) / ( (pi~2) * (hbar~3) ) ; 
beta = 1.0/(kb*T); 

Fb = sqrt((8*pi*Q/Ks)*(Nc/beta~2.5) ) ; 

Fo = Fb*sqrt(0.5*sqrt(pi)*exp(beta*Chemo)); 

Phi = Phis*(Nx + 1 - (l:Nx))/Nx; 

u = beta*Phi; 

Fid = Fo*Fphi (u) ; 

Fvac = Ks*Fo*Fphi(beta*Phis); 

rho = Nc*sqrt(pi)*exp(beta*Chemo)/beta"l.5; 

xj = zeros(size(Phi)); 


if Chemo < 0.0; 

Lamda = sqrt (2)/(Fo*beta); 
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else 

Lamda = sqrt (Ks/ (4*sqrt (Chemo)*pi*Q*Nc)); 

end 

for j = 2:Nx; 

xj(j) = xj(j-l) + 2*(Phi(j) - Phi(j-1))/... 

(Fld(j) + Fld(j-l)); 

end 

PhiD = Phis*exp(xj/Lamda); 
plot(xj,Phi,'o',xj,PhiD); 


A3.15 Simple ODE solvers 

A3.15.1 First order 

An algorithm is given for the simple numerical solution of a one-dimensional equation of the form 

d x u(x) = v(x) (A3.18) 

and applied to the trial function u{x) = xcos(6x) + 1. The number of points is N = 2n with a default value of n = 8. The 
algorithm is designed so as not to require memory for the creation of a coefficients matrix y or necessitate its inversion, 
nor require memory space to create the |i>) vector because the | u) is overwritten (see discussion in Section Al.4.1). 
The algorithm was used in Figure A1.4. 

%% Program GradientM 

% demo of a simple ODE solution for du/dx = v(x) 

% Create a vector of length N = 2n 
if exist('n',' var' ) == 1; else n = 8 ;end 

% Sample definition of u(x) and du/dx for a smoothly 
% varying but integrable function 
ux = @(x) x. *cos (6*x)+1.0; 
du = @(x) cos (6*x)-6*x. *sin (6*x); 

dx = 1.0/(2*n+l); 
x = (1:2*n)*dx; 

v = 2*dx*du(x); UO = ux(x); 

% initialize the u vector 
u = zeros (1,2*n); 

% upward solution for even j 
u (2) = v (1) + ux (0) ; 
for j =2:n; 

U (2*j) = u(2*j -2) + v(2*j-1) ; 

end 

% downward solution for odd j 

u(2*n-l) = u(2*n-2) + 0.75*v(2*n-l) - 0.25*v(2*n); 
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for j = 2 : n; 

u(2*n + 1 - 2*j) = u(2*n+3-2*j) - v (2*n+2-2*j ) ; 

end 

t plot output 

plot (x,u, 'r' ,x,uo,' k. ') 

% % Output matrix 
MRX = zeros (2*n,3) ; 

MRX(:,1) = x'; MRX(:,2) = U'; MRX(:,3) = ux(x)'; 
dlmwrite('odel.txt',MRX,'delimiter','\t','precision',8) 


A3.15.2 Second order 

An algorithm is given for the simple numerical solution of a one-dimensional equation of the form 

d*u(x) = v(x) (A3.19) 

such as can be used in the solution of Poisson's equation (Eq. (24.48)). The algorithm is designed so as not to require 
memory for the creation of a coefficients matrix Ml or necessitate its inversion, nor require memory space to create the | v) 
vector as the | u) is overwritten (see discussion in Section Al.4.2). The algorithm is again (as in Section A3.15.1) applied 
to the trial function u(x) = xcos(6x) + 1. As before, the number of points is N = 2n with a default value of n = 8. The 
algorithm was used in Figure A1.4. The two components are: 

• The preparation of the initial vectors is done by Program Poisson 

• The matrix solution for M • u = v is done by FUNode2 

tt Program Poisson 

t demonstration of a simple ODE solution for 
t (d/dx)~2 u(x) = v(x) 

% The boundary conditions are uo = u(0), uN = u(l) 

% where x is normalized to between 0 and 1 

% that is, x(j) = j/(N+l), so that x = 0 

% corresponds to j = 0, and x = 1 corresponds 

% to j = N+l 

% Create a vector of length N 

if exist('n','var') == 1; else N = 16 ;end 

% Sample definition of u(x) and du/dx for a 
% smoothly varying but integrable function 
ux = @(x) x.*cos(6*x)+1.0; 
du = @(x) -12*(3 *x.*cos(6*x) + sin(6*x)); 

% create normalized x vector 
dx = 1.0/(N+l); 
x = (1:N)*dx; 

% create v vector, then call ODE2 solver 
v = (dx~2)*du (x) ; 
u = FUNode2(v,ux(0),ux(1)); 

% plot output, then export output to tab-delimited file 
plot(x,u,'r',x,ux(x),'ko' ) 



Algorithms 


MRX = zeros (N,3); 

MRX(:,1) = x'; MRX(:,2) = u'; MRX(:,3) = ux(x)'; 

dlmwrite( 'ode2.txt',MRX,'delimiter','\t','precision',8) 

function ux = FUNode2(u,uo,uN); 

Solution to the equation (d/dx)~2 u(x) = v(x) 

% On input, u is the v vector 

% On completion, u is overwritten with the solution 

%% Initialization 

N = length (u); 

u(l) = u(l) - uo; % left boundary condition 
u(N) = u(N) - uN; %right boundary condition 

Solution (Matrix inversion) 

% increasing j 
for j =2:N; 

u(j) = j * (2*u (j ) + u(j-l))/(j+l) ; 

end 


% decreasing j 
for j = N-1:-1:1; 
u (j ) = u(j) + 


j*u(j+l) / (j+1) ; 


%% Final factor multiplication and output solution 
ux = -0.5*u; 

An example of the usage of FUNode2 for the solution of Poisson's equation for doping ii 
encountered in Section 24.4 for the generation of Figure 24.14. The code for that figure is: 

%% Program DopePhi 

t finding potential in diamond for constant doping of 
t nitrogen and boron. Boron is throughout; Nitrogen 
% extends only to xn. Width of diamond is L 


a thin diamond film v 


t Create a vector of length N and a subregion of nc 
if exist ('N','var') == 1; else N = 24 ;end 

if exist ('nc','var') == 1; else nc = 6 ; end 

t Set up doping rho_b for boron, and rho_n for nitrogen 
% and let rho_b = nb*rhoO, rho_n = nn*rhoO 
% where rhoO = 10~16 atoms/cm*3 . 
if exist(' nb' , 'var' ) == 1; else nb = 0.01 ;end 

if exist(' nn','var' ) == 1; else nn = 2.0 ;end 

if exist(' R' ,' var ') == 1; else R = 0.03 ;end 


% Create dimensionless s vector (call it x temporarily) 
dx = 1.0/(N+l); 
x = (1:N)*dx; 
sn = (nc + 0.5)*dx; 

% Doping Profiles (signs are explicitly included) 
boron = -nb*ones(1,N); 
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nitro = nn* [ones(l,nc) zeros (l,N-nc)] ; 
v = nitro + boron; 

tt Analytic solution: create each s in region as needed 
s = x(1:nc); 

ya = 0.5*(nn - nb)*s.~2 + (nb - nn*sn - R)*s; 
s = x(nc+1:N); 

yb = -0.5*nb*(s.~2) + (nb - R) *s - 0.5* nn *sn*2; 
s = x; y = [ya, yb]; 

%% Numerical Solution: create v vector, then call 0DE2 
% Observe the usage of the analytic solution to find 
% the right boundary uN for the purposes of comparing 
t to the analytic solution (this would normally just 
t be chosen to a desired value). 
v = (dx~2)*v; 

uo = 0.0; uN = -0.5* nb + (nb - R) - 0.5* nn *sn~2; 
u = FUNode2(v,uo,uN); 

t plot output, then export output to tab-delimited file 
% The numerical solution is u; the analytic is y 
plot (s,y,'r',s,u,'ko'); 

MRX = zeros (N,3); 

MRX(:,1) = x'; MRX(:,2) = u'; MRX(:,3) = y'; 

dlmwrite ('dope.txt',MRX,'delimiter','\t','precision',8) 

clear v ya yb MRX x y 


A3.16 Current through a metal-insulator-metal diode 

The current density through the barrier created by an insulator between two metal boundaries, as schematically shown in 
Figure 24.1, is found through the numerical implementation of Eq. (24.12) in the following algorithm. For convenience, 
a parameter p is introduced and defined by the relation p = Output is written to a file labeled by the bias value. 


%% Program DkMIM 

% Solution of simple rectangular barrier under bias 
% Note: standard theory used f~(1/3). Let p = f^ (1/3) 

%% Needed terms 


if exist ('re','var') 
if exist ('Phib','var') 
if exist ('Vb','var') 
if exist(' L','var') 
if exist(' N','var') 
m=re*5.6856297;hbar=0. 


== 1; else 
== 1; else 
== 1; else 
== 1; else 
== 1; else 
6582119; 


Phib 

Vb 

L 

N 


0.5 

1.0 

3.0 

5.0 

200 


end 

end 

end 

end 


%% Main part of Program 
Field = Vb/L; 

p= (2*m*Field/hbar~2)-(1./3); 
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ko = sqrt(2*m*Phib)/hbar; 
kb = sqrt(2*m*Vb)/hbar; 

if exist('kmax'var') == 1; else kmax = 5.0 ;end 

if exist('kmin'var') == 1; else kmin = 1.0 ;end 

if N > lj 

k = kmin + (kmax-kmin)*((1:N)-1)/(N-l); 

else 

k = kmin; 

end 

%% Begin matrix solution 

% Airy(k, z) : k = 0 - - Ai(z) , k = 1 - - Ai ' (z) 

% k = 2 -- Bi (z) , k = 3 -- Bi'(z) 

Dk = zeros(1,N); 

for ik = 1:N; 

kp = sqrt (k (ik) ~2 + kb~2) ; 

zb = (ko~2 - k(ik)~2 - L*p~3)/p~2; 

zo = (ko-2 - k(ik)-2)/(p-2); 

% right boundary 

Sgl = [1.0, 1.0; li*kp, -li*kp]; 

Trans = [-p*airy(1,zb), -airy(0,zb); ... 

p*airy(3,zb), airy(2,zb)]; 

Sgl = Trans*Sgl; 

% left boundary 

Trans = [airy(2,zo), airy(0,zo); ... 

-p*airy(3,zo), -p*airy(1,zo)]; 

Sgl = Trans*Sgl; 

Trans = [k(ik), -li; k(ik), li]; 

Sgl = (pi/(2*p*k(ik)))* Trans*Sgl; 

Dk(ik) = (kp/k(ik))/abs(Sgl(1,1))"2; 

end 

Dkfn = (4.0*k/ko).* ... 

exp (-(4.0/(3*p*3))* (max (ko , '2 - k. ~2,0.0) . ''1.5) ) ; 
Dkfn = min(l,Dkfn); 

%% Plots and Output 

figure(l); plot(k,Dkfn,k,Dk); 
xlabel('k„[1/nm]','FontSize',16); 
ylabel('D(k)','FontSize',16) 
legend('FN','Dk'); 
figure(2); semilogy(k,Dkfn,k,Dk); 
xlabel ('k„[1/nm] ','FontSize',16); 
ylabel('D(k)','FontSize',16) 
legend('FN','Dk'); 
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Hout = [k',Dk',Dkfn']; 

filenam = ['Dk',num2str(10*Vb),'.txt']; 

dlmwrite(filenam,Hout,'delimiter','\t','precision', 6) ; 


A3.17 Field emission from semiconductors 


The evaluation of field emission from semiconductors requires the determination of the band bending associated with a 
given applied vacuum field, and then its incorporation into a modified Fowler-Nordheim-like equation. The algorithm 
is written in two parts. In the first, the vacuum field at the surface is related to the extent of the band bending factor 
<j). Because F uac is the desired parameter for showing results on a Fowler-Nordheim plot, a method is then given to find 
the maximum 4> max and minimum 4> mm associated with the largest and smallest F max and F min . This can be done quickly 
using the methods of Section A3.7.2 applied to Eq. (24.27) (indeed, the example in that section is quite similar). The 4> 
associated with a particular F can be found by 


! = In |^+ 1 + | 


and analogously for F mjn , where 


K s \f^n^N c Qe^o 
KjW 2 


(A3.21) 


with 4> 0 =1, and; ranging from 1 to approximately 10 (for good convergence). Observe also that the work function 
® is replaced by / - cj), although the code treats ® as / so as to keep the correspondence to the usual implementation 
clear. 

The method works for finding both the maximum and minimum associated 4>. Letting the (j> be uniformly spaced gives 
a range of fields for the subsequent evaluations of J F . The algorithm to set up these evaluations is: 


tt Parameters 

%fundamental Constants in units of eV, fs, nm, q 
rkb = 1.0/11604.50529; 

rhbarc = 197.3269631; rhbar = 0.658211899; 
rmo = 5.685629657; afs = 1.0/137.0359997; 
%end fundamental constants 


gx = 


@(x) sqrt(exp (x) - 1.0 - x); 


if exist ('T' ,'va 
if exist('Phi',' 
if exist('Nf','v 
if exist(' re', 'v 
if exist(' Ks', 'v 
if exist('mu','v 
if exist('Fmax', 
if exist('Fmin', 
if exist(' Nloop' 


:') 

rar' ) == 
ir') == 
ir') == 
ir') == 
ir') == 
var' ) 
var' ) 
'var') 


1; else T 
1; else Phi : 
1; else Nf 
1; else re 
1; else Ks 
1; else mu 


300.0; end 
4.0 ; end 


100 

0.2 




else Fmax = 10.0 ; end 
else Fmin = 1.0 ;end 
; else Nloop = 10 ; end 


%% F(phi) plots 

Nc = sqrt (2*(re*rmo)"3)/(rhbar*(pi*rhbar)"2); 
beta = 1.0/(rkb*T); 

Q = 0,25*afs*rhbarc; 

Cof = sqrt (4*sqrt (pi"3) *Nc*Q*exp(beta*mu)/ ... 
(Ks*beta~2.5)); 
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% Use iteration to find the proper pmax and pmin 

Fo = Ks*Cof; 

pmax = 1.0;pmin = 1.0; 

for j = l:Nloop; 

pmax = log (beta*pmax + 1 + (Fmax/Fo)"2) /beta; 
pmin = log (beta*pmin + 1 + (Fmin/Fo)"2)/beta; 

end 

Fmax = Fo*gx(beta*pmax); 

Fmin = Fo*gx(beta*pmin); 

p = pmin + (pmax-pmin)*(0:Nf)/Nf; 

Fp = Fo*gx(beta*p); 
figure(2); plot(p,Fp); 

xlabel('p [eV]'); ylabel('F(p)„[eV/nm]'); 

%% Standard Fowler Nordheim Plots 
Jf = zeros(size (Fp)); Js = Jf; 
for j = 0:Nf; 

Jf (j+1) = FUNfns(Fp(j+l),4.5,T,1.0,1.elO); 

Js (j+1) = FUNfns(Fp(j+1),Phi-p(j+l),T,re,Ks); 


x = 1.0./ Fp; 

FNf = log(Jf./Fp.~2); 

FNs = log(Js./Fp.~2); 

figure ( 1) ; plot (x, FNf,x,FNs O'); 
xlabel('1/F'); ylabel('In(J/F-2)'); 

Vout = [p',Fp',x',FNf',FNs']; 
dlmwrite('FNsemicon.txt',Vout,'delimiter',... 
'\t','precision',6) 


clean up 

clear rkb rhbarc rhbar rmo afs Acm2 Nc Q Cof j 

The calls to the semiconductor field emission current calculation method are made using the algorithm 

The Fowler-Nordheim Current Density 
t modified by effective mass re and dielectric Ks 
% This function call is self-contained: all 
% constansts and functions 

t required are embedded. 

% Input units: eV fs nm q=l 
% Outut units: Amp/cm~2 
t Arguments: F, Phi, T, re, Ks 

t for re = 1, Ks > 1, gives metal FN equation 

function Jfns = FUNfns(F,Phi,T,re,Ks) 

%fundamental Constants in units of eV, fs, nm, q 
rkb = 1.0/11604.5192; rhbarc = 197.3269719; 
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rhbar = 0.6582119282; rmo = 5.685629853; 
afs = 1.0/137.03600; Acm2 = 1.602176565el0; 

%end fundamental constants 

% Semiconductor-modified FN-like terms 
eta = (Ks - 1.0)/(Ks + 1.0); 

Q = 0,25*afs*rhbarc; 
ys = sqrt(4*eta*Q*F)/Phi; 

% Rule out barrier being pulled down past Fermi level; 

% Do so in way that plotting does not show the bad data. 
if ys > 1.0; 

Jfns = NaN; 

end 

% Forbes functions: a little faster to code because of 
% changes to Q and Phi 

vys = 1.0 + (ys. A 2).*(log(ys) - 3.0)/3; 
tys = 1.0 + |ys.*2}.*(1.0 - log(ys))/9; 

% FN terms 

Afn = (re~l.5)/(16*(pi'2)*rhbar*Phi*tys.~2); 

Bfn = (4.0/(3*rhbar))*sqrt(2*re*rmo*Phi~3) *vys; 

x = (2*rkb*T/rhbar)*sqrt(2*re*Phi)*tys/F; 

% Temperature Effects 

sig = ((1.0 + x. A 2) ./ (1.0 - x. A 2)) - ... 

(2.0 - ( (pi»2}/6))*x.-2 + ... 

0.2 5* ( ( (7*pi*4) /90) - 8) *X. , '4; 

% The final equation 

Jfns = Afn*(F.*2) .*exp(-Bfn./F)*sig*Acm2; 

These algorithms were used for the preparation of Figures 24.7 (relation between F and r/>) and 24.8 for the represen¬ 
tation of J F in Fowler-Nordheim coordinates. 

A3.18 Roots of the quadratic image charge barrier 

A method of rapidly finding the roots of the equation V 0 -Fx- (Q/x) + yx 2 = 0 is given, based on the initial choices of 
Eq. (24.63) used in the recursion relation of Eq. (24.62) as per the iterative prescription of Section A3.7.2. 

tt Numerical Evaluation of the Roots of 
% V (x) = Vo - Fx - Q/x + gx~ 2 

Xn = @(x,V,F,Q,g) (g*x.~3 - V*x + 2*Q).*x... 

/(Q - F*x.~2 + 2*g*x."3) ; 


Q = 0.3599911214; 

if exist ('Vo'var') == 1; else Vo = 4.0 ; end 
if exist ('F'var') == 1; else F =2.0 ; end 
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if exist('g'var') == 1; else g =0.22 ;end 

if exist('Ks','var') == 1; else Ks = 8.0 ;end 

if exist(' Nx' ,'var') == 1; else Nx = 12 ;end 

Qs = Q/Ks; 

Ra = zeros(Nx,3); 

% Zeroth order approximations 

Ra(1,1) = 2*Qs/(Vo + sqrt(Vo~2 - 4*Qs*F)); 

Ra(1,3) = (F-3 + Qs‘g-2)/(g*(F~2 + g*Vo)); 

Ra (1,2) = Qs/(g*Ra(l,1)*Ra(1,3)); 

t Iterative Solution 
for j = 2:Nx; 

for ix = 1:3; 

Ra(j,ix) = Xn(Ra(j-1,ix),Vo,F,Qs,g); 

end 

% Output 

dlmwrite('VxRoots.txt',Ra,'delimiter',... 

'\t','precision',6) 
x = [x' ,y' ] ; 

dlmwrite('vx.txt',x,'delimiter','\t','precision',6) 


A3.19 Zeros of the airy function 

The following algorithm starts with Eq. (26.22) to produce an approximate value of w n (called y in the code) for 
which Ai(w n ) = 0, and improves it using Newtonian iteration via Eq. (26.23) to provide the values used in Table 26.1. 
The algorithm then implements Eq. (26.28) to make Figure 26.5. 

%% Create a table of the first 20 Airy Function zeros 
clear; clc 
%Control flags 
iout = 1; 

% Defining functions 

wn = 0(x) -(X.M2./3)) .*(1 + (5./(48*x.~2))); 

if exist('N','var') == 1; else N = 32,-end 

n = (1 :N) ; 

M = zeros(N,3); 

M(:,1) = n; 
x = 3*pi* (4*n-l)/8; 

% reasonable estimate of the zeros 
y = wn(x) ; 

M(:,2) = y; 
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t High accuracy: iterative improvement 
% by Newtonian Iteration 
y = y - (airy (0,y) . /airy (1,y) ) ; 
y = y - (airy (0,y) . /airy (1,y) ) ; 

M(: ,3) = y; 

if iout > 0; 

filenam= 'AiZeros.txt'; 

dlmwrite(filenam,M,'delimiter','\t','precision' , 10) ; 

end 

clear M n filenam x 

tt unsealed Graph 
Nx = 100; 

S = -2.0*((l:Nx)-1)/(Nx-1); 

Na = [2, 4, 8, 16] ; 

Srho = zeros(4,Nx); 

M = zeros(Nx,5); 

M(: ,1) = s; 

for j = 1:4; 

for k = 1:Na(j); 
for i = 1:Nx; 

Srho (j , i) = Srho (j , i) + (Na(j) A 2 - k~2)*... 
(airy (0,y(k) - s(i))./airy(l,y(k))).''2; 

Srho(j,:) = Srho(j,:)/Na(j)"2; 

M ( : ,1+j) = Srho(j,:) ; 

if iout > 0; 

filenam= 'UnscaledAiS.txt'; 

dlmwrite(filenam,M,'delimiter','\t','precision',10); 

figure(1); plot(s,Srho) 

%% scaled Graph 
Nx = 200; 

S = -2.0 * ( ( (1:Nx )-!)/( Nx-1) ) . ~ 2 ; 

Srho = zeros(4,Nx); 

M = zeros(Nx,5); 

M(: ,1) = s; 

for j = 1:4; 

for k = 1:Na(j); 
for i = 1:Nx; 

Srho (j , i) = Srho (j , i) + (Na(j)'~2 - k*2)*... 
(airy(0,y(k) - Na(j)*s(i>)./airy(l,y(k))).-2; 


end 
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Srho(j,:) = Srho(j,:)/Na(j)~2; 
M(: ,1 + j) = Srho(j,:); 


if iout > 0; 

filenam= 'ScaledAiS.txt'; 

dlmwrite(filenam,M,'delimiter','\t','precision',10); 

end 

figure(2); plot(s,Srho) 


A3.20 Atomic sphere radius r s 

The following algorithm first calculates the data points used to construct Figure 27.3, and then finds the minimum of X e as 
a function of r s for a range of r jf which is used to construct Figure (27.4). Observe that Ekxco is given by Eq. (27.22) and 
DEkxco is its derivative with respect to r s (represented by x in the algorithm). Similarly, El cor is the sum of e a + e ei , and 
DElcor is its derivative with respect to r s . Lastly, after using the MATLAB routine MIN to find the approximate location 
of the minimum, four iterations using finite different approximations (Section Al.3.2) are used to solve the quadratic 
equation 

y(x 0 + 8) = 0 = y(x 0 ) + 8 /( x 0 ) + i<5 2 j/'(x„) (A3.22) 

to find the better estimate x 0 +8 that is used to find corrections to x 0 . The toggle C { = 0 turns off the inclusion of e ee + e ej ; 
Cj = 1 includes it. 

%% Program AtomicR 

% Evaluation of minimum of Sigma-eps: a method of 
% determining rs if the ionic radius of a metal 
% (valency = 1) is given 

% Formulae 

% kinetic, exchange, and correlation function 
% Ci = 0 turns off the Ground state energy 
% =1 turns on the Ground state energy 

Ci = 0; 

% Kinetic + Exchange + Correlation & Deriviative 
Ekxco = @(x) (2.21./x.-2) - (0.9163,/x) -... 

0.874./(2.946 + x + 3.362*sqrt(x)) ; 

DEkxco = @(x) (-4.42./x.^3) + (0.9163./x.~2) +... 

0.874* (1.0 + 1.681./sqrt(x)) ./... 

((2.946 + x + 3.3 62 *sqrt(x)) .~2) ; 

% Ion core & Derivative 

Elcor =@(x,a) 3*(-3*x.~2 + 5*a. '2) . /(5*x.~3); 

DElcor =@ (x, a) 9* ( x.~2 - 5*a.-2) ./(5*x.''4) ; 

N = 50; S = (0:N-1)/(N-l); 
x = -0.01+exp(log(100)*s); 

Na = 5; 
a= (0:Na-1) ; 
y = zeros(N,Na); 
for i = 1:N; 

for j = 1:Na; 

y(i,j) = Ekxco(x(i)) + Ci*EIcor(x(i),a(j)); 
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Mx = [x' , y] ; 

dlmwrite('IonCore.txt',Mx,'delimiter',... 

'\t','precision',6) 
clear x y Mx i j Na N s 

tt Find rs (here called x) given ri (here called a) 

Del = 0.004; % Used to make finite difference... 

%... approximations 

Na =41; % Sets size of the ionic radii vector a 

a = 6* ( ( (1 :Na) -1) / (Na-1) ) ..*1.5; % Gives better spacing 

rs = zeros(l,Na); 

iout =0; % turn on to monitor error assessment 

for i = 1:Na; 

% find minimum using methods based in MIN function 
x = 0.1 + 8.0* (0:100)/100; 
y = Ekxco(x) + Ci*EIcor(x,a(i)); 

[~, io] = min(y); xo = x(io); 
for j = 1:4; 

fp = DEkxco(xo + Del) + Ci*DEIcor(xo + Del,a(i)); 
fm = DEkxco(xo - Del) + Ci*DEIcor(xo - Del,a(i)); 
fo = DEkxco(xo) + Ci*DEIcor(xo,a(i)); 

xo = xo-4*Del*fo/(fp - fm +... 

sqrt (abs ( (fp-fm) , '2 + 8*fo* (2*fo-fm-fp) ) ) ) ; 

end 

rs(i) = xo; 
if iout > 0; 

disp(['dEkxc/dr„=„' , . . . 

num2str(DEkxco(xo) + Ci*DEIcor(xo,a (i)))]) ; 

end 

clear Del fm fo fp i j io Na x y xo yo 

% Compare to data Kittel (Table 9 p 78 & Table 1 p 126) 
t of "Introduction to Solid State Physics, 7th Ed ." 

% aex is converted to ri by dividing by Bohr radius 
% [ Li, Na, K, Rb, . . . 

% Cs, Cu, Ag, Au] 

aex = [1.2850, 1.8330, 2.5132, 2.7967,... 

3.1557, 2.5510, 2.3810, 2.5888]; 

rex = [3.2500, 3.9300, 4.8600, 5.2000,... 

5.6300, 2.6700, 3.0200, 3.0100]; 

plot(a,rs,aex,rex,'o'); axis ( [0 4 1 6]) ; 

Mx = [a',rs']; 

dlmwrite('rvsa.txt',Mx,'delimiter','\t','precision',6); 
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A3.21 Sodium exchange-correlation potential 

The following algorithm evaluates the electron density p(x) for the infinite barrier using Eq. (26.5), forms a hyperbolic 
tangent approximation p a (x) to it using Eqs (26.52) and (26.18), then uses p a (x) to construct the exchange correlation 
potential using only e ex (Section 27.2) and e cor (Section 27.3). 

%% Program ExCor 

% Evaluation of the potential profile near a surface 
% based on simple exchange-correlation calculation 
% Hardwired to parameters for Sodium and neglect of 
% ion core term 

% Hyperbolic Density approximation 
Rhoax =@(x,a,b) a./(1.0 + exp(b*x)); 

%lambda from Friedel model 

lam = (64*sqrt (2) / (9*pi~4) ) * (16 + 12*pi - 3*pi~2) ,« 
mo = 5.6856299; hbar = 0.65821193; ao = 0.052918; 

% Sodium Parameters 

chem = (hbar*kf). ~2/(2*mo); % Chemical Potential of Na 

kf = sqrt(2*mo*chem)/hbar; % fermi wavenumber 
rs = (3.0/(4*pi*rho*ao~3))*(1.0/3); %Wigner-Seitz radius 
Phi =2.29; % Work function of Na 

xi = - (3*pi)/ ( 8*kf); % positive background 

F = 2.0; % ad hoc field 

Nx = 201; 

z = -10*pi + 24 *pi*(0:Nx-1)/(Nx-1) ; 
x = (z/(2*kf)) - xi; 

Vf = F*max(x,0.0); 

rhox = zeros(l,Nx); Vimg = rhox; Vxc = rhox; 
rhoa = Rhoax(kf*x,rho,lam); 
for i = 1:Nx; 

rhox(i) = rho*FUNXic(z(i)); 

Vimg(i) = Vimage(x(i),chem+Phi,F); 

Vxc(i) = FUNxcor(rho) - FUNxcor(rhoa(i)) - Vf(i); 

Mx = zeros(Nx,4); 

Mx (:,1) = rhox; Mx(:,2) = rhoa; 

Mx(:,3) = Vxc; Mx(:,4) = Vimg; 
figure(2);plot(x,Mx); 

Mx = [x',Mx]; 

dlmwrite('NaXCor.txt',Mx,'delimiter',... 

'\t','precision',6); 
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A3.22 Field-dependent work function 

The following algorithm evaluates the current density using Eq. (13.25) and compares the cases where the work function 
is field dependent + 2 Fx 0 ) and where it is not (<t> 0 ). The resulting curve is fitted to a quadratic equation. 

%% Program Animag 

t Comparison of Analytic Image Charge to standard 
t Fowler Nordheim equation 

% constants 
hbar = 0.658212; 
m = 5.68563; 

Q = 0.359991; 

Phio = 4.5; 
chem = 7.0; 

Nf = 41; 

% Field Emission: Jfo = straight FN, Jf = analytic image 
t Method: make a vector of F values. Evaluate xo using 
% 

F = 2.0 + 8*(0:Nf-1)/(Nf-1); 

xo = hbar./sqrt(2*m*(chem + Phio - sqrt(4*Q*F))); 

Phi = Phio + 2*F.*xo; 

Jfo = zeros(l,Nf); Jf = Jfo; 
for i = 1:Nf; 

Jfo(i) = FUNjfn(F(i),Phio); 

Jf(i) = FUNjfn(F(i),Phi(i)); 

y = Jf./Jfo; 

P = polyfit(F,y,2); ya = polyval(P,F); 
figure(2); plot (F, y, ' o', F, ya) ; 

Mx = [F',Phi',Jfo',y',ya']; 

dlmwrite('AnFN.txt',Mx,'delimiter','\t','precision',6); 


A3.23 Digitizing an image file 

The following algorithm takes a gray-scale PNG image file (specifically, the ones shown in Figure 29.9) and returns a 
matrix of numerical values that can be processed using MATLAB. In the present circumstance, to give Figure 29.10, the 
maximum values were cut off at z max to allow the regions of low intensity to be seen in greater contrast, and a floor cut 
off to bring the low intensity regions into greater contrast trough the usage of z min . A vertical white line can be imposed 
on the data by setting the elements along that line to z max . 

Because digitized data from images is noisy (which makes the line plots derived from them, such as Figure 29.11, 
rather spiky), a smoothing algorithm was employed by the name of Smoothz, the effects of which are to replace each 
element of the image matrix Z (J by a weighted average. If Z, is a vector of values, then it amounts to 


Z' = 


Z f+1 + R 0 Zi + Z,. 

R 0 + 2 


(A3.23) 


and the extension to a matrix Z tj - is straightforward. The parameter nloop controls the number of times the smoothing 
process is implemented. The effect is to mildly blur the image. 
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clc;clear; 

Ro = 2; nloop = 0; zmax = 60; zmin = 5; Nx = 512; 
io = 205; 

Raw first 

Zl=double (imread (' 18 .png' ) ) ,-max (max (Zl) ) 
figure(l); surf(Zl); axis tight square; shading flat; 
axis([1 Nx 1 Nx 0 100]); view(2); colormap gray; 
axis off; grid off; 

%% processed first 
Z=Z1; SmoothZ; Zl = Z; 

Zl = max(Zl,zmin) - zmin; Zl = min(Zl,zmax); 
za = Zl(:,io); 

Zl(:,io) = zmax; 

figure(2); surf(Zl); axis tight square; shading flat 
axis( [1 Nx 1 Nx 0 (zmax)]); view(2); colormap hot; 
axis off; grid off; 

%% Raw second 

Z2=double (imread (' 23 .png' ) ) ,-max (max (Z2) ) 
figure(3); surf(Z2); axis tight square; shading flat; 
axis( [1 Nx 1 Nx 0 100]); view(2); colormap gray; 
axis off; grid off; 

processed second 
Z=Z2; SmoothZ; Z2 = Z; 

Z2 = max(Z2,zmin) - zmin; Z2 = min(Z2,zmax); 
zb = Z2(:,io); 

Z2(:,io) = zmax; 

figure(4); surf(Z2); axis tight square; shading flat 
axis( [1 Nx 1 Nx 0 (zmax)]); view(2); colormap hot; 
axis off; grid off; 

%% line plot 

figure(5); xa = (l;Nx); xb = xa+16; 
plot(xa,za,'r',xb,zb,'b'); 
axis( [10 500 0 60]) ; 
legend('18','23'); 

Mx = zeros(Nx,4); 

Mx(:,1) = xa; Mx(:,2) = za; Mx(:,3) = xb; Mx(;,4) = zb; 
dlmwrite('CsW.txt',Mx,'delimiter','\t','precision',6) 

A3.24 Lattice gas algorithm 

A two-dimensional grid of sites is randomly populated with adatoms so that they preferentially cluster near the center. 
At each time step, the atoms are allowed an opportunity to jump to an adjacent site if one is available; the order the 
atoms are chosen is based on a shuffle algorithm, modeled after a version given by Knuth ("Algorithm P: (shuffling)". 
Ref. [411], p. 139), which shuffles the order of the indices of a vector. This algorithm is used to generate Figure 29.15. 
The shuffling algorithm is called shuffle (Nc). 

%% Program HoppingCs 

% A simple algorithm to allow point particles 
% to hop from site to site on a square lattice. 
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t The length scale is set by the radius 
% of the initial distribution. 


% Pore distribution function 

Pore = @ (x, a) 1.0./(1.0 + exp (a* (x.'2 - if)).; 


t Set up Initial State; 

% ro controls the box size; a controls sharpness of 
% the initial distribution; po controls probabiity 
% of a jump. 

% N is tne number of coordinates on a box side 
% Nt is the number of time recordings 
t Ns is the number of time steps between recordings 
% Nr is the number of r coordinates to plot 


if exist(' ro','var') 
if exist( ' a','var') 
if exist( 'po','var') 
if exist( 'N','var' ) 
if exist( 'Nt','var' ) 
if exist( 'Ns','var' ) 
if exist( 'Nr','var') 


1; else ro = 2 ;end 

1; else a =16 ;end 

1; else po =0.2 ;end 

1; else N =128 ;end 

1; else Nt = 6 ;end 

1; else Ns =250 ;end 
1; else Nr = 16 ;end 


x = (2* (1:N)-N-l) *ro/ (N-l) ;y = x; 
z = Pore(x,a); 

figure(l); plot(x,z); clear z 


% Initial distribution 
Mx = zeros (N); Mxo = Mx; 

R = rand(N);ic = zeros(1,N~2); jc = ic; 

Nc = 0; 
for i = 1:N; 
for j = 1:N; 

if R(i,j) < Pore (sqrt(x(i)~2 + y(j)~2),a); 
Nc = Nc+1; ic(Nc) = i; jc(Nc) = j; 

end; 

end; 

end; 

ic = ic(1:Nc); 
jc = jc(1:Nc); 
xc = (2*ic -N-l) *ro/ (N-l); 
yc = (2*jc -N-l) *ro/ (N-l) ; 


% rho is the radial distribution; 

% vecxy is the matrix of occupied sites 
rho = zeros(Nt,Nr); vecxy = zeros(2*Nt,Nc); 
vecxyfl,:) = xc; vecxy(2,:) = yc; 
for i = 1:Nc; Mxo(ic(i), jc(i)) = 1; end; 
rho(l,:) = FUNrhosum(x,y,Mxo,ro,Nr); 
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for 1 = 1:Ns; 

% set up hopping directions 
ihop = floor(4*rand (1,Nc)); 

% see if they hop, and if the hop-to 
% space is empty, move them 
% Hop directions: 

% 0 = +x; 1 = +y; 2 = -X; 3 = -y 
% shuffle the vectors: 
ix = shuffle(Nc); 
ic = ic(ix); jc = jc(ix); 
for i = 1:Nc; Mx(ic(i), jc(i)) = 1; end; 
for i = 1:Nc; 
if rand() < po; 
switch ihop(i); 
case 0 

n = ic (i)+1; if n > N;n = 1 ;end; 
if Mx(n,jc(i)) < 1; 

Mx (n, j c (i) ) = 1; Mx (ic (i) , j c (i) ) = 0; 
ic(i) = n; 

end; 

case 1 

n = jc(i)+l; if n > N;n = 1 ; end ; 
if Mx(ic(i),n) < 1; 

Mx(ic(i),n) = 1; Mx(ic(i),jc(i)) = 0; 
j c (i) = n; 

end; 

case 2 

n = ic (i)-1; if n<l;n = N ;end; 
if Mx(n,jc(i)) < 1; 

Mx(n,jc(i)) = 1; Mx(ic(i),jc(i)) = 0; 
ic (i) = n; 

end; 

otherwise 

n = jc(i)-l; if n<l;n = N ;end; 
if Mx(ic(i),n) < 1; 

Mx(ic(i),n) = 1; Mx(ic(i),jc(i)) = 0; 
jc (i) = n; 

end; 

rho(k,:) = FUNrhosum(x,y,Mx,ro,Nr); 
vecxy(2*k-l,:) = (2*ic -N-l)*ro/(N-l); 
vecxy(2*k,:) = (2*jc -N-l)*ro/(N-l); 

end; 

clear i j R 

% Two ways to plot Mx that are equivalent: 

xco = (2*ico-N-l)*ro/(N-l); yco=(2*jco-N-1)*ro/(N-l); 
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xc = (2*ic -N-l)*ro/(N-l); yc =(2*jc -N-l)*ro/(N-l); 
figure(2); plot(xco,yco,'bo',xc,yc,'r.'); 
axis square tight; axis([-ro ro -ro ro] ) ; 
figure(3); plot(rho'); axis square tight; 

dlmwrite('Rhot.txt',rho','delimiter', . . . 

'\t','precision',6) 

dlmwrite('vecxy.txt',vecxy','delimiter', . . . 

'\t','precision',6) 

function ix = shuffle(Nc) 

t returns array of indicies that will shuffle vector 
vx = (1:Nc); 
for i = Nc:-1:1; 

io = floor(i*rand())+1; 
vo = vx(i) ; 

vx(i) = vx(io); 
vx(io) = vo; 

end 

function rx = FUNrhosum(x,y,Mx,ro,Nr) 

N = length(x); 
rx = zeros(l,Nr); 
for i = 1:N; 

for j = 1:N; 

n = floor((Nr+1)*sqrt(x(i)^2 + y(j)*2)/ro)+1; 
if Nr+1 > n;rx(n) = rx(n) + Mx(i,j) ;end; 

end; 

end; 

nx = (1:Nr)-0.5; rx = rx./nx; 


A3.25 Evaluation of the point charge model functions 

The following algorithm evaluates S n (r), constructs P n (r ) from the iteration algorithm of Eq. (30.32), and then evaluates 
p n (r) using Eq. (30.35). Lastly, P n (f) is approximated by A P n ~ 1 P 0 using Eqs (30.38) and (30.48) (Schottky's conjecture). 
The output was used in Figure 30.9. 

%% Program PnrCalc 

% calculation of Pn(r) using a recursion relation 

if exist('r','var') == 1; else r = 0.75;end 
if exist('Np','var') == 1; else Np = 25;end 

betao = @(x) 1.0 + ((2*(1 + x)~2)./(x.*(2+x))); 

S = zeros(l,Np); P = S; 

S = [S,(1.0 - r.MNp+1) ) / (1.0 - r) ] ; %S longer than P 

%% Create Sn(r): 

Silt = 1.0; 

if r < 0.99; 

for j = 2:Np; S(j) = (1.0 - r~j)/(l-r); end; 
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for j = 2:Np; S(j) = 1.0 + r*S(j-l); end; 

end 

%% Create Pn(r) : 

P(l) = 0.5* (1.0 + r)*(2.0 + r); 
for n = 2:Np; 

P(n) = 0.5*S(n+1); 
for j = 1:n-1; 

P(n) = P(n) - ... 

(P(j)*S(j) / (S(n+l-j)*(S(j) + S (n+1) ) ) ) ; 

end 

P(n) = P(n)*(1.0 + (S(n+1)/S(n))); 

%% Create Betan(r): 

beta = ones(l,Np); 
for n = 1:Np; 

for j = 1: n; 

beta(n) = beta(n) + (4*P (j)*S(j)*S(n+1) ./... 

( (r~j ) * (S (n+l-j ) * (S (n+1) + S(j))) A 2)) ; 

end; 

end; 

%% Approximate beta 
x = (1:Np); 

% ao = - (d/dx) ln(Px(r)); 

ao = polyfit(x,log(P),1); ao = -ao(l); 

bo = betao(r); 

delb = 1.0/(r*exp(ao)); 

betaA = zeros(l,Np); 

betaA(l) = bo; 

for j = 2:Np; 

betaA(j) = delb*betaA(j-1); 

end; 

semilogy(x,beta,'o',x,betaA); 
xlabel('n','Fontsize',24); 
ylabel('\beta_n(r)','FontSize',24) 
legend('exact','approx') 

S = S(1:Np); %make S, P, and beta the same size again; 

%% Send< to Output file 
M = zeros(Np,5); 

M ( : ,1) = x' ;M(:,2) = S';M(:,3) = P'; 

M ( : ,4) = beta';M(:,5) = betaA'; 

dlmwrite('Pnr.txt',M,'delimiter','\t','precision',6) 
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A3.26 Modeling of Field emitter l(V) data 

The algorithm takes the experimental I(V) data of Figure 30.10 (based on Figure 2 of ref. [458] and reproduced in 
Table A3.2 using the field enhancement and area factor models of Eqs (30.18) and (30.19) based on the hemisphere 
model) and analyzes it using field enhancement and notional emission area concepts. Refinements of the hemisphere 
model using a prolate spheroidal model will change g(F), but will not dramatically change the theoretical I{V) curve. 
The algorithm used a single data point labeled by i 0 = 3 to make an estimate of the tip radius a s using one of the lower 
current data points, after which Eq. (30.124) provides the apex field value andEq. (30.18) provides the estimate of the tip 
current. The default parameters of a 3 = 375 nm, 9 C = 15°, and O - 4.3 eV are assumed based on fabricated geometry and 
accepted values. The method of bisection (Section A3.7.1) is used to find a s . The input data of Table 30.10 is presumed 
to exist as comma-delimited numerical data in a file called Figdat. txt. 

%% Program SchwoebelFig 
% Take digitally extracted data from 
% P. Schwoebel, et al., JVSTB, Fig 2 and analyze it 
% using the field enhancement + notional area model 

% Load Data 

load Figdat.txt; 

Siv = Figdat; clear Figdat; 

%% Assign data: 


VI = Siv(:,1); 

11 = Siv 

(:,2) ; 

V2 = Siv(:,3) ; 

12 = Siv 

(:,4) ; 

V3 = Siv (:,5) ; 

13 = Siv 

(:,6) ; 

V4 = Siv(:,7) ; 

14 = Siv 

( : , 8) ; 


tt Analyze Data 

betag = @(x,g,b) ((pi/log(((86 + (g/x))/... 

(54*tan (b) ) ) * (g/x)) ) - (tan (b) ~2) ) /x; 

% Input values: Mo work function of 4.3 eV 
% recommended by Samsonov 


Table A3.2 l(V) digitally extracted data from Figure 2 of ref. [458] and shown in Figure 30.10 for the 
initial and final voltage (V) and current (/). 


VI 

11 

V2 

12 

V3 

FI 

V4 

F2 

56.30 

1.014x1 O' 8 

95.79 

9.864 xIO 9 

107.1 

1.014x10 s 

109.6 

1.052 x10 s 

58.78 

2.026 x10 s 

101.6 

2.008 x10 s 

112.1 

2.063 x10 s 

114.3 

2.101 x 10 s 

61.55 

5.037 x10 s 

107.4 

4.947 x10 s 

117.1 

5.129x10 s 

119.5 

5.320 x10 s 

64.58 

9.881 x 10 s 

111.1 

9.797 x10 s 

122.6 

1.006X10 7 

124.0 

1.025 x 10 7 

67.35 

2.010 xIO 7 

116.1 

1.940X10 7 

128.1 

2.01 OxlO 7 

129.8 

2.010 xIO 7 

72.04 

4.907 xIO 7 

124.0 

4.914 xIO 7 

135.8 

5.182 x 10 7 

137.8 

5.089 xIO 7 

76.18 

9.803 xIO 7 

131.8 

9.468 xIO 7 

142.7 

9.626 xIO 7 

144.4 

9.983 xIO 7 

80.60 

1.958 x10 s 

139.7 

1.927 x10 s 

150.7 

1.994 x10 s 

152.7 

2.031 x 10 s 

88.33 

4.868 x10 s 

155.9 

4.880 x10 s 

161.5 

4.868 x10 s 

163.7 

4.957 x10 s 

94.96 

9.725 x 10 s 

164.6 

9.664 x10 s 

170.4 

1.009 x10 s 

173.7 

9.904 X 10 s 

101.0 

1.908 x10 s 

175.8 

1.862 x10 s 

180.9 

2.015 x10 s 

182.8 

1.943 x10 s 
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if exist('N', 
if exist('io' 
if exist('ag' 
if exist('bc' 
if exist('Phi 


'var') == 1; 
'var') == 1; 
'var') == 1; 
,'var') == 1 


else N 
else io 
else ag 
else be 
else Phi 


= 76; end 
= 3; end 
= 375.0; end 
= 15*pi/l80; end 
= 4.3; end 


t Fowler Nordheim parameters (numerical values 
% for constants used) 

Bo = 6.8308898*sqrt(Phi~3); 
nu = 1.6393731/sqrt(Phi); 

Ao = (1.368477e-6 / Phi)*(280.16579*Phi*2)-mi; 


l = zeros(1,4); 


for k = 1:4; 

amax = 50.0; 
amin = 0.5; 

Io = Siv(io,2*k); 

Vo = Siv(io,2*k-l); 
for j = 1:24; 

ao = (amax + amin)/2; 

F = betag(ao,ag,be)*Vo; 

Itest = Ao*(F*(2-nu))*... 

exp (-Bo/F) *2*pi* (ao-2) *... 

F/(Bo + F*(4 - nu)); 

if Itest > Io; amin = ao; else amax = ao,-end 

end 

an(k) = (amax + amin)/2; 


Th = zeros(N,4); 

Vx = 50 + 150*(0:N-1)/(N-l); 
for j = 1:N; 

F = betag(an(l),ag,be)*Vx(j); 

Th(j,1) = Ao*(F"(2-nu))*... 

exp(-Bo/F)*2 *pi* (an (1)"2) *... 
F/(Bo + F*(4 - nu)); 

F = betag(an(2),ag,be)*Vx(j); 

Th(j,2) = Ao*(F"(2-nu))*... 

exp(-Bo/F)*2 *pi*(an(2)"2) *... 

F/(Bo + F*(4 - nu)); 

F = betag(an(3),ag,be)*Vx(j); 

Th(j,3) = Ao*(F-(2-nu))*... 

exp(-Bo/F)*2 *pi*(an(3)*2) *... 

F/(Bo + F*(4 - nu)); 

F = betag(an(4),ag,be)*Vx(j); 

Th(j,4) = Ao*(F"(2-nu))*... 

exp(-Bo/F)*2 *pi* (an (4) '2) *... 

F/(Bo + F*(4 - nu)); 

end; 
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semilogy(VI,II,'o',V2,I2, 's' ,V3,I3, 'd' ,V4,I4,' v' ,Vx,Th); 
xlabel('Voltage^[V] ' , ' FontSize',24); 
ylabel('Current^[A] ' , 'FontSize',24) ; 
axis([50 190 5e-9 5e-5]); 


A3.27 Modeling a log-normal distribution of field emitters 

In modeling an array of field emitters for which the apex radius {«,} is log-normal distributed, a conventional approach 
might be to choose the radius with a probability based on Eq. (29.1), but here a much faster algorithm is presented. 
First, observe that the number n t of emitters with a radius a t is given by 

n, = N j ' L{x)dx = ^(Erf(A,) - ErffA^ )) (A3.24) 

where Erf(x) is the error function of Eq. (A2.I0), N is the total number of emitters, and 

A(a)= ^=- ln (^) (A3.25) 

The emitters are then placed sequentially on the array and their positions are "shuffled" by the Shuffle (Nc) algorithm 
of Section A3.24. As a result, the array has randomly placed emitters that are log-normal distributed. 


A3.27.1 Driver 

The Driver routine loads the parameters, finds the maximum and minimum cij values, whereby tij = 0 for larger or 
smaller aj, and plots the results. Finding the maximum and minimum cij values is accomplished by zoomas, a relatively 
cheap method of redistributing the Oj values over a region where the rij are non-zero. 

%% Program Driver: 

t This program populates values needed and runs StatTip 


if exist('Fo','var') 
if exist('sig','var') 
if exist('ao','var') 
if exist('amin','var') 
if exist('amax','var') 
if exist('nj','var') 
if exist('N','var') 


1; else Fo 
1; else sig 
1; else ao 
1; else amin 
1; else amax 

1; else N 


5.0; end 
0.08;end 
16; end 
0; end 
5 0; end 
32; end 
128; end 


% Cheap method to find the max/min values of as 
for j =1:5 
StatTip; 
zoomas; 

end 


% Mo plots the sorted Jfo; M plots random placement Jo 
M = zeros(N);Mo = zeros(N); 

Jmax = max(Jf); 
for j = 1:N; 

Mo(:,j) = Jfo((j-1)*N+1:j*N)/Jmax; 

M ( : , j) = Jf((j-l)*N+l:j*N)/Jmax; 


end 
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% Set at least one tip to zero current so surface plot 
% better shows the distribution of current 
Mo(N-1,N-1) = 0.0; 

M(N-1,N-1) = 0.0; 

%% Plot results 

figure(1); 

plot(as,ntips/max(ntips),as,jtips/max(jtips)); 
figure(2); surf(Mo); axis square tight; 

shading flat; view(2); colormap hot 
figure(3); surf(M); axis square tight; 
shading flat; view(2); colormap hot 

%% Send to Output file 
Mout = zeros(nj,3); 

Mout(:,1) = as';Mout(:,2) = ntips';Mout(:,3) = jtips'; 
filn = ['sig',num2str(1000*sig),'.txt']; 
dlmwrite(filn,Mout,'delimiter','\t','precision',6); 
clear Mout filn aso kmin Ao Bo j nu Phi F 


A3.27.2 StatTip 

Each value in the vector Jf is the total tip current for an ellipsoidal emitter with apex field F(aj) = F tip a 0 /aj, where a 0 ** /t 
of the log-normal distribution and F tip is the field on the median tip. The vector Jf is shuffled and mapped over to the 
array matrix [M]y = When y is plotted as a surface map, each pixel mimics the output of the tip. Figure 30.29 was 

generated in this way for various values of a. 

%% Program StatTip 

% Create 1(F) data for a statistical distribution 


if exist(' nj ', 'var' ) == 1; 
if exist( 'N','var' ) == 1; 
if exist(' ao' ,' var') == 1; 

if exist(' amin','var' ) == 1; 
if exist(' amax','var ' ) == 1; 
if exist(' Phi','var' ) == 1; 
if exist(' Fo','var' ) == 1; 
if exist(' sig','var') == 1; 
if exist(' ish','var') == 1; 


else 

else 

else 

else 

else 

else 

else 

else 

else 


nj = 24; end 
N = 64; end 

ao = 8; end 
amin = 2; end 
amax = 24; end 
Phi = 4.5; end 
Fo =3.0; end 

sig = 0.4; end 

ish=shuf fle (N , '2) ,-end 


t Fowler Nordheim parameters (numerical) 

Bo = 6.8308898*sqrt(Phi~3); 
nu = 1.6393731/sqrt(Phi); 

Ao = (1.368477e-6 / Phi)*(280.16579*Phi~2) *nu; 


t Tip radii: aso prevents maximum field from 
% exceeding 12 eV/nm by "blunting" the tip to 
t aso it does 

aso = max(Fo*ao/12.0,amin); 

as = aso + (amax-aso) * (0 :nj-1) / (nj-1) ; 

F = Fo*ao./as; 
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Jf = zeros(1,N~2); jtips = zeros(size(as)); 
ntips = floor(N~2*GenLN(as,ao,sig)); 

t eliminate the larger as that do not have tips 
t absorb the left-over tips into the largest tip cell 
ntips(nj) = N~2 - sum (ntips(1:nj-1)); 
for j = 1:nj; 

if ntips(nj-j) < 1; 

ntips(nj-j) = ntips(nj-j+1); 
ntips(nj-j+1) = 0; 

else 

break; 

end 

% Calculate the tip current for each tip 
kmin = 0; 
for j = 1:nj; 

if ntips(j) >0; 

Jtemp = Ao*(F(j)~(2-nu))* ... 

exp(-Bo/F(j))*2*pi*(as(j)~2) * ... 

F(j)/(Bo + F(j)*(1 - nu)); 

Jf(kmin+1:kmin+ntips(j)) = Jtemp; 
jtips(j) = ntips(j)*Jtemp; 
kmin = kmin + ntips(j); 

end; 

end 

% Jfo is sorted; Jf is randomized placement 

Jfo = Jf*160.2176487; %convert to microamps 

Jf = Jfo(ish)*160.2176487; %convert to microamps 

A3.27.3 zoomas 

This routine resets a max and a min based on the number of emitters, so that the resulting distribution of aj contains n j bins 
for which the number of emitters in the /tb bin is n y 

for j = 1:nj; 

if ntips(j) < 1; 
amin = as(j); 

else 

end 

end 

for j = 1:nj; 

if ntips(nj-j+1) < 1; 
amax = as(nj-j+l); 

else 

break 

end 


end 
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A3.27.4 GenLN 

This routine performs the calculation of the number of emitters for aj < a< a j+l for emitters that are log-normal 
distributed. 

function Lx = GenLN(as,mu,sig) 

% Function GenLN 
% Generate the ntips parameter 
% according to a log-normal distribution 


exps = @(x) exp(~x. A 2)/sqrt(pi); 
nj = length(as); 


if exist( 
if exist( 
if exist( 


sig','var') == 1; 
mu','var') == 1; 
xmax','var') == 1 


else sig 
else mu 
else xmax 


0.2; end 
6; end 

3 *mu ; end 


xj = log(as/mu)/(sig*sqrt(2)); 
yj = zeros(l,nj); 

yj (1) = integral(exps,-Inf,xj(1)); 

for i = 2:nj-1; 

yj (i) = integral(exps,xj(i-1),xj (i)); 

end 

yj(nj) = integral(exps,xj(nj),Inf); 


Lx 


yj ; 


end 


A3.28 Simple shell and sphere algorithm 

Two populations of electrons are created at a depth of 8 = 12.8 nm into copper: the first population (blue = "shell") is 
electrons that travel without scattering, and the second population (red = "sphere") is electrons that travel for a time 
t = 1 fs then scatter to a random direction with a new energy Ej = n + Tico/ 2L The initial velocity of all electrons was set 
to v g = \j(2/m)(n + hoj) for X = 532 nm. All shell electrons are at the same distance r = (x 2 +y 2 + z 2 ) l/2 from the origin, 
although the plots only show the (x, z) coordinates, as in Figure 31.24. The terms r a and r 0 are the rms radii, and are used 
to represent the red and blue lines. Lastly, in choosing the random direction, it must be ensured that the choice of any 
direction is equally probable. Although <£ = 2nr with r a random number chooses the azimuthal angle, the polar angle 
requires greater care [ 108] as the integration over a surface element da on a sphere of radius a involves da = a 2 sin 0d0d(/>. 
Therefore, it is cos 0 that must be chosen randomly, and so for two random numbers r, and r 2 , 

= 2nr x (A3.26) 

cos# = 1 - 2r 2 (A3.27) 


% - 

% Script ShellSphere 

% - 

% Expanding Shell (unscattered) is BLUE 
% Sphere (scattered) is RED 
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% Scattered electron algorithm: e- travel for 
% a time tau and then scatter: when scattered, 
t the electron's energy is reduced as though it 

t had scattered with an electron from the Fermi 

t level (that is, its energy goes to the average 
t energy of the two electrons) 


m = 5.685629853; hbar = 0.6582119282; 
c = 299.792458; chem = 7.0; 


if exist('N'var') 
if exist('Ntvar') 
if exist('tau'var') 


1; else N = 800; end 
if else Nt = 8; end 
1; else tau = 1.0; end 


if exist('zinit','var') == 1; else zinit = 12.8; end 

if exist('zmin'var') == 1; else zmin = 0.0; end 

if exist('lambda'var') == 1; else lambda = 532.0; end 

% Assume all e- initially excited to chem + hbar*omega 

hf = 2*pi*hbar*c/lambda; 

vmax = sqrt((2/m)*(chem + hf) ) ; 

% circle coordinates 
cirx = cos(2*pi*(0:90)/90); 
ciry = sin(2*pi*(0:90)/90); 

phi = 2*pi*rand(l,N); 

costheta = 1.0 - 2*rand(l,N); 

sintheta = sqrt(1.0 -costheta."2); 

epx = sintheta.*cos(phi); 

epy = sintheta.*sin(phi); 

epz = costheta; 

vx = vmax*epx; vy =vmax*epy; vz = vmax*epz; 
vax=vx;vay=vy;vaz=vz; 

xp = zeros(l,N); yp = xp; zp = xp + zinit; 
xa = xp; 
ya = yp; 

% set window size 

if exist('zmax','var') == 1; else zmax = 5*zinit; end 
xmax = zmax/2; xmin = - xmax; 

for j = 1:Nt; 

vo = sqrt ( (2.0/m) * (chem + (hf/2''j))); 
for i = 1:N 

xp(i) = xp(i) + vx(i)*tau; 
yp(i) = yp(i) + vy(i)*tau; 
zp (i) = zp (i) + vz (i) *tau; 
xa(i) = xa(i) + vax(i)*tau; 
ya(i) = ya(i) + vay(i)*tau; 
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za(i) = za(i) + vaz(i)*tau; 

phi = 2*pi*rand(); 

costheta = 1.0 - 2*rand(); 

sintheta = sqrt(1.0 - costheta."2); 

epx = sintheta.*cos(phi); 

epy = sintheta.*sin(phi); 

epz = costheta; 

vax(i) = vo*epx;vay(i)=vo*epy;vaz(i)=vo*epz; 


scatter(xp,zp,8,'filled','b');hold on; 

xcen = sum(xa)/N; 

ycen = sum(ya)/N; 

zcen = sum(za)/N; 

rp = sqrt(sum((xp-xcen) . *2 . . . 

+(yp-ycen).~2+(zp-zcen).*2)/N); 
zcen = sum(zp)/N; 

plot(rp*cirx,rp*ciry+zcen,'LineWidth',4,... 

'Color', [0.6 0.6 1]) ; 
scatter(xa,za,8,'filled', 'r') ; 
xcen = sum(xa)/N; 
ycen = sum(ya)/N; 
zcen = sum(za)/N; 
ra = sqrt(sum((xa-xcen)."2 +... 

(ya-ycen).~2 + (za-zcen).*2)/N); 
plot(ra*cirx,ra*ciry+zcen,'LineWidth',4,... 

'Color',[1 0.6 0.6]); 
hold off; 

axis([xmin xmax zmin zmax]); 
axis square; grid off; 

title ( ['tiae„=„' , num2str(j*tau),'_fs' ]',... 

'FontSize',24); 
axis off; 

end 


A3.29 Gyftopoulos-Levine work function algorithm 

An algorithm for the evaluation of the work function ®(0) as a function of coverage 0, where 6 = 1 is a monolayer, as 
specified by Eq. (31.146), is given. It relies on the function call FUNglx that requires seven input parameters to evaluate 
0(0) on output: [r w , r c ,f, n, R wf , O u; , OJ. Observe that / is called f ac and R wf called rmf in the body of the algorithm. The 
default parameters are nominally modeled after cesium on tungsten. Code is inserted for plotting the results and an io 
controls which parameter of /, n, or R wf is changed. 

% - 

%% Program Gyftopoulos-Levine 

% - 

% Calculation of work function Phi as a 
% function of surface coverage theta 

% Parameters 

if exist ('N','var') == 1; else N =50; end 
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if exist('rw','var') 
if exist('rc','var') 
if exist('fac','var') 
if exist('n','var') 
if exist('rmf','var') 
if exist('Phic','var') 
if exist('Phiw','var') 
if exist('io','var') 


1; else rw 
1; else rc 
1; else fac 
1; else n 
1; else rmf 
1; else Phic 
1; else Phiw 
1; else io 


0.130;end 
0.253;end 
1.0; end 
1.0; end 
4.0; end 
1.6; end 
4.55; end 
1; end 


pxo = [rw, rc, fac, n, rmf, Phiw, Phic]; 


theta = ((1:N)-1)/(N-l); 


switch io 
case 1 

% f variation: Examine Phi for various fac 
Phix = zeros(N,4); 
px = pxo; 
for j = 1:4; 

px(3) = 0.5 + j * 0.2 5 ; 

Phix(:,j) = FUNglx(px,theta); 


figure(1); 

plot(theta,Phix) 

xlabel('\theta','FontSize',18); 

ylabel('\Phi(\theta)','FontSize',18); 

legend('f_=„0.75','f w =J1.00', .. . 

'f_= w l.25','f_= w l.50'); 
title('Variation„with„f' , ' FontSize',24); 
axis( [0 1 1.5 4.6]);axis square 

Mx = [theta',Phix]; 

dlmwrite('fac.txt',Mx,'delimiter',... 

'\t','precision',5); 


% n variation Examine Phi for two values of n 

Phix = zeros(N,2); 
px = pxo; 
filn = 'nfac'; 
for j = 1:2; 

px(4) = 1 + (j —1)*0.65; 

Phix(:,j) = FUNglx(px,theta); 

end 

figure (2) 

plot(theta,Phix) 

xlabel('\theta','FontSize',18); 


ylabel('\Phi(\theta)','FontSize',18); 



legend('n„=„l.00','n^=„1.65'); 

title('Variation„with„n' ,'FontSize', 24); 

axis ( [0 1 1.5 4.6]);axis square 

Mx = [theta',Phix]; 

dlmwrite('nfac.txt',Mx,'delimiter', . . . 

'\t','precision',5); 


case 3 

% rmf variation Examine Phi for 2 values of rmf 


Phix = zeros(N,2); 
px = pxo; 
for j = 1:2; 

px(5) = 2*j; 

Phix(:,j) = FUNglx(px,theta); 


figure(3) 

plot(theta,Phix) 

xlabel('\theta','FontSize',18); 

ylabel('\Phi(\theta)','FontSize',18); 

legend (' rmf ^=J2 ' , ' rmf _=_4 ' ) ; 

title('Variation„with„rmf' ,'FontSize',24) ; 

axis( [0 1 1.5 4.6]);axis square 




Mx = [theta',Phix]; 

dlmwrite('rmf.txt',Mx,'delimiter',... 

'\t','precision',5); 


t- 

function G = FUNglx(px,theta) 

t- 


Hx = @(x) (2*x+l).*(1 - x).~2; 

rw = px(1); rc = px(2); fac = px(3); 
n = px(4); rmf = px(5); 

Phiw = px(6); Phic = px(7); 
ro=0.43652798610891956; 

R = rc + rw; 

wac = rmf*fac*(rw/rc)*2; 

cosb = sqrt(1.0 - (2.0/wac)*(rw/R)"2); 

numr = fac*theta*cosb*(ro/rc)'2; 

deno = (1.0 + n*(rc/R)~3) *(1+(9*n/8)*(fac*theta) ."1.5); 
Gx = (numr./deno); 

G = Phic + (Phiw-Phic) .*Hx(theta) .* (1.0 - Gx); 

return 
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A3.30 Poisson distributions 

A3.30.1 Generation 

The Poisson distribution of Eq. (32.16) is generated in one of two ways: first, by simulating random events, where the 
duration after the last random event is given by Af ; - = - In r, with r a random number and k determined by 

£ A tj < X (A3.28) 

for three separate times, and, second, by simply plotting Eq. (32.16) as a continuous function of k. The output is the basis 
of Figure 32.10. 

tt PoisMaker 

t Given lambda, make a Poisson distribution four 
t different times (four simulations) and compare 
t to the analytical form 
% Terms: 

t np = maximum number of points to be shown 
t Ntotal = a big enough number so that simulation 
t will likely end before ix > Ntotal/lambda 

t iout = a flag to control the generation of outupt 
t lambda = the Poisson parameter 
t Analytic form 

t Pk(t) = (lambda'k / k!) exp(-lambda) 

if exist(' np'var' ) == 1; else np = 160 ;end 

if exist(' Ntotalvar' ) == 1; else Ntotal = 10000; end 

if exist(' iout ',' var ') == 1; else iout = 1 ;end 

if exist(' lambda' ,' var' ) == 1; else lambda = 8.0 ;end 

%% set up maximum values and zeroed out matricies 
% it is unlikely that k will ever exceed 20 * lambda 
% so restrict size of matrix holdng the data to Nkmax 
Nkmax = round(lambda*20); 

Pk = zeros(3,Nkmax); 

Nt = 0; 

Nx = round(Ntotal/lambda); 

%% Here is the main (numerical) part. 

% j is the simulation number 
% k is the number of events occuring 
% xt keeps adding events until they exceed lambda 
% -log(rand()) is the random time to the next event 

for j =1:3; 

for 1 = 1:Nx; 
xt = 0.0; 
k = 0; 

while xt < lambda; 

xt = xt - log(randO); 
k = k+1; 

end 

Pk (j , k) = Pk (j , k) + 1; 
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Nt = max(Nt,k); 

end 

end 


%% Here is the generation of the analytical form 
% It makes use of nl = n * Gamma(n), 

% and gammaln = In(Gamma(n)) 

% However many entries Pk contains, 
t just view Nt of them 
Nt = max(Nt,round(2.3*lambda)); 
np = max(Nt,np); 

Pk = Pk(:,1:Nt); 
kk = (0:Nt-l); 


xk = (0:np)*Nt/np; 

yk = Nx*exp(-lambda+xk*log(lambda)-gammaln(xk+1.0)); 


%% Send to Output file 
if iout > 0; 

Mout = [xk',yk']; 

dlmwrite('APk.txt',Mout,'delimiter',... 

'\t','precision', 6) ; 

Mout = [kk',Pk']; 

dlmwrite('NPk.txt',Mout,'delimiter',... 

'\t','precision', 6) ; 




A3.30.2 Test of acceptance-rejection method 

A test of the assignment Atj = -\nr as suggested by ref. [411] compares its results for p 0 (At) to that of the 
acceptance-rejection method. Both are compared to Np 0 {t) = (lV2)exp(-2Af). The output is the basis for 
Figure 32.11. 

% PoisTest 

% Test of Poisson distribution algorithm 
% Notes: Del is the mean time beween events 
% Delj is the individual time between events 

% from a Poisson distribution algorithm. 

% If r is a random number, then Delj = - Del*ln(r) is 

% amount of time that passes until the "next" event 

if exist(' np' ,'var') == 1; else np = 12; end 

if exist(' nb' ,'var') == 1; else nb = 6; end 

if exist(' Del ','var') == 1; else Del = 32.0; end 
if exist(' iout ','var') == 1; else iout = 0;end 

Nb = 2~nb; 

Np = 2 * np; 

t METHOD 1: Straightforward 
Delj = -Del*log(rand(1,Np)); 
xdel = ((l:Nb)-0.5)*10*Del/Nb; 
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yx(1,:) = (Np*10/Nb)*exp(-xdel/Del) ; 
yx(2,:) = hist(Delj,xdel); 

t METHOD 2: Acceptance / Rejection 

p = exp(-1.0/Del); 

Delx = zeros(l,Np); 

for j = 1:Np; 

for k = 1:20000; 

if rand() > p; 


Delx(j) = k; 


yx(3,:) = hist (Delx,xdel); 

%% 

plot (xdel,yx,'o'); 

legend('Exp','Knuth','AR'); 

if iout > 0; 

Mout = [xdel',yx'l; 

dlmwrite('POIS.txt',Mout,'delimiter',... 


'\t','precision',6); 

disp (' FINISHED : „Headers„=„x, ^Exp, „Knuth, ^AR' ) 


end 


A3.31 Electron-electron relaxation time 

An algorithm is given for the evaluation of the electron-electron relaxation time r ee (k) where E = h 2 k 2 /2m, as in 
Eq. (32.73) and contained in the matrix M, and compared to its approximation by Eq. (32.78) (see also Eq. (6) of ref. 
[548] but note that their e 0 should rather be e 2 ) and contained in the matrix M a . Default values are copper-like for the 
parameters of T [K] = 77, 150, 300, 600, 1000, and 2000. As a numerical feature, observe that z = IKE - n)/n\ the points 
chosen are so as to be uniformly spaced on a log-log plot of t ee {k) vs z by setting 



(A3.29) 


so that ?! = 0.00674 < 0.01 and z N = 403 > 300, where the second numbers are the limits of z in Figure 32.15. 
%% Tauee 

% Calculation of Lugovskoy-Bray relaxation time 

% fundamental constants and input parameters 
kb = 1.0/11604.52; m = 5.68563; hbar = 0.6582119; 

Q = 0.3599911; ao = 0.05291772; 

if exist('Nx','var') == 1; else Nx = 40 ;end 

if exist('mu','var') == 1; else mu = 7.0;end 

if exist('Ks','var') == 1; else Ks =5.2; end 

if exist('iout','var')== 1; else iout = 0;end 
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set up primary quantities 
kf = sqrt(2*m*mu)/hbar; 
qo = sqrt(4*kf/(pi*Ks*ao)); 
xo = 2*kf/qo; 

yo = 1.0 /sqrt(1.0 + xo~2); 

Gammo = (0.25*xo , '3) * (xo*yo"2 + acos (yo) -... 

yo*acos (yo , '2) /sqrt (1.0 + yo^2) ) ; 

T = [77,150,300,600,1000,2000]; Nt = length(T); 
z = exp (-5 + 11* ( (1 :Nx) - 1) / (Nx - 1) ) ; 

%% Initialize Output Matricies 

M = zeros(Nx,Nt); 

Ma = zeros(Nx,Nt); 

%% Loop through the temperatures 
for j = 1:Nt; 

beta = 1.0/(kb*T(j)); 

E = mu + (pi/beta)*z; 
k = sqrt(2*m*E)/hbar; 
x = 2*k./qo; 

y = 1.0./sqrt(1.0 + x.~2); 

Gamma = (0.25*x."3).*(x.*y.*2 + acos(y) -... 

y.*acos(y.~2)./sqrt(1.0 + y.~2)); 
tauo = (2*hbar*ao/(pi*Q*Gammo))*(Ks*beta*mu) '2; 

M (: , j) = Gammo*tauo*((E/mu) .*2) ./ ( (z + l).*Gamma); 
Ma(:,j) = tauo* sqrt(E/mu)./ (z + 1); 


%% Send to Output file 
if iout > 0; 

Mout = [z',M]; 

dlmwrite('tauee.txt',Mout,'delimiter',... 

'\t','precision',6); 

Mout = [z',Ma]; 

dlmwrite('taueeA.txt',Mout,'delimiter',... 
'\t','precision',6); 




A3.32 Resistivity and the Debye temperature 

If discrete data } can be modeled by a relation of the form y-(x), where X is a parameter, then a means of finding X is 
to minimize the least squares difference between the set of {y ; } and the values of y x (xf with is {1, N]. That is, X is adjusted 
until 5(1) defined by 




(A3.30) 
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takes on its smallest value at X = X 0 . In some circumstances, if y(x) grows exponentially large (as in Eq. (32.96)), it can 
prove more convenient to consider ln(y,) instead, in which case, 


5(2) 



(A3.31) 


Near X « X 0 , 5(2) is approximately parabolic. Letting 5(2) = Q2 2 + C 2 X + C 3 , then an approximation to X 0 is X 0 « —C 2 /2C 1 . 
Therefore first a polynomial fitting to a quadratic occurs, and then X 0 is estimated in the algorithm, or rather, by evaluating 
5(2) over n points Xj = pjX and using these points to construct a quadratic polynomial fit to 5(2) from which the coefficients 
Cj are found. Generally, the first guess of T D is the largest value of T in the data sets (culled from Tables A3.3 and A3.4), 
but occasionally this is a poor choice, and so an option is included to specify the initial value of T D from which the 
iterations begin if needed. Generally, algorithms that require such intervention on the part of the user are not robust, but 


Table A3.3 Resistivity of copper (Cu), silver (Ag), and gold (Au) in units of b<n-cm] 
from Matula [296]. 


T [K] Cu Ag Au 


1.0000 
4.0000 
7.0000 
10.000 
15.000 
20.000 
25.000 
30.000 
35.000 
40.000 
45.000 
50.000 
55.000 
60.000 
70.000 
80.000 
90.000 
100.00 
125.00 
150.00 
175.00 
200.00 
225.00 
250.00 
273.15 
293.00 
300.00 
350.00 
400.00 
500.00 
600.00 
700.00 
800.00 
900.00 
1000.0 
1100.0 
1200.0 
1300.0 
1235.1 

1337.6 

1357.6 


0.0020000 
0.0020000 
0.0020000 
0.0020200 
0.0021800 
0.0028000 
0.0044900 
0.0082800 
0.014700 
0.023900 
0.035800 
0.051800 
0.072700 
0.097100 
0.15400 
0.21500 
0.28100 
0.34800 
0.52200 
0.69900 
0.87400 
1.0460 
1.2170 
1.3870 
1.5430 
1.6780 
1.7250 
2.0630 
2.4020 
3.0900 
3.7920 
4.5140 
5.2620 
6.0410 
6.8580 
7.7170 
8.6260 
9.5920 


0.0010000 

0.0010000 

0.0010300 

0.0011500 

0.0018900 

0.0042200 

0.0095500 

0.019400 

0.034100 

0.053900 

0.077300 

0.10400 

0.13200 

0.16200 

0.22500 

0.28900 

0.35400 

0.41800 

0.57300 

0.72600 

0.87800 

1.0290 

1.1790 

1.3290 

1.4670 

1.5870 

1.6290 

1.9320 

2.2410 

2.8750 

3.5310 

4.2090 

4.9120 

5.6380 

6.3960 

7.2150 

8.0890 

8.4150 


0.022000 
0.022000 
0.022100 
0.022600 
0.025800 
0.034600 
0.050200 
0.072500 
0.10180 
0.14100 
0.18100 
0.22100 
0.27000 
0.30800 
0.39500 
0.48100 
0.56600 
0.65000 
0.85700 
1.0610 
1.2620 
1.4620 
1.6620 

I. 8640 
2.0510 
2.2140 
2.2710 
2.6850 
3.1070 
3.9140 
4.8750 
5.8160 

7.8620 

8.9860 

10.191 

II. 486 
12.854 

13.388 
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Table A3.4 Resistivity of select metals (aluminum, iron, molybdenum, nickel, platinum, and 
tungsten) in units of [^£2-cm] from the AIP Handbook, Table (9d-2a) [55]. 


T [K] Al Fe Mo Ni Pt W 


20 

40 

60 

80 

100 

150 

200 

273 

293 

298 

300 

400 

500 

600 

700 

800 

900 


0.000755 

0.0181 0.0758 

0.0959 0.271 

0.245 0.693 

0.442 1.28 

1.006 3.15 

1.587 5.20 

2.417 8.57 

2.650 9.61 

2.709 9.87 

2.733 9.98 

3.87 16.1 

4.99 23.7 

6.13 32.9 

7.35 44.0 

8.70 57.1 

10.18 


0.00261 0.014 

0.0457 0.068 

0.206 0.242 

0.482 0.545 

0.858 0.96 

1.99 2.21 

3.13 3.67 

4.85 6.16 

5.34 6.93 

5.47 7.12 

5.52 7.20 

8.02 11.8 

10.6 17.7 

13.1 25.5 

15.8 32.1 

18.4 35.5 

21.2 38.6 


0.0484 0.00196 

0.409 0.0544 

1.107 0.266 

1.922 0.606 

2.755 1.02 

4.76 2.09 

6.77 3.18 

9.60 4.82 

10.5 5.28 

10.7 5.39 

10.8 5.44 

14.6 7.83 

18.3 10.3 

21.9 13.0 

25.4 15.7 

28.7 18.6 

32.0 21.5 


calculations such as these are not something to be repeated, and so it is left as interactive. The interactive lines are under 
the heading "Driver" with (in MATLAB) the main algorithm ExtractBD invoked by entering its name and a return, 
with each line corresponding to a different metal. 

The data sets are assumed to be text files labeled rhoXX. txt, where XX is a stand-in for the element to be examined, 
for example for copper, the file is rhoCu. txt. Especially with the data from ref. [296], the first few p jr and sometimes 
the last few, are being affected by processes other than acoustic phonon, and so those data points are excluded using the 
parameters ilo and ihi. 


%% Driver: calling 

clear; iout 

ExtractBD; 

clear; iout 

ExtractBD; 

clear; iout 

ExtractBD; 

clear; iout 

ExtractBD; 

clear; TD=230; iout 

ExtractBD; 

clear; TD=450; iout 

ExtractBD; 

clear; iout 

ExtractBD; 

clear; iout 

ExtractBD; 

clear; iout 

ExtractBD; 


arguments for Program ExtractBD 

= 0; ilo=5; ihi=3; flno='Ag'; 

= 0; ilo=7; ihi=3; flno='Au'; 

= 0; ilo=7; ihi=0; flno='Cu'; 

= 0; ilo=0; ihi=0; flno= 'VI' ; 

= 0; ilo=0; ihi=0; flno='Pt'; 

= 0; ilo=2; ihi=4; flno='Ni'; 

= 0; ilo=0; ihi=0; flno='Al'; 

= 0; ilo=0; ihi=0; flno='Mo'; 

= 0; ilo=0; ihi=0; flno='Pt'; 


% - 

% Program ExtractBD 

% - 

t Iterative Search for Debye Temperature from rho(T) 
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t data Perform a least-squares search to find what value 
% of TD provides the best correspondence to experimental 
% rho(T) data. The input data file (default case) is 
% for copper, so load <rhoCu.txt>: change this if 
% considering different conductivity data for other 
t metals. 

% 

% Units of R are assumed to be microOhm-cm, so convert 
% it first to Ohm-cm which are the units of this program 


if exist('flno'var') == 1; else flno = 

if exist( 'ilo','var') == i ; else ilo 

if exist( 'ihi','var') == i ; else ihi 

if exist( 'NL','var') == 1; else NL 

fin = ['rho',flno,'.txt']; 

Rtmp = load(fln); 

T = Rtmp(: , 1 ) ? R = le-6*Rtmp(:,2); clear 
Nt = length(T); 

T = T(ilo+1:Nt-ihi); R = R(ilo+1:Nt-ihi); 


'Cu' ;end 


5 ; end 

0 ; end 

8 ; end 


Rtmp 


% Number of Data points 
Nplot = 50; 

tt Fit high temp resistivity data to Bloch Grun fit 
% Find the Debye Temperature 
% least squares search algorithm 
Nt = length(T); 

if exist('TD','var') == 1; else TD = 200;end 

Npoly = 5; pj = zeros(1,Npoly); yj = pj ; 
pj = 1.0 + 0.01*(2*(1:Npoly) - Npoly - 1)/(Npoly - 1); 
for jd = 1:NL; 

yj = zeros(1,Npoly); 

TDj = pj *TD; 
for j = 1:Npoly; 
for k = 1:Nt; 

Yj (j) = yj (j) + • • • 

(log ( ( (T (k) /T (Nt) ) *5) * (R (Nt) /R (k) ) * . . . 
(FUNbg (TDj (j)/T(k))/FUNbg(TDj (j )/T (Nt) ) ) ) ) ,~2; 

Cpoly = polyfit(pj,yj,2); 

TD = -TD*Cpoly(2)/(2*Cpoly(1)); TD = max(TD,l); 

disp ( [f lno, ' _TD_=„' , num2 str (TD) ] ) 
clear Cpoly ihi ilo j jd k NL Npoly TDj 
Rtmp = load(fln); 

T = Rtmp(:,1); R = le-6*Rtmp(:,2); clear Rtmp 
Nt = length(T); 

for j = Nt-1:-1:1; 
if TD > T(j); 

rhoD = R(j) + ((R(j +1) - R(j))*.. . 
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(TD - T(j) ) / (T(j+1) — T (j ) ) ) ; 

end 

disp ( [f lno, ' „„rhoD„=^' , num2str (rhoD) ] ) j 
Ra = ( (T/TD) . . *FUNbg (TD . /T) /FUNbg (1) ; 

figure(1); loglog(T/TD,R/rhoD,'+', T/TD,Ra,'g',1,1,'ro'); 

xlabel('T/T_D','FontSize',18); 

ylabel('\rho/\rho_D','FontSize',18); 

title([flno,num2str(round(TD))],'FontSize' , 20) ; 

% axis([0.01 10 0.0005 20]); 

%% Send to Output file 

if exist( 'iout','var') == 1; else iout = 1 ;end 
if iout > 0; 

Mout = [T,R,T/TD,R/rhoD,Ra]; 
dlmwrite ( [' TD- ' , f lno, ' . txt' ] , Mout, . . . 

'delimiter','\t','precision',6); 

end 

The algorithm uses W_( 5,x), evaluated by FUNbg (x), which in turn uses Eqs (32.88) and (32.92), depending on the 
size of the argument. The code is: 

function yx = FUNbg(y) 

fx = @(x) (x.~5)./((exp(x) - 1.0).*(1.0 - exp(-x))); 

gx = @(x) -(log(x).*5)./((1.0 - x).~2); 

zo = 124.4313306172022; 

Nx = length(y); 
z = exp(-y); 
s = zeros(Nx,l); 
for j = 1:Nx 

if y(j) <20.0; 

s(j) = integral(@(x)fx(x),0.0,y(j)); 

else 

s(j) = zo - integral(@(x)gx(x),0.0,z(j)); 

end 

end 

yx = s ; 

A3.33 Orbits in a magnetic field 

Using the methods of Section 33.1 a Maxwell-Boltzmann distribution of transverse velocities is created according to 
Eqs (33.7) and (33.8). The output is used to create the data behind Figures 33.1 and 33.2: the OrbitsFig file is used in 
the former, and the MDvo. txt file in the later. 

%%Program Orbits 

% find the initial position of N electrons and plot 
% their circular orbits about the magnetic field lines 
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if exist( 'N','var') == 1; else N = 256 ;end 

if exist( 'Nt','var') == 1; else Nt = 32 ;end 

if exist( 'cof','var') == 1; else cof = 0.1 ;end 

if exist( ' iseed' ,' var' ) == 1; else iseed = 5 ;end 

% Generate Maxwell Boltzmann Distribution of velocities 

ra = rand(l,N); rb = rand(l,N); 

vo = sqrt ( -2*log (ra)); 

theta = 2*pi*rb + 0.5*pi; 

vx = vo.*cos (theta); 

vy = vo.*sin (theta); 

% Create random uniform distribution of launch sites 
ntotal = 0; 

rx = zeros(1,N); ry = zeros(1,N) ; 

rng(iseed); 
while ntotal < N; 

a = 2*rand ()-l; b = 2*rand()-l; 
if sqrt (a*2 + b~2) < 1; 
ntotal = ntotal + 1; 
rx(ntotal) = a; 
ry(ntotal) = b; 

end 

end 

% subtract off the starting points 
rx = rx - cof*vx; 
ry = ry - cof*vy; 

% Create phi for drawing circles 
phi = 2*pi* (0:Nt-l)/(Nt-1); 

t Create the matricies xp and yp that contain orbits 
xp = zeros (N,Nt); yp = xp; 
for i = 1:N; 

xp(i,:) = rx(i) + cof*vo(i) *cos (phi + theta(i)); 
yp(i,:) = ry(i) + cof*vo(i) *sin (phi + theta(i) ) ; 

end; 

ymax = max(max (yp)); xmax = max(max (xp)); 
ymax = max (ymax,xmax); 

ymin = -min(min (yp)); xmin = -min(min (xp)); 
ymin = max (ymin,xmin); 

L = max (ymax,ymin); 

t Plot the initial starting points and the orbits 

plot (xp(1,:),yp ( 1 , :), ' c') ; 

axis ([-L L -L L]); axis square 

hold on; 

for j = 2:N; 

plot (xp(j,:), yp(j,:),' c' ); 
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plot (xp ( : , 1) ,yp ( : , 1) , 'k.') ; 

plot(cos (phi), sin (phi),'r','LineWidth',2) 

hold off; 

axis off; 

clear a b i j ntotal xmax xmin ymin 
clear ra rb rx ry iseed vx vy 

%% Output to files 

print -dpdf OrbitsFig 

% Send to Output file 

dlmwrite (['MDvo.txt'],vo','delimiter',... 

'\t','precision',6); 


A3.34 Trajectory of a harmonic oscillator 

Using the discrete methods of Section 33.4.1, the evaluation of the trajectory of a one-dimensional harmonic oscillator 
for which x(t) = cos(mt) is compared to its evaluation using a leap-frog (Eq. (33.59)) and an average value (Eq. (33.62)) 
method. The output is the basis for Figure 33.16. 

% - 

% PROGRAM TrajectoryHO 

% - 

% plotting of trajectories for the harmonic oscillator 
% x = cos(w*t); v = -w*sin(w*t); 

% . 

% Input terms: 

% Nt = Number of time steps 

% n = to evaluate frequency by w = 2*pi*n/Nt 
% Note that Delta-t = 1 and x(0) =1 for convenience 
% . 


% Set up needed term 

if exist('Nt','var') == 1; else Nt = 48;end 

if exist (' n','var' ) == 1; else n =2 ;end 

if exist('iout','var')== 1; else iout = 0 ;end 

w = 2 *pi*n/Nt ; 

t = (l:Nt) -1; 

xj = zeros(l,Nt); xj(1) = 1; 
vj = zeros(l,Nt); 

%% Leap frog algorithm 

Px = [1; -0.5*w*sin(-0.5*w)]; 

Mo = [1, 1; 0, 1]; 
for j = 1:Nt-l 

Ax = - (w~2) * [xj (j ) ; xj (j ) ] ; 

Px = Mo*Px + Ax; 
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Xj (j+1) = Px(l) ; 

end 

xlf = xj; 

tt Average algorithm 
Px = [1; 0] ; 

Mo = [1, 1; 0, 1]; Do = [0.25, 0.25; 0.5, 0.5]; 
for j = 1:Nt-l 

Ax = - (w / '2) * [xj (j ) ; xjiljll t %Prediction 

Ps = Mo*Px + Do *Ax; 

Ax = - (w , '2) * [xj (j ) ; Ps (1) ] ; %Correction 

Px = Mo*Px + Do *Ax; 
xj (j+1) = Px(l) ; 

end 

xav = xj; 

%% Plotting and output section 

% first show the trajectory x(t), then> show how 
% both methods compare to the analytical result 
% x(t) = cos(w*t) 

% by looking at the differences 

xt = cos(w*t); 

figure(1);plot(t,xt,'k',t,xlf,'ro',t,xav,'bs'); 
legend('Analytic','Leapfrog','Average'); 
title('Trajectory','FontSize',20); 
xlabel ('time„[\Delta„t] ' , ' FontSize' ,18); 
ylabel('positions(t)', ' FontSize',18); 

figure(2);plot(t,xt-xlf,'-ro',t, xt - xav,'-bs'); 
legend('Leapfrog','Average'); 

title('Difference„with„Analytic', 'FontSize',20); 
xlabel('time„[\Delta„t] ','FontSize',18); 
ylabel('positions(t)','FontSize',18); 

if iout > 0; 

Mxt = [t',xt',xlf',xav']; 
filenam= 'xvt.txt'; 

dlmwrite(filenam,Mxt,'delimiter',... 

'\t','precision',8); 

end 

A3.35 Trajectories for emission from a hemisphere 

The numerical evaluation of electron trajectories from a hemisphere is done by using Eq. (33.62) and makes use of 
the average method of Eq. (33.60). The results are compared alongside the analytical evaluation using Eq. (33.78). 
The output data (x = p/a vs y = z/a) is shown in Figure 33.20. 

% - 

% PROGRAM TRAJECTORIES 

% - 

t Dimensionless Representation: 
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t (tau = time, a = boss radius 

% x = rho/a; u = vrho / vo; 

% y = z/a; w = vz / vo; 

% t = tau s; 

% . 

% Input terms: 

% Np = [N]umber of [p]articles 
% Nt = [N]umber of ft ]ime steps 
% ds = dimensionless time increment 
% kappa = Fo * a / m * vo~2 where 
% Fo = Field [eV/nm] 

% a = boss radius [nm] , 

% m = electron mass [eV/c~2] , 

% vo = initial velocity [nm/fs] 

% - 

Dimensionless Frho Fz in terms of rho = a*x, z = a*y 
% fx = ®(x,y) 1.0 - (x~2 - 2*y~2) / (x~2+y~2)~2.5; 

% fy = @(x,y) 3*x*y/ (x~ 2+y~ 2) ~ 2.5 ; 

% . 

fy = @(x,y) 1.0 +... 

((-x.-2 + 2*y.~2)*/((x.~2+y.*2).*2.5)); 
fx = @(x,y) 3*x.*y./((x.*2+y.*2) .*2.5); 

% Set up needed term 

if exist('Np','var') == 1; else Np = 8;end 

if exist('Ntvar') == 1; else Nt = 24;end 

if exist('ds','var') == 1; else ds = 0.05;end 

if exist('thetamax','var') == 1;... 
else thetamax = 45.0;end 

if exist('kappa','var') == 1; else kappa = 15.0;end 
if exist('nsig','var') == 1; else nsig = 2;end 

theta = thetamax*pi*(0:Np-l)/(180*(Np-1)); 
sigma = sqrt(1.0 +... 

(2*(nsig~3 - 1)*kappa/nsig~2)*cos(theta)); 

%% Initial conditions and Integration. 

% x = rho/a, y = z/a, u = vrho/vo, w = vz/vo 
% zero out all the vectors and allocate their space 
% for computation 

x = zeros(Nt,Np); y = x; u = x; w = x; 
xu = x; yu = x; xc = x; yc = x; 

% Initial launch sites and velocities 
x (1, :) = sin(theta); y(l,:) = cos(theta); 
u (1, :) = sin(theta); w(l,:) = cos(theta); 

Analytic trajectories: set up time 
ts = (0:Nt-1)*ds; 
for ip = 1:Np; 
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xu(:,ip) 
xc(:,ip) 
yu(:,ip) 

yc(:,ip) 




sin(theta(ip))*(1.0 + ts) ; 

sin(theta(ip))*(1.0 + ts*sigma(ip)); 

cos(theta(ip))*(1.0 + ts)... 

+ 0.5*kappa*ts."2 ; 

cos(theta(ip))*(1.0 + ts*sigma(ip))... 
+ 0.5*kappa*ts.~2; 


i-% Numerical 
for j = 1:Nt- 
XS = x(j , 

ys = y(j, 
u(j+l, :) 
w(j+l, : ) 
x(j+l, : ) 

y(j+l- : ) 


trajectories 

■1; 

:) + ds*U(j, :) +. . . 

0.5*kappa* (ds , '2) *fx (x (j , :) ,y (j , :) ) ; 

:) + ds*w(j, :) + . . . 

0.5*kappa* (ds , '2) *fy (x (j , :) ,y (j , :) ) ; 

= u(j,:) +... 

0.5*kappa*ds*(fx(xs,ys)+fx(x(j,:),y(j,:))); 
= w(j , :) +. . . 

0.5*kappa*ds*(fy(xs,ys)+fy(x(j,:),y(j,:))); 
= x(j,:) + 0.5*ds*(u(j,:)+U(j+l,:)); 

= y(j,:) + 0,5*ds*(w(j,:)+w(j+l,:)); 


end 


% For the j-th trajectory, plot x(j,:) vs. y(j,:) 


A3.36 Monte Carlo and integration 

The numerical algorithm to generate Figure A1.3 in Section Al.2.5 is given, which calculates Eq. (A1.47) for the uni¬ 
form (p(x) =1), the power law (p(x) = (1 + v)x v with v = 2), and the acceptance-rejection method. The code has been 
explicitly made to be compact, so that each estimate of the integral is performed using a sum over vector elements. 
The acceptance-rejection method employs a trick of max(x, 0)/x = 1 if x > 0 and 0 otherwise. 

%% Program Monte Carlo 

% Compare three different methods of evaluating a 
t definite integral using Monte Carlo techniques: 
t 1. Acceptance / Rejection A/R 

t 2. Uniform p(x) distribution Uni 

t 3. Power Law p(x) distribution Pow 

% Do this calculation Nj times and compare the output: 

% The more accurate method(s) will cluster around ya 

clear;clc; %Clear the work space 

%%Parameters 
Nx = 512; Nj = 64; 

a = 5; p=2 ; 

iseed = 5; 

rng(iseed); % initialize the random number generator 

% create the normalized function f(x) and its 
% analytcal integrated value (from 0 to 1) ya 
ya = (1/a) - l/(exp(a) -1); 

Fx = @ (x, a) (exp (a*x) - 1) / (exp (a)-1) ; 




% Initialize the vectors 

Iuni = zeros(l,Nj); Ipow = zeros(l,Nj); 

Iaj = zeros(l,Nj); Io = ya*ones(l,Nj); 

Do all the Monte Carlo calculations in the same loop 
for j = 1:Nj; 

rj = rand(l,Nx); t create same random vector for all 

%% Method 1: Uniform random variable 
hj = rj; pj = ones(l,Nx); 

Iuni(j) = sum(Fx(hj,a)./pj)/Nx; 

%% Method 2: Power Law random variable 
hj = rj.-(1/(p+1)); pj = (1+p)*hj.~p; 

Ipow(j) = sum (Fx (hj , a) ./pj )/Nx; 

%% Method 3: Acceptance - Rejection 

hj = Fx(rj,a); rp = rand(l,Nx); 

laj(j) = sum(max(hj - rp, 0.0)./(hj - rp))/Nx; 


end 

%% Create index (x) and sort the data: shows better 
% how well the data clusters around the actual value ya 
x = (1:Nj)'; 

Itot = [sort(Iuni)sort(Ipow)sort(Iaj)',Io']; 

figure(4); plot(x,Itot); 

legend('Uni','Pow','A/R','Analytic'); 

title('Monte„Carlo„Comparison','Fontsize',24); 

xlabel('Run^Index','FontSize',18); 

ylabel('MC^Estimate','FontSize',18); 

%% Output the data 
Mx = [x,Itot]; 

dlmwrite('Monte.txt',Mx,'delimiter','\t','precision',6); 
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Coldplay, 296 

color combinatorics, 33-35, 34/ 


Columbus, xiii 
combinatorics, 33-36, 34/ 
comic premise, 177-178, 177n7 

computationally kind model, 193-198, 195/ 
computations, 7-8 
concentration gradient, 385 
conducting hemispheres, 406^08, 406/, 408/ 
conduction, 329-339 

Poole-Frenkel transport, 329-333, 330/-333/ 
tunneling, 333-336, 335/, 336/ 

Zener breakdown, 329 
conduction bands, 262, 264, 264 f, 265/ 
band gap, 266 
in insulators, 278-279, 279f 

in semiconductors, 278-279, 299-301, 299/, 3001, 301/ 
sinusoidal potential and, 269-272 
and temperature, 281, 282/ 
band overlap, 279 
band-to-band 
transport, 319-321, 320/ 
tunneling, 164 
bending, 310-311, 311/ 

conductivity, 15-16, 277. see also semiconductors 
electrical, 15-16, 66-68 
thermal, 16-18, 17/, 68-69, 304-306 
confluent hypergeometric function, 317 
conical field emitters, 409, 424 
conjugate variables, 84-87 
constant(s) 

current approximation, 165-168 
density approximanon, 164-165 
of motion, 92-95 
universal, 145 
contacts, 329-339 
container problem, 80-81, 81/ 

controlled porosity dispenser (CPD) cathodes, 399-401, 400/ 
convolution theorem, 288 

Conwell-Weisskopf (CW) approach, 502-503, 515 
Cooper, James Fenimore, 209 
Copernicus, Nicholaus, 259 
copper(Cu) 

cesiated, 471-472, 472/ 

Debye temperature, 70 
electron quantity, 25 
emittance parameters, 557, 557t 
heat capacity of, 19, 19/ 
quantum efficiency, 453, 453/, 471-472, 472/ 
reflectivity, 452 
resistivity, 652f 
core(s) 

multiple, 274-275 
terms for, 357-360, 358f, 359/ 
corposants (St. Elmo's fire), 403, 404-405, 404/ 
correct Boltzmann counting (IV!), 44 
correlation energy V m . 356 
correlation terms, 356-357, 356t, 357/ 
cosmic microwave background radiation (CMBR), 21,21/ 
Coulomb potentials, 92, 273 
screened or Yukawa, 274, 275/, 501, 506, 614-616 
coupling constants, 281 

CPD cathodes, see controlled porosity dispenser cathodes 
creation and annihilation operators, 96nl0 
Creon, 163 

Crossan, John Dominic, 552 
crystals work function 
values, 478, 478/ 
variations, 478, 479/ 
current, 329-339 

constant current approximation, 165-168 
density, 15, 47, 63n6, 165n7, 166 
anatomy of, 127-133 
changes, 366-367 
equation for, 127-128 
FN, 366 
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for heavily doped semiconductors, 317, 317/ 
metal-insulator-metal, 303-309 
RLD, 135-137, 366 
supply function, 127-128 
and effective mass, 301-302 
flow, 277 

rectification, 305-306 
source-limited, 372 

tip models, 415-417 
tunneling, 329 

cyclotron resonance frequency, 525 

damping constant, 457 
Dante Alighieri, 273 
Darius I, 61, 567n2 
dark current, 174, 174/ 
darken equation, 388-389 
Darrow, Clarence, 541 
data analysis, 593-595, 5951 
Davis, Paul R„ 429 
de Broglie, Louis, 11 
Debye screening constant, 274, 274nl5 
Debye temperature, 70 
algorithm for, 651-655 
for diamond, 511 
metal, 510-511, 511 1 
deformation, acoustic potential, 511 
degenerate electron density, 313 
degenerate gas, 55 

degenerate semiconductors, 284-285 
Dekker, A. J., 161 
Delacroix, Eugene, 209 
delayed emission, 473 
De Morgan, Augustus, 412, 412n5 
density, see also current density 
algorithm, 596-598 
constant, 164-165 
degenerate, 313 

and dipole components, 348-352, 350/ 
functional theory, 361 
high temperature/low density, 273-274 
low temperature/high density, 273 
matrix, 86-87, 103-104 
near barriers, 341-352 
of states, 257-259, 259/ 
examples, 64 

for one dimension, 64, 258-259, 259/ 
for three dimensions, 258-259, 259/ 

TMA vs. Kemble approximation, 229-231, 230/ 
for two dimensions, 64, 258-259, 259/ 
for various g, 64, 65/ 
surface, 376 
in typical metals, 61 
depletion layers, 319-324, 320/, 323/ 
depletion regions, 311 
depletion width, 165 
de Waal, Franz, 125 
diamond, 282 

acoustic relaxation times, 511-512, 515/ 
amplifiers, 156/ 

Bethe approximation, 490-492 

concept of, 487-494, 488/, 489/, 491/, 492/ 

initial conditions, 492-494 

maximum yield, 492, 492/ 

Monte Carlo simulations, 516-524, 516/-518/, 520/, 523/ 
penetration range, 492-494 
response time, 516-524, 522/, 523/ 
secondary electrons generated, 493, 493/ 
band gap, 512 
boron doping, 512, 512n22 
Debye temperature, 511 
high purity, 519f 
parameters, 278-279, 279 1 


Poisson's equation, 319-320, 3211 
relaxation times, 511-512, 515/, 5191 
secondary electron yield, 487, 488/ 
secondary emitters, 156/ 
dielectric constant, 286 

bound-electron-like/Lorentz component, 461, 465 
example approximation, 460 
free-electron-like/Drude component, 461, 465 
harmonic model, 460—461 
interband, 465 
intraband, 465 

Lorentz-Drude model, 458-465, 4651, 466/ 
quantum model, 463—i65, 466/ 
quasi-classical Boltzmann model, 461-463 
and screening, 287-289 
for silicon, 460 

differentiation, 577-582 
first-order, 548, 580, 582-583, 619-620 
numerical solution of ODEs, 582-584 
second-order, 580-582, 583-584, 620-622 
simple ODE solvers, 619-622 
diffusion coefficient 
adatom motion and, 385-387 
simple model, 382-383 

diffusion of coatings, 375-391, 376/, 379/-381/ 
analytic model, 376-377 
lattice gas models, 381 
numerical model, 377 
point source model, 376-381, 380/, 381/ 
point source solution for, 377-378 
shuffling algorithm for, 383-384, 384/ 
diffusion, thermally activated field assisted, 416 
digitizing image files, 632-633 
d-dimensional phase space, 259-260, 260/ 
diodes 

metal-insulator-metal, 307-308, 308/ 
metal-semiconductor, 305-306 
dipole components, 348-352, 350/ 
dipole models, 409, 411,41 If, 412/ 

Dirac delta function, 574 
Dirac, Paul A. M„ 73, 255, 354 
directed energy, 525 
discharge, 245 

dispenser cathodes, 136/, 371-401, 372/ 
barium, 401 

current density from, 375, 375/ 
diffusion of coatings, 375-391, 376/ 

"good cathodes," 375, 375/ 
lifetime, 375-376, 399-401 
sintered wire CPD cathodes, 399-401 
surface density, 376 
tubes in, 399, 399/ 
typical Miram plots from, 373, 373/ 
dispersion, 291-293, 504 
hyperbolic, 295, 298 
distinguishable particles, 25-28 
cautionary points, 28 
examples, 26-27, 26f, 27/, 31 
distributions, 63-64. see also specific formulations by name 
energy distribution curves (EDCs), 543 

experimental thermal-field energy distributions, 175-176, 176/ 

of holes, 280, 284-285, 285/ 

of income, 26-27, 26f, 27/ 

log-normal, 640-643 

most probable, 39-41 

normal, 37n2 

power law, 577, 577/ 

quantum, 49-60, 113 

steady-state, 114 

theoretical thermal-field energy distributions, 176-181, 178/-180/ 
uniform, 577, 577/ 

Djerassi, Carl, 145 
Dolan, Winthrop W., 403 
dominant charge carrier, 280 
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donors, 282 
dopants, 54 
DopePhi, 323/, 324 
doping, 277, 281-285, 284/, 285/ 
density, 616-618 
Dostoyevsky, Fyodor, 336-339 
Dowell, David H., 467, 563 
Dowell-Schmerge relation, 467-469, 538-539 
downwind differencing scheme (DDS), 580 
Dresher, Melvin, 73 
drift-diffusion-like material, 446, 447/ 
drift-diffusion 
scattering, 447, 447/ 
semiconductors, 447, 447/ 

Driver routine, 640-641 
Drude model, 13-19, 48, 462 
Drude theory, 461 
DuBridge, Lee A., 153 
DuBridge photoemission data, 152/ 
Dulong-Petit 
law, 41 
value, 19 

Diirrenmatt, Friedrich, 42, 363 
Dushman, Saul, 46 
thermionic data, 47/ 

Dyke, Walter P„ 403 
Dylan, Bob, 247, 487 
Dyson, Freeman, 453 

Eagleton, Terry, 506 
Ecclesiastes, 110-118 


income distributions, 26-27, 26f, 27/ 
income evaluations, 26-28 
Edison effect, 13, 14/, 122, 163 
Edison, Thomas Alva, 13 
Ehrenreich, Henry, 458 
eigenvectors, 75-77 

Einstein, Albert, 19, 21, 21nl0, 56, 123, 147, 297, 333, 417 
elastic scattering, 501 
electrical conductivity, 15-16, 66-68 
electric field, 9, 12, 463nl7 
electrochemical potential, 310 
electron(s) 
beams, 529-530 
back-scattered, 155 
cold launch, 532, 532/, 533/ 
compression of, 532, 532/ 
envelope equation, 528-532 
factors that affect, 529, 530/ 
hot launch, 532, 532/, 533/ 
interaction factors, 529, 530/ 
intermediate energy, 155 
lithography, 4, 525 

maximum tolerable emittances, 530-531, 530t, 531 
physics of, 525-565, 532/ 
space-charge-dominated, 564-565, 564/ 
trajectories, 532, 532/, 533/, 546-550 


density, see current density; density 
near step function, 109, 110/ 
variations, 109, 110/ 
effective mass, 291-302, 292n4 
emission area, 173n3 
flow, 16, 17/ 
gas, 58 

and holes, 279-281 
mass, 8, 9 1 

vs. band gap, 299-300, 299/ 
effective, 291-302, 292n4, 300t, 302nl7 
metal quantities, 25 
nearly free, 279 

number density (carrier concentration), 54 
orbits, 526-528, 527/ 
as particles, 291-292 


sphere/scattered, 471^74, 470/ 472/-475/ 
approximation, 485 
transmission probability, 104-106 
transport over cliff, 106 
transport past barriers, 91, 102-110 
and white dwarf stars, 57-60 
electronegativity, 266-269, 267/, 268/ 

Pauling electronegativities, 268, 268/ 
and work function, 480, 480/, 481/ 
electron-electron (e-e) 
relaxation time, 650-651 
scattering, 446-447, 447/, 506-508, 508/ 


barriers to. see barriers 

brute force evaluation, 189-193, 190/-193/ 

for bumps, 545-563 


al equations, 119-206 
rbon fiber fields, 546, 546/ 
ationally kind model, 193-198 
m layers, 319 


experimental thermal-field energy distributions, 175-176, 176/ 
factors affecting equations, 314-315 

for flat surfaces, 533-545 
general thermal-field 
current density, 195-198, 197/, 198/ 
emission code, 198-206 

general thermal-field-photoemission equation, 173-206, 209 
intensity, 543, 543/, 544/ 
ion-enhanced, 245-253 

modeling, 145 

Monte Carlo simulations, 443, 444/, 513, 516-517, 516/-518/ 

nascent model, 46-48 

notional area, 406-408 

photoemission, see photoemission 

and realistic surfaces, 563-565 

secondary, see secondary emission 

simulation code, 199-204 

space-charge limited, 121, 124 


transmission, see transmission 

for uniform surfaces, 533-545 
units based on, 11-12 

electron-phonon (e-p) scattering, 446-447, 447/ 
elegant (code), 532 
Eliot, T. S„ 369 

ellipsoidal emitters, 424-425, 426/ 

Emerson, Ralph Waldo, 219 


for bumps, 545-563 
definition of, 443, 533-534, 536 
and electron beams, 529-530, 530/ 
evaluation of, 554, 554(, 564 
field emission, 539, 564 
flat surface, 533-545 
and hemispheres, 550-552, 552/ 
intrinsic, 533, 546 
example evaluation of, 563 
and space charge, 564-565, 564/ 
maximum tolerable, 530-531, 530f 
notional, 558-561, 560/ 
one-dimensional equations, 539-541 
photocathode, 538-539 
photoemission, 538-545 
and quantum efficiency, 467-469, 470/ 
and realistic surfaces, 563-565 
rms, normalized, 534-537, 534/-536/, 538 
and secondary emission, 542-545, 543/, 544/ 
single hemisphere, 554-557, 554(, 557/, 558/ 
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emittance (continued) 

and space charge, 541-542 
thermal, 537-538, 555 
and undulating surfaces, 563-564 
uniform surface, 533-545 
emitter array, 561-563 
endomythium fables, 310n9 
energy, 257 
conversion, 525 
correlation, 356 

density of states, 257-259, 259/ 
and entropy, 41-42 
exchange, 354-356, 355/ 

Fermi, 53-54 

kinetic, 353-354 

relaxation time, 501 

and sinusoidal potential, 269-272 

stupidity, 356 

energy distribution curves (EDCs), 543 
entropy 

Boltzmann's equation for, 33 
energy and, 41-42 
example calculations, 43^4, 46 
probability and, 30-33 
Shannon, 32 
Epaminondas, 534 
epimythium fables, 310n9 
Erasmus, 238, 337n9 
error function, 587 
Esaki, Leo, 230n4, 240 
escape probability, 485 
Euclid, 567 

Euler-MacLauren series method, 570 
Euler/upwind differencing scheme (UDS), 377 
Eupalinus, 106 
evaluation, 7-12 

evaporation of coatings, 391-393, 391/-393/ 
exact tunneling and transmission evaluation, 207-253 
exchange-correlation, 360-362, 361/, 362/ 
exchange energy, 354-356, 355/ 

experimental thermal-field energy distributions, 175-176, 176/ 

fables, 310n9 
factorials, 35 
false positives, 29 
Faraday cup, 432-433 
Faraday, Michael, 277, 415 
fatal approximation, 450, 476 
Feinleib, Morris, 371 
FELs. see free electron lasers 
Ferine, 92n5 
Fermi-Dirac (FD) 

distribution, 25, 49, 50, 67, 280, 508 
statistics, 56-57, 57/, 509 
Fermi energy, 53-54 
Fermi level, 344 
Fermi momentum, 53-54 
fermions, 25, 62 

Feynman, Richard P., 73, 113, 232, 233, 503 
field currents, 147-148 

field-dependent work function, 363-366, 365/-366/, 632 
field effects, 350 
field emission, 5, 121-123 
brute force evaluation of, 189-193, 190/-193/ 
computationally kind model, 193-198 
data analysis, 145-146, 146/, 147/ 
depletion layers, 319 

experimental thermal-field energy distributions, 175-176, 176/ 
general thermal-field 
current density, 195-198, 197/, 198/ 
emission code, 198-206 

general thermal-field-photoemission equation, 173-206, 209 
internal, 319-321, 320/ 
log-normal distribution of, 640-643 
from metal, due to quantum mechanical tunneling, 88 


MTE for, 539-540 

MTE values, 540-541 

n= 1 regime, 194-195, 195/ 

N(n,s,u) function, 182, 182/, 183, 187, 188/ 
resonance in, 336-339, 337/, 338/, 339/ 
resources and further reading, 125 
from semiconductors, 622-626 
simulation code, 199-204 

theoretical thermal-field energy distributions, 176-181, 178/-180/ 
field emission data, 145-146, 146/, 1461, 147/ 
field emission displays (FEDs), 4 

field emission electron microscopy (FEEM), 416, 550-551 
field emission microscopy, 4 

held emission scanning electron microscopy (FESEM), 4 

examples, 540, 564 
parameters, 557, 5571 
single hemisphere, 555-557, 557 1, 558/ 
field emitters, 403-^42 

boundary element methods, 429-431, 429/, 432, 432/, 4321 

bowling pin model, 428 

conical, 409, 424 

data, 638-640, 6381 

dipole model, 409, 4111 

ellipsoidal, 424-425, 426/ 

floating spheres, 409 

hemispheres, 406-408, 406/, 408/ 

hemispherical, 425, 426/ 

high aspect ratio, 439, 439/ 

hybrid analytic-numerical model, 425-433 

hyperbolic, 422-423, 423/, 425, 426/ 

line charge models, 417^20, 418/-421/ 

monopole model, 409 

notional area, 406-408 

penetration factors, 457-458 

photoemitters, 443-486 

point charge model, 408-412, 410/, 4111, 412/, 435-436, 436/, 4371 

prolate spheroidal approach to, 420-425 

reflectivity, 457-458 

Saturn model, 426-428, 427/ 

semi-analytic model, 431^133, 433/ 

sharpest, 440-442, 441/ 

shielding, 433-437, 434/, 437/ 

silicon, high aspect ratio, 439, 439/ 

simple line charge models, 417-419, 418/ 

Spindt-type, 409, 416-417, 416/ 
statistical variation, 438-442, 439/-441/ 
tapered line charge models, 417, 419-420, 420/, 421/, 434 
tip current models 
assessment of, 415-417 
statistical variation, 438-439, 440/ 
field enhancement, 403^05, 404/, 411, 413 
factors, 405, 413-415, 413/-415/, 423 
fifth element, 91 
final states allowed, 514-515 
fine structure constant a, 8-9, 9 1 
finite differences, 579-582 
first Born approximation, 360 
first-order differentiation, 548, 580, 582-583 
simple solvers, 619-620 
flash X-ray sources, 4 
flat surfaces, 533-545 
floating spheres, 409 
Flood, Merrill, 73 

fluorine, Pauling units, 267, 267n8 
FN. see Fowler-Nordheim (FN) equation 
foolishness, 337n9 

Forbes-Deane approximation, 144, 605 
Forbes, Richard, 141n6, 317, 341 
force, 9, 12 

brute force evaluation, 189-193, 190/-193/ 
for distinguishable particles, 43 
for indistinguishable particles, 43 
FORTRAN (code), 222, 494 
general thermal-field emission code, 198-206 
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random number call, 497nl0 
foundations, 1-118 

four-barrier superlattice potentials, 232, 232/ 

Fowler, Ralph Howard, 57-58, 60, 139, 149 
Fowler-Dubridge (FD) equation, 149-153 
approximations, 149-152 
modified, 451, 455 
Fowler function U(x), 152/ 

Fowler-Nordheim (FN) 
current density, 366 
curve or plot, 145-146, 147/, 440^141 
equation, 139-148, 173-174, 174/, 190/ 
accuracy, 142n7 

Forbes-Deane approximation, 145 
image charge approximation, 140-145 
scaling arguments, 140 
transmission probability, 539 
for triangular barrier, 140-141, 221-222 
Franklin, Benjamin, 164 
Frayn, Michael, 149 
free electron lasers (FELs), 443, 525 
High Brightness Free Electron Laser (HEBAF), 174, 174/ 
Frenkel, Jakov Ilich, 329 
Friedel oscillations, 342, 342/ 

Frost, Robert, 195-196, 513 
fugacity, 42 

functions, 585-590. see also specific functions by name 
Airy, 588-589, 588/, 589 1, 590/ 
error, 587 
gamma, 585 

Legendre polynomials, 587 
Lorentzian, 590 
Riemann zeta, 585-586 
trigonometric, 585 
wave, 596-598 
fundamental constants, 9, lot 


Galbraith, John Kenneth, 585 
Gallipoli, 60, 60nl4 
gallium arsenide 
NEA bulk, 474, 474/ 
parameters, 278-279, 2791 
game theory, 73-74 

Monty Hall problem, 80-84, 81/, 81nl6, 83/ 
prisoner's dilemma, 73-78 
stable strategies, 77-78 
tit-for-tat strategy, 75-78 
gamma function, 585 

Gamow factor, 128-131, 129/, 209, 232-233, 236-238 
algorithm for, 606-608 
example calculations, 130-131 
for image charge, 606-607 
numerical evaluation of, 189, 605 
quadratic, 326 

for rectangular barriers, 211-212 
for triangular barriers, 215, 607-608 
Gandhi, Mahatma, 460 
Garfunkel, Art, 303 
Gaussian distribution, 37n2 
Gaussian pillbox argument, 171 
Gaussian quadrature method, 573-574 
Geisel, Theodor, 266 
General Schedule (GS), 25n5 
general thermal-field 
current density, 195-198, 197/, 198/ 
emission code, 198-206 

general thermal-field-photoemission equation, 173-206 
development of, 209 
N(n, s, u) function, 181-189 
GenLN routine, 643 
germanium: parameters, 278-279, 279 1 
Gibb's paradox, 42-44 
Gibran, Kahlil, 11 
Gibson, John W., 371 
gold (Au): resistivity, 6521 


Golden Balls, 74n4 
Goldilocks principle, 544, 544nl6 
Goldman, William, 291 
"good cathodes," 375, 375/ 

Grand Partition Function (Z), 45-46, 50 
Greeks, 61 

Green, Michael C„ 371 
Green's function approaches, 377 
group velocity, 291 
Gyftopoulos, Elias P., 478 

Gyftopoulos-Levine (GL) theory, 479-480, 482n31, 483, 484/ 
modified (MGL), 479 
work function algorithm, 645-647 

Haas, George A., 356, 371 
Haidt, Jonathan, 8 
hair thickness, 399n22 
halo, 529, 530/ 

Hamilton, Edith, 426 
Hamiltonian, Schrodinger, 89 
Hammersley, J. M., 383 
Handscomb, D. C„ 383 
harmonic model, 460-461 
harmonic oscillators, 91, 110-118 
quantum phase space approach to, 113-118, 118/ 
trajectory of, 657-658 
wave function approach to, 110-113 
Hartman, Robert L., 429 
Hc(u) function, 608-610 
heat capacity, 69-71 
of copper, 19, 19/ 
of solids, 19 

Heaviside step function, 451 
Heisenberg representation, 88-89 
Heisenberg's quantum mechanics, 78 
Heisenberg's Uncertainty Principle, 85, 87 
Heisenberg, Werner, 84, 87, 149 
Helmholtz's equation, 359 
hemispheres 

emittance, 550-552, 552/ 

emitters, 406^08, 406/, 408/, 416, 552, 552/, 554-557, 554t, 558/ 
field emittance, 555-557, 557(, 558/ 
notional emission area factors, 425, 426/ 
notional emittance, 558-561, 560/ 
parameters, 557, 557 1 
photo emittance, 556-557, 557f, 558/ 
single hemisphere emittance, 554-557, 554t, 557 1, 558/ 
thermal emittance, 555, 557, 557 1, 558/ 
trajectories for emission from, 658-660 
Hermes, 286 

Hermite polynomials, 113 
Herodotus, 106 
Hertz, Heinrich, 123 

High Brightness Free Electron Laser (HIBAF), 174, 174/ 
high-power microwave (HPM), 4, 525 
Hilbert, David, 75 
history, 121-126 

discovery of electron emission, 13 
resources and further reading, 124-126 
Hobbes, Thomas, 561 
Hoffmann, Roald, 145 
hole, 279-281 

distribution of, 280, 284-285, 285/ 
transport, 324 
Holland, Chris E„ 438 
Homer, 1, 77, 91n4, 155, 245, 516, 533 
Hor, Charles, 492 
Horace, 366 
Hornsby, Bruce, 388 
Horrendous Space Kablooie (HSK), 21 
hot launch, 532, 532/, 533/ 

Houston, Edwin J„ 13 
Hoyle, Fred, 21n9 
hubris, 91n4 
Hume, David, 28, 528 
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Hummel, Rolf E„ 15 
humor, 7 

Huxley, Thomas H., 13 
hybrid analytic-numerical model, 425-433 
hydrogen atom 
algorithms, 598-601 
Bohr model, 9-10, 92 

ladder operators (h ± ), 95-97 
orbital shape, 100-102, 100/-102/ 

Paschen's curve for, 245, 246/ 
probability distribution, 100-101, 101/ 
quantum mechanical model, 11 
as quintessential problem, 91-102 
radial equation, 97-100, 98/ 
hyperbolic dispersion relationship, 295, 298 
hyperbolic emitters, 422-423, 423/ 
notional emission area factors, 425, 426/ 
hyperbolic relations, 296-298, 296/ 
hyperbolic tangent model (tanh model), 351-352 

Icarus, 273nl4 
ideal gas, 42, 44-45, 62 
average energy, 40-41 
pressure difference, 44-45 
identity, 80-84, 81/ 
image charge 
approximation, 140-145 
example calculations, 133, 142, 143 
many-body effects and, 353-362 
for metals, 132, 132/ 
potential, 131-133, 132/ 
analytic, 363-367 
numerical evaluation of, 605 
quadratic, 325, 325/ 
semiconductor, 286-287, 287/, 314 
image files, digitizing, 632-633 
IMPACT (code), 532 

impulse approximation, 552-553, 553/, 553n25 
impurity, ionized, 512-513 
income model, 26-28, 26f, 27/ 
indistinguishable particles, 31 
inductive output amplifiers (IOAs), 526 
infinite barriers, 341-344, 342/, 344/, 348-349 
two, 344-346, 345/, 346/ 
insulators 

band gap, 278-279, 279 1 
conductivity, 277-278, 278/ 
dielectric constant, 465 
metal-insulator-metal, 303-309, 304/, 306/ 
integration, 62, 569-577 
algorithm for, 660-661 
direct, 571 
examples, 571 

Gaussian quadrature method, 573-574 
Monte Carlo, 575-577 
by parts, 574 

power law distribution, 577, 577/ 
representation, 89n20 
series method, 572 

with sharply peaked integrands, 574-575, 575/ 
trapezoidal approximation applied to, 570, 571/ 
uniform distribution, 577, 577/ 
interfaces, 303-327 

metal-insulator-metal, 303-304, 304/, 306/, 307-308, 308/ 
intermediate energy primaries, 155 
intrinsic emittance, 533, 546 
example evaluation, 563 
and space charge, 564-565, 564/ 
intrinsic semiconductors, 277, 278/, 278 1, 280-281, 284-285 
ions, 250-253, 251/-253/ 
core potentials, 272-275, 330-331, 331/ 
ion-enhanced emission and breakdown, 245-253, 246/ 
ionized center, 501-504 
ionized impurity scattering, 512-513, 515/ 


Jagger, IV 


lot-finding method, 602-603, 603/, 604/ 
:k, 25 


J„ 3 


Jefferson, Thomas, 119, 485 

Jeffries-Wentzel-Kramers-Brillouin (JWKB) method, 129 

jellium model, 353 

Jensen, Barbara L., 299 

Johnson, Kelly, 22 

Johnson, Samuel, 110 

joint probability, 32 

Jonquiere’s function Li 2 (e~ x ), 151 

Kane model, 293 
Keats, John, 127 
Keegan, John, 534 

Kemble approximation, 177, 209, 212 
comparison with TMA, 229-231, 230/, 236, 236/, 237/ 
linearized, 177-178, 178/ 
for simplest imaginable barrier, 220 
for triangular barrier, 220-221, 220/ 

Kepler, Johannes, 259n6 

Khayyam, Omar, 314, 382 

Kierkegaard, Soren, 121 

Kihn, Greg, 495 

kinetic energy, 353-354 

King, Martin Luther, Jr., 309, 501 

Kishek, Rami A., 564 
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Knudsen flow, 393-399, 395/, 396/, 398/ 

Knudsen number (Kn), 394 
Kohn, Walter, 353, 360 
kp method, 293-295 
Kramers-Kronig relations, 461 
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.4, 261/, 262/, 264/ 

Kronig, R. D„ 261 
Kubo-Green relations, 387 
Kubo, Ryogo, 49 
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Kummer function, 317 

labeling, 228/, 234/. see also nomenclature; notation 

ladder operators, 95-97 

Lambert's cosine law, 20, 396 

Lang, Norton D., 353 
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Langmuir gas, 385, 388-389 

Lao Tzu, 546 

laser(s) 

free electron (FELs), 443, 525 

High Brightness Free Electron Laser (HIBAF), 174, 174/ 
parameters, 444, 4441 


pulses, 473 

resonant tunneling diode/superlattice V(x) algorithm, 613-614 
two-barrier superlattices, 230, 231/ (see also resonant tunneling diodes 
(RTDs)) 
gas, 381-382 
algorithm for, 633-636 

quasi-chemical or Bethe-Peierls approximation of, 389-391, 389/ 

acoustic mode, 504-505 
optical mode, 504-505 
Lau, Yue-Ying, 554 
launch sites, 532, 532/ 

Lavoisier, Antoine, 92, 92n5, 145 
Lawrence, Jerome, 296 
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leap-frog approach, 547-549, 549/ 
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difference, 651 
fitting, 593-595, 5951 
Lee, Robert E., 296 
Legendre polynomials, 587 
Leibniz, Gottfried, 227 
Lenard-Jones potential, 50 
Lenard, Philipp, 123 
Levi-Civita symbol, 93 
Levine, Jules D., 478 
L'Hopital's rule, 51, 414 
light-like, 297nl0 
light sources, 443 

linear combination of atomic orbitals (LCAO), 257 
linear fit, matrix method for, 594 
line charge models, 417-420 
simple, 417-419, 418/, 419/ 
tapered, 417, 419-420, 420/, 421/, 434, 435/, 4351 
Liouville's theorem, 534-537 
Lippmann-Schwinger equation, 449 
lithography, electron beam, 4, 525 
log-normal distribution of field emitters, 640-643 
Longfellow, Henry Wadsworth, 52, 221 
Longo equation, 373-375 
Lorentz-Drude model 
calculation of 

laser penetration depth, 465, 466/ 
reflectivity, 465, 466/ 
of dielectric constant, 458-465, 466/ 

Lorentzian functions, 590 
Lorentz number, 71, 297 
for metals, 18, 18/ 

Los Alamos High Brightness Free Electron Laser (HIBAF), 174, 174/ 
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Lugovskoy-Bray relaxation time, 650-651 
Lye, Robert G„ 161 

Lye-Dekker secondary emission data, 156, 157/ 
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Mackie, William A., 429 

MAGIC (code), 532 

magnetic fields, 655-657 
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many-body effects, 353-362 

Marat, Jean-Paul, 92n5 

Marcus Aurelius, 558-560 

Marlowe, Christopher, 273 

mass, effective, 291-302, 292n4, 3001, 302nl7 

materials 

complexity of, 255-367 
negative electron affinity (NEA), 446 
positive electron affinity (PEA), 446 
scattering characteristics of, 446, 447/ 
mathematical induction, 412n5 
Matisse, Henri, 209 
MATLAB (code), 222 
for accumulation layers, 618-619 
for band bending, 618-619 
input data for, 5951 
matrix method for linear fit, 594 
matrix multiplication algorithm, 611-612 
Monty Hall algorithm, 595-596 

quadratic barrier algorithm, 612-613 
random number call, 497nl0 
for resistivity and Debye temperature, 653-655 
resonant tunneling diode/superlattice V(x ) algorithm, 613-614 
for screened (Yukawa) potentials, 614-616 
for semiconductors and doping density, 616-618 
transfer matrix evaluation, 611 
matrices, 74-75 

with conjugate variables, 84-87 
density, 86-87, 103-104 


examples, 82, 84 

matrix multiplication algorithm, 611-612 
method for linear fit, 594 
and Monty Hall problem, 80-84, 81/, 83/ 
plane wave transfer matrix, 227-233 
rotation, 81, 81/ 

S-matrix theory, 87 

transfer matrix algorithm (TMA), 207, 227-244, 610-616 
tridiagonal, 224 
and wave functions, 78-90 
Matthiessen's rule, 498 
Maxwell-Boltzmann (MB) 
distribution, 14, 25, 37-48, 37n2, 526, 527/ 
in classical phase space, 37-39, 38/, 39/ 
cylindrical coordinates, 538n9 
examples, 37, 38/, 39/, 397 
for ideal gas in potential gradient, 44--15 
most probable distribution, 39-41 
gas, 62 
law, 40 

statistics, 13-19, 56-57, 57/, 314 
McLachlan, Sarah, 310 
McMillan, P. J., 103nl2 
McNeill, William H„ 61 

mean transverse energy (MTE), 467-469, 538, 540-541 

for photoemission, 540 
for thermal emission, 540 
measurements, 29 
Melian dialogue, 73 
Melville, Herman, 77, 403, 412n5 
metal(s), 257-275 
band gap, 278-279 
bands, 279 
conductivity of, 277 
Debye temperature, 510-511, Silt 
dielectric constant, 465 
Drude model, 462 
e-e scattering, 446, 447/ 
electron quantities, 25 

field emission due to quantum mechanical tunneling, 88 
image charge barrier for, 132, 132/ 

Lorentz number for, 18, 18/ 
metal-insulator-metal current density, 303-309 
moments model for, 452-453 
parameters, 278-279 
QE for, 466-467 
reflectivity, 462-463 

resistivity, 277, 278/, 2781, 510-511, 510/, 511/, 511#, 6531 
scattering characteristics, 446, 447/ 
scattering mechanisms, 447, 447/ 

Sommerfeld model, 48 
photocathodes 

quantum efficiency vs. laser power, 447, 448/ 
scattering mechanisms, 447, 447/ 

TSM model for, 450, 450/, 451 
metal-insulator-metal current density, 303-309 
numerical model, 307-308 
thermal model, 304-306 
tunneling model, 306-307 

metal-insulator-metal interfaces, 303-304, 304/, 306/ 

Bethe approach to, 304 
transmission probability, 307-308, 308/ 
metal-semiconductor diodes, 305-306 
meters-kilograms-seconds-amperes (MKSA), 8-9, 101 
MICHELLE (code), 532, 532/, 546, 546/ 
micro-plasmas, 245 
microscopy 

field emission electron microscopy (FEEM), 416, 550-551 
field emission scanning electron microscopy (FESEM), 4 
scanning electron, 525 
tunneling electron, 525 
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cosmic microwave background radiation (CMBR), 21, 21/ 
high-power, 4, 525 

Millikan-Eyring 
data, 145-146, 146/, 1461, 147/ 
theory, 145-146, 147/ 

Millikan-Lauritsen hypothesis, 146-148 
Millikan, R. A., 147 
Milton, John, 281-282 
Miram curves/plots, 371-375, 373/-375/ 

Mitchell, Joni, 319, 512 

MKSA (meters-kilograms-seconds-amperes), 8-9, lot 
modeling 

field emitter I(V) data, 638-640 
log-normal distribution of field emitters, 640-643 
modified trapezoidal rule, 570, 571/ 
molecular orbitals, 102 
Moliere, 165, 222 
molybdenum resistivity, 653t 
moments-based approach, 63-64 
for metals, 452-453 
for photoemitters, 451-457 
for semiconductors, 453^157 
momentum, 537 
Fermi, 53-54 
relaxation time, 501 
Monch, Winfried, 341 
monopole models, 409 
Monte Carlo methods 
algorithm for, 660-661 

for diamond amplifier response time, 516-524, 516/-518/, 520/, 523/ 
for integration, 575-577 
main characteristics, 494, 494/ 
for secondary emission cathodes, 494-A99 
simulations, 443, 444/, 513, 516-524, 516/-518/, 520/, 523/ 
Montesquieu, 181 
Monty Hall 
algorithm, 595-596 
problem, 80-84, 81/, 81nl6, 82, 83/ 

Morris, Richard, 78 
Moseley, Henry, 60nl4 
most probable distribution, 39-41 
motion, constants of, 92-95 
Mott-Gurney law, 124 
Mott's screening constant, 274nl5 
MTE. see mean transverse energy 
muffin-tin potentials, 357 
multilayer cathodes, 336-337, 337/ 
multiple-segment barriers, 230/ 

Murphy and Good (MG) equation, 141-142 

Nash equilibrium, 73 

Naval Research Laboratory, 13nl 

Nazis, 87 

nearly free electrons, 279 

negative differential resistance, 127, 240-241, 241/ 
negative electron affinity (NEA) 
cathodes 

photocathodes, 446-447, 447/ 
quantum efficiency vs. laser power, 447, 448/ 
scattering mechanisms, 446-447, 447/ 
secondary emission cathodes, 519-520 
material, 443 

photoemission from, 474, 474/ 

Neptune (code), 532 

N(n, s, u) function, 181-189 
infield-like conditions, 182, 182/, 183, 187, 188/ 
s > 0 regime, 183-188 
s<0 regime, 188-189 

in thermal-like conditions, 182/, 183, 187, 188/ 
nickel, resistivity, 653f 
Niebuhr, Reinhold, 69 


Nietzsche, Friedrich, 141 
nitrogen, 282 
doping, 323/ 
layers, 319 

noble gases, 245, 246/ 
nomenclature, see also notation 
for coding, 234 

core terms, 357-360, 358f, 359/ 
correlation terms, 356-357, 356 1, 357/ 

labeling associated with two-region potential barriers with linear potential 
segments, 228/, 234/ 
non-circular pores, 398, 400n23 
non-degenerate 
gas, 55 

semiconductors, 284-285 

non-polar optical phonon (NOP) relaxation time, 512 
Nordheim functions, 144/, 605 
Nordheim, Lothar W., 139 
normal distribution, 37n2 

normalized rms emittance, 534-537, 534/-536/, 538 
notation, 247n3, 292n4, 302nl6, 314nl5, 417nl5, 482n32, 490n3. see also 
nomenclature 
angle, 407 

electric field, 457nll 
emission area, 406^08, 425, 426/ 
emittance, 558-561, 560/ 
for permeability, 457nl2 
for permittivity, 457nl2 
H-type semiconductors, 282 
number current density, 63n6, 104 
numerical accuracy, 7-8 
numerical analysis 

boundary element methods, 429-431, 429/, 432, 432/, 432t 
hybrid analytical-numerical model, 425-433 
numerical solution of ODEs, 582-584 

Obama, Barack, 274 

Ockham's razor, 417 

Office of Strategic Services (OSS), 87 

Ohm's law, 15 

one-dimensional emittance equations, 539-541 
on Neumann, John, 73 
ontological economy, 417 

Oppenheimer, J. Robert, 73, 102, 103nl2, 269n9 

optical mode, 504-505 

optical phonons, 512-513, 515/ 

optoelectronics, 525 

Oracle of Delphi, 80 

orbitals 

atomic, 264-266 

of hydrogen atom, 100-102, 100/-102/ 
linear combination of atomic orbitals (LCAO), 257 
molecular, 102 
shapes, 100 

electron, 526-528, 527/, 528/ 
in magnetic field, 655-657 
order, 31 

ordinary differential equations (ODEs) 
numerical solution of, 582-584 
simple solvers, 619-622 
orthogonal coordinates, 577-579 
orthogonal states, 29 
oscillators, harmonic, 91, 110-118 
quantum phase space approach to, 113-118, 118/ 
trajectory of, 657-658 
Ovid, 131, 173 
Owen, Wilfred, 52, 52n8 
oxygen, Paschen's curve for, 245, 246/ 

Panitz, John A., 438 
parabolic barriers, 227 
Pareto distribution, 26 
PARMELA (code), 532 
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particle accelerators, 443, 525 
partition function (Z), 40 
Paschen characteristics, 245 
Paschen's curve, 245-247, 246/ 
modified, 247-250, 248/, 250/, 253 
Paschen's law, 245 
Pauli Exclusion Principle, 50 
Pauling electronegativities, 268, 268/ 

Pauling units, 267, 267n8 
Paulze, Marie-Anne, 92n5 
penetration factors, 457—458 
Penney, W. G., 261 
permeability, 457nl2 
permittivity, 457nl2 
permutation algorithm, 591-592 
phalanx system, 61, 534 
phase space, classical, 37-39, 38/, 39/ 

Philipp, Herbert R„ 458 
Phillips Laboratory, 371 
phonon(s), 70, 504, 508 
acoustic mode, 504-505 
acoustic scattering, 508-512 
Bose-Einstein distribution governing, 512 
electron-phonon (e-p) scattering, 446-447, 447/ 
non-polar optical phonon (NOP) relaxation time, 512 
optical mode, 504-505 
optical scattering, 512-513, 515/ 
scattering, 515 

phonon-assisted tunneling, 329-330, 330/, 334 
photocathodes, 150/. see also photoemitters 
emittance, 538-539 
metal, 447, 447/, 448/ 

negative electron affinity (NEA), 446-447, 447/ 
quantum efficiency vs. laser power, 447, 448/ 
scattering mechanisms, 446-447, 447/ 
photoemission, 5, 121, 123 
brute force evaluation of, 189-193, 190/-193/ 
challenge of, 19-23 
computationally kind model, 193-198 
data, 152/, 1521, 153 
delayed, 473 
emittance, 538-539 

general thermal-field-photoemission equation, 173-206, 209 

MTE values, 540-541 

n = 1 regime, 194-195, 195/ 

from NEA bulk gallium arsenide, 474, 474/ 

resources and further reading, 125 

three-step model, 155n2 

photo emittance, single hemisphere, 556-557, 557 1, 558/ 
photoemitters, 443-486 
basic theory, 448-449 
emittance, 443 
examples, 44 4 - 44 5 
laser parameters, 444, 4441 
lifetime, 443 

low work function coatings, 478-485, 482/ 
moments model, 451^57 
Monte Carlo simulations, 443, 444/ 
penetration factors, 457-458 
performance metrics, 443 
quantum efficiency (QE), 443, 445-446, 445/ 
reflectivity, 457-458 
response time, 443 
ruggedness or survivability, 443 
scattering contributions, 466—474 
scattering effects, 446—448, 447/ 
shell and sphere models, 474—476, 475/ 
three-step model, 449—451 
photon, 291 

energy, 22-23, 22/ 

physical roughness, 550-551, 550/, 551/ 
physics 

application, 369-565 
electron beam, 525-565, 532/ 

Pierce gun, 526 


Pierce-type diodes, 532, 532/ 

Piketty, Thomas, 128 

Pines, David, 356 

Pinker, Steven, 8, 39, 385 

Planck's constant, 8, 91 

Planck's formula, 20 

plane wave transfer matrix, 227-233 

plasma frequency, 506 

platinum: resistivity, 6531 

Plato, 3, 64, 80, 236, 329 

poetry, 52 

point charge model, 408—412, 4111 
dipole, 409, 411, 4111, 412/, 419-420 
evaluation of functions, 636-637 
monopole, 409, 419-420 
placement of point charges, 409—410, 410/ 

Schottky conjecture for, 415 
shielding for, 435-436, 436/, 4371 
tapered dipole, 420 
point source 

model, 376-381, 380/, 381/ 
solution, 377-378 
Poisson distributions, 648-650 
generation, 648-649 

test of acceptance-rejection method, 649-650 
Poisson process, 495 
Poisson's equation, 288, 322, 421 
numerical solution of, in diamond, 319-320, 3211 
Poisson statistics, 495-496, 496/ 
polarization, 459, 459/, 459nl3 
polylogarithms, 151 
Polyneices, 163 
polynomial functions, 117 
approximations, 191-192, 192/ 

Hermite polynomials, 113 
Legendre polynomials, 587 
Poole-Frenkel transport, 329-333, 330/-333/ 

P circular, 398 
integral limits, 3981 

Knudsen flow through, 393-399, 396/, 398/, 3981 

sintered wire CPD cathodes, 399-401, 400/ 
positive electron affinity (PEA) material, 446^47, 447/ 
potential energy, 10 

distribution, 577, 577/ 
formula, 161 

pre-quantum models, 13-23 
pressure, 43 
difference, 44-45 
Price's treatment, 243, 243nl0 
primary beams, see also electron beams 
back-scattered, 155 
intermediate energy, 155 
principle of equipartition of energy, 40—41 
prisoner's dilemma (PD), 73-78 
probability 
and entropy, 30-33 
current, 105 
examples, 32-33 
joint, 32 

most probable distribution, 39-41 
and orbital shape, 100-102, 100/ 
products of, 33-36, 36/ 
in quantum mechanics, 91 
and states, 28-30 
transmission, 104-106 
Proclus, 567 

prolate spheroidal approach, 420—425 
coordinates, 421, 422/, 578 
Prometheus, 286 
Prutton, Martin, 375 
Ptolemy, 567 

p -type semiconductors, 282 
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pulse transit time, 169 

Pythagorean theorem, 333, 333n5, 334f 

QE. see quantum efficiency 

quadratic image charge barriers V(x), 324-326, 326f 
algorithm for, 612-613 
approximation of, 190/ 

planar vs. Airy TMA representation of, 236, 236/, 237/ 
roots of, 326, 326 1, 626-627 
quadratic image charge potential V(x) - E, 325, 325/ 
quantum distributions, 49-60, 113 
quantum effects, 115 
quantum efficiency (QE), 21-22, 443, 450 
of cesiated copper, 471-472, 472/ 
of cesiated surfaces, 485-486 
and emittance, 467-469, 470/ 
equation for, 451, 466-467 
example calculation, 443 
as function of coverage, 485 
as function of photon energy, 453-454, 454/ 
for metals, 466-467 
numerical evaluation of, 486, 486/ 
vs. photon energy, 22-23, 22/ 
scattering coefficient to, 476-478, 477/, 478/ 

dielectric constant model, 463-165, 466/ 
hydrogen atom model, 11 
methods, 73-90 

passage from classical mechanics to, 61-62 
probability, 91 
simple model, 73-77 
quantum particles, 49 

quantum phase space approach, 113-118, 118/ 

quantum potential, 130 

quantum predictions, 347-348, 348/ 

quantum solids, 58-59 

quantum statistics, 56-57, 57/, 62 

quartz glass, 403 

quasi-dassical Boltzmann model, 461—163 
quid pro quo, 77-78 
quincunx shapes, 564-565 
Quine, Willard, 222 
quintessential problems, 91-118 
harmonic oscillator, 91, 110-118 
hydrogen atom, 91-102 
transport past barriers, 91, 102-110 

Rabelais, Francois, 257 

radar (RAdio Detecting And Ranging), 3, 122 

radial coordinates, 577-578 

radial equation, 97-100, 98/ 

radial waves, 359 

radiofrequency photoinjectors, 443 


Raimi, Ivan, 341 
Raimi, Sam, 341 
RAND corporation, 73 
randomness, 31 

random number generation, 497, 498/ 
Rao, Trivini, 487 
Rayleigh-Jeans formula, 19-20 
realistic surfaces, 563-565 
rectangular barriers, 209-213, 212/, 213/ 
assumptions and conventions, 210 
Gamow factor for, 211-212 
over-the-barrier transport, 210-211 
points required for, 229/ 
step function barrier, 209 
transmission probability for, 227, 229/ 
under-the-barrier transport, 211 
rectifiers, 305-306 
Reed, David A., 381 
reflectivity 

dependent factors, 485 
field emitter, 457-458 


Lorentz-Drude calculation, 465, 466/ 
metal, 462-463 
photoemitter, 457-458 
relaxation time required for, 462-463 
Reif, Frederick, 376 
Reiser, Martin, 533 
rejuvenation time, 378 
relativity, 299 
relaxation time 
acoustic, 510-512, 515/ 
definition of, 68, 485, 499-515, 503/ 
electron-electron, 650-651 
energy, 501 

example estimations, 462-463 
to infinity, 502-503, 503/ 
ionized center, 501-503 
Lugovskoy-Bray, 650-651 
momentum, 501 

non-polar optical phonon (NOP), 512 
scattering as, 501 

secondary emission cathode, 499-515 
representations, 88-90 
resistance, 460 

negative differential, 127, 240-241, 241/ 
resistivity, 16, 277-279, 278/, 278 1 
algorithm for, 651-655 
data sets, 652 1, 653 1 
metal, 510-511, 510/, 511/, 511r, 653( 

in field emission, 336-339, 337/, 338/, 339/ 
tunneling, 337, 337/ 

resonant transmission, 240-244, 241/-243/, 337, 337/ 
resonant tunneling diodes (RTDs), 127, 213, 228 
barrier profile, 240, 240/ 
barriers and wells, 230 
under bias, 232, 233, 240, 240/ 
current density vs. applied bias for, 240-241, 241/ 
at zero bias, 230-231, 231/, 240 
resonant tunneling diode/superlattice V(x) algorithm, 613-614 
resources and further reading, 124-126 
Rhoderick, E. H„ 304 
ribbons of charge, 429 
Richards, Keith, 25 

Richardson-Laue-Dushman (RLD) equation, 46, 48, 135-137, 148, 173, 174/ 
approximations, 135 
current density changes, 366 
Riemann zeta function, 7-8, 50-52, 414, 509, 585-586 
series associated with, 569 
Riezler, W., 324 

RLD. see Richardson-Laue-Dushman equation 
Rodrigues' formula, 113, 586 
Rolling Stones, 25 

root-finding methods, 601-604, 603/, 604/ 
root mean squared (rms) 
emittance, normalized, 534-537, 534/-536/, 538 
value, 87 
velocity, 17 

Rostand, Edmond, 91n4 
rotation matrices, 81 
roughness, 550-551 
chemical, 550-551, 550/, 551/ 
interface, 309 

physical, 550-551, 550/, 551/ 
surface, 541 

Rousseau, Jean Jacques, 257 

Royal Road, 567, 567n2 

RTDs. see resonant tunneling diodes (RTDs) 

Runge-Kutta (RK) approach, 547 
Russell, Bertrand, 25, 45, 135 

Sackur-Tetrode equation, 46-47 
Sagan, Carl, 539nl0 
Sandel, Michael J., 537 
Santayana, George, 124 
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saturation velocity, 518, 518n26 
Saturn, 426-428, 426n23, 427/ 
scanning electron microscopy, 525 
scattered electrons, 471-474, 470 f, 472/-475/ 
approximation of, 485 
scattering, 61, 446^148, 466-477, 498^99 
acoustic phonon, 508-512 

coefficient to quantum efficiency, 476-178, 477/, 478/ 
drift-diffusion, 447, 447/ 
elastic, 501 

electron-electron (e-e), 446^47, 447/, 506-508, 508/ 
electron-phonon (e-p), 446-447, 447/ 
ionized impurity, 512-513, 515/ 

Monte Carlo simulations, 516-517, 517/ 
optical phonon, 512-513, 515/ 
phonon, 515 
photoemitter, 466-478 
as relaxation time, 501 
self-scattering, 498-499 
scattering angles, 513-514, 513/, 514/ 

Schmerge, John F., 467 
Schottky-barrier diodes, 305-306 
Schottky barrier-lowering factor, 132-133 
Schottky conjecture, 412-415, 413/-415/, 415( 

Schottky deviations, 238-239, 238 f-239f, 240/ 

Schottky emission equation, 135 
Schottky emitters, 125 
Schottky factor, 12 

Schottky-Nordheim barrier functions, 141, 141n6 
Schrodinger's equation, 11, 79, 88-89, 110-111, 129, 257, 464, 500 
full equation, 265 
time-dependent, 448, 464 
Schrodinger's wave mechanics, 78 
Schwoebel, Paul R„ 438 

screened or Yukawa potentials, 274, 275/, 501, 506 
algorithm for, 614-616 
screening/shielding, 433-437, 434/-437/ 

Debye's constant, 274, 274nl5 
dielectric constant and, 287-289 
ion potentials and, 272-275 
Mott's constant, 274nl5 
secondary emission, 5, 121, 124, 155 
back-scattered primaries, 155 
Baroody function, 601-602 
cathodes, 487-524 
data, 160-161, 160/, 161/ 

Baroody data, 156, 156/ 

Lye-Dekker data, 156, 157/ 
from diamond crystals, 487, 488/ 
and emittance, 542-545 
energy contours, 544-545, 544/ 
energy distribution curves (EDCs), 543 
intensity of, 543, 543/, 544/ 
intermediate energy primaries, 155 
low-energy, 155 

Monte Carlo simulations, 493, 493/, 516-518, 517/, 518/ 
penetration range, 492^94 
resources and further reading, 125-126 
true secondaries, 155-156 
secondary emitters, diamond, 156/ 
second-order differentiation, 580-582, 583-584 
simple solvers, 620-622 
Seeger, Pete, 168 

self-scattering, 498-199 
self-similarity, 409 
semi-analytic model, 431-133 
semiconductors, 277-289, 445 
algorithm for, 616-618 
alpha semiconductor model, 299-301, 299/ 
band bending, 310-311, 311/ 
band gap, 278-279, 299-301, 299/, 300t, 301/ 
bands, 279 

current density, 317, 317/ 
degenerate, 284-285 


depletion layers, 319 

dielectric constant, 460 

doping, 277, 281-285, 284/, 285/ 

doping density, 616-618 

e-e scattering in, 446, 447/ 

effective electron mass, 300, 300( 

extrinsic, 282-285 

field emission from, 622-626 

heavily doped, 317, 317/ 

image charge potential, 286-287, 287/, 314 

intrinsic, 277, 278/, 278 1, 280-281, 284-285 

metal-semiconductor diodes, 305-306 

moments model, 453-457 

Monte Carlo simulation, 494, 494/ 

non-degenerate, 284-285 

H-type, 282, 329 

parameters, 278-279, 279 1 

photon energy, 455, 456/ 

p-type, 282, 329 

quantum efficiency, 453-454, 454/, 455-456, 456/ 
resistivity, 277-279, 278/, 278f 
saturation velocity, 518 

scattering characteristics, 446, 447/, 454-455, 455/ 
scattering mechanisms, 447, 447/ 
transmission probability, 454-455 
TSM model for, 450, 450/, 451 
semi-metals: bands, 279 
series expansion, 571-572 
series summation, 569-571 
Shaffer, Peter, 213 

Shakespeare, Willliam, xiii, 16, 50, 57, 65, 68, 77, 119, 128, 175, 229, 236, 
245, 250, 257, 279, 291, 337n9, 344, 353, 399, 419, 448, 461, 498, 
525, 539, 545 
Sham, Lu Jeu, 360 
Shannon entropy, 32 
shape factor approach, 131 
shape factors, 131 

shell and sphere models, 474-476, 475/ 
simple shell and sphere algorithm, 643-645 
Shelley, Percy Bysshe, 30 
Shelton, Haywood, 135 
Shepard, Sam, 41 
shielding, see screening/shielding 
Shih, Arnold, 492 
shot noise, 495 
shuffling 

diffusion model, 383-384, 384/ 
lattice gas algorithm, 633-636 
silicon, 282 

dielectric constant for, 460 
effective mass, 302nl7 
field emitters, high aspect ratio, 439, 439/ 
parameters, 278-279, 279f 
silver (Ag): resistivity, 6521 
silver sulfide, 277nl 
Simmons, John G., 303 
Simon, Carly, 474 
Simon, Paul, 18, 303 
simple barriers, 209-226 
simple diffusion coefficient D model, 382-383 
simple line charge models, 417-419, 418/ 
simple ODE solvers, 619-622 
simple quantum mechanics model, 73-77 
simple shell and sphere algorithm, 643-645 
Simpson's rule, 570, 571/ 
simulation code, 199-204 

sintered wire controlled porosity dispenser cathodes, 399-401 
sintering, 375 

sinusoidal potential, 269-272, 503-506, 504/, 505/ 

Skinner, Burrhus Frederic, 160 
small barriers, 315-316, 315/ 

S-matrix theory, 87 
Smith, David Livingston, 287 
Snicket, Lemony, 315-316 
sodium (Na), 361 
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sodium exchange-correlation potential, 631 

Sokal, Alan D„ 324 

Sokal's hoax, 324 

Sommer, Alfred H„ 443 

Sommerfeld model, 19, 48, 62 

Sophocles, 163 

sound velocity, 506, 5061 

source-limited current, 372 

space charge 

and electron beams, 529-530, 530/ 
and emittance, 541-542 
field, 542 

free transit time, 169, 169/ 
intrinsic emittance and, 564-565, 564/ 
limited current, 170, 372 
limited emission, 121, 124, 168 
space-charge region, 164 

space-charge-dominated electron beams, 564-565, 564/ 
space-like, 297nl0 
speed of light c, 8, 91 

atomic sphere radius r s , 629-630 
in d dimensions, 259-260, 260/ 
shell and sphere models, 474-476, 475/ 
simple shell and sphere algorithm, 643-645 
spheroids, see prolate spheroidal approach 
spin coupling, 293 
Spindt, Charles (Capp) A., 438 
Spindt approximation, 144-145 
Spindt-type field emitters, 409 
arrays, 139 

data from, 416-417, 416/ 
gated, 140/ 

Spinoza, Benedictus de, 550 
spread, 87 

St. Elmo's fire (corposants), 403, 404-405, 404/ 
stable strategies, 77-78 

black dwarf, 58nl3 
white dwarf, 57-60 

allowed final states, 514-515 
basis, 29 

density of, 257-259, 259/ 
examples, 64 

for one dimension, 64, 258-259, 259/ 
for three dimensions, 258-259, 259/ 

TMA vs. Kemble approximation, 229-231, 230/ 
for two dimensions, 64, 258-259, 259/ 
for various p, 64, 65/ 
probability and, 28-30 
statistical variation, 438-442, 439/^441/ 
statistics, 25-36. see also specific statistics by name 
classical, 56-57, 57/, 62 
quantum, 56-57, 57/, 62 
StatTip program, 641-642 

STAY (do not switch curtains when offered) algorithm, 595 

distribution, 114 
problems, 130 
step cliffs, 106 
step functions, 209 

electron density near, 109, 110/ 
transmission probability for, 105, 106/ 

Sterling's approximation, 388 
Stewart, Rod, 569 
Stirling's approximation, 496 
Stoppard, Tom, 183 
stupidity energy, 356 
summation, 569-584 
Sun, 58-60 
Sun Tzu, 37 
superlattices 

resonant tunneling diode/superlattice V(x) algorithm, 613-614 


two-barrier, 231/, 232 (see also resonant tunneling diodes) 
supply function/(B), 127, 128 
surface(s) 
cesiated, 485^186 
density, 376 

diffusion, 375-391, 376/, 379/-381/ 
analytic model, 376-377 
lattice gas models, 381 
numerical model, 377 
point source model, 376-381, 380/, 381/ 
shuffling algorithm for, 383-384, 384/ 
evaporation, 391-393, 391/-393/ 
flat, 533-545 
roughness, 541 
realistic, 563-565 
undulating, 563-564 
uniform, 533-545 
Sutton, Adrian P., 264-265 
Swinburne, Algernon Charles, 449, 466 
SWITCH (switch curtains when offered) algorithm, 595 

Talking Heads, 591 

tanh model (hyperbolic tangent model), 351-352 
tapered dipole model, 420 
tapered line charge models, 417, 419-420, 420/ 
example, 420, 421/ 
shielding for, 434, 435/, 4351 
Taylor cones, 409 
Taylor correction, 603 
Taylor expansion, 572 
Taylor series, 569 
Teller, Edward, 87, 103nl2 
temperature 
band gap and, 281, 282/ 
chemical potential and, 55, 56/, 283-285, 284/ 

Debye temperature, 70 
algorithm for, 651-655 
for diamond, 511 
metal, 510-511, 5111 
doping and, 285, 285/ 
effects of, 349-350, 403 
high temperature/low density, 273-274 
low temperature/high density, 273 
work function and, 363, 364/, 3641, 366-367 
zero, 53 

theoretical thermal-field energy distributions, 176-181, 178/-180/ 
thermal conductivity, 16-18, 17/, 68-69 
metal-insulator-metal current, 304-306 
thermal emission, 5, 121-122, 163 
brute force evaluation of, 189-193, 190/, 191/ 
computationally kind model, 193-198 
data analysis, 136, 136f, 137/ 
emittance, 537-538, 540 

experimental thermal-field energy distributions, 175-176, 176/ 
general thermal-field 
current density, 195-198, 197/, 198/ 
emission code, 198-206 

general thermal-field-photoemission equation, 173-206, 209 
MTE values, 540-541 
« = 1 regime, 194-195, 195/ 

N(n,s,u) function, 182/, 183, 187, 188/ 
resources and further reading, 125 
simulation code, 199-204 

theoretical thermal-field energy distributions, 176-181, 178/-180/ 
thermal emittance, single hemisphere, 555, 557, 5571, 558/ 
thermal field, 177 
algorithm, 604-606 
thermalization, 507 

thermally activated field assisted self-diffusion, 416 
thermal tails, 443 

thermal wavelength (i. T ), 42-43, 46 
thermionic cathodes, 4, 371-372, 375-376 
thermionic currents, 147-148 
thermionic data, 47/ 
thermionic emission, 122 
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thermionic emitters, 14 

Thomas-Fermi approximation, 507 
Thomas-Fermi wavenumber, 506 
Thomas, Richard E„ 356, 371 
Thomson, J. J., 13 
Thoreau, Henry David, 406, 433 
three-step model (TSM), 449^51, 450/ 

Thucydides, 61, 73 
tilt correction, 563 
time-like, 297nl0 
tip current models 
assessment of, 415-417 
statistical variation, 438-^139, 440/ 
tit-for-tat strategy, 75-78 
Tizard, Henry, 3 

TMA. see transfer matrix algorithm 
Tolkien, John R. R„ 63, 425 

tracer diffusion coefficient D *, 377, 382n9, 386-387 
trace space, 536 


for emission from a hemisphere, 658-660 
for harmonic oscillators, 657-658 
trajectory equations, 547 
trajectory evaluation, 546-550 
average value method, 547-549, 549/ 
explicit methods, 549 

implicit methods, 549-550 
leap-frog approach, 546-549, 549/ 
numerical, 553, 553/ 

Runge-Kutta (RK) approach, 547 

transfer matrix algorithm (TMA) evaluation, 207, 227-244, 231/, 611 
Airy function, 233-244, 23S/-237/, 253 
algorithm, 610-616 

features that require attention, 610-611 
matrix multiplication algorithm, 611-612 
plane wave, 227-233, 230/, 236/, 237/ 
quadratic barrier V(x) algorithm, 612-613 
for quadratic potential barrier, 236, 236/, 237/ 
resonant tunneling diode/superlattice V(x) algorithm, 613-614 
for single rectangular barrier, 229-231, 230/, 231-233 
transfer matrix evaluation, 611 

approximation of, 168-171 
averaged charge density, 170-171, 171/ 
ballistic, 169, 169/, 170-171 
definition of, 168, 169, 375-376 
pulse, 169 

space-charge-free, 169, 169/ 
transmission 

exact tunneling and transmission evaluation, 207-253 
between material layers, 324 
over-a-cliff transport, 106 
over-the-barrier transport, 210-211, 334 
past barriers, 91, 102-110 
Poole-Frenkel transport, 329-333, 330/-333/ 
resonant, 240-244, 241/-243/ 
effects in, 337, 337/ 
under-the-barrier transport, 211 

transmission probability, 104-106, 105nl5, 212-213, 212/, 213/ 
in Fowler-Nordheim (FN) model, 539 
MIM diode, 307-308, 308/ 
over-the-barrier D(k > ko), 217-218 
pre-factors, 317-318, 317/, 319/ 
for semiconductors, 454-455 
step function, 105, 106/ 

TMA approach, 231/, 233 
under-the-barrier D(k < ko), 218-219, 218/ 
trapezoidal approximation, 570, 571/ 
traveling-wave tubes (TWTs), 163, 371, 531 
triangular barriers 
Airy functions for, 219-221 
algorithm for, 607-608 
approximation, 140-141 


Fowler-Nordheim equation for, 140-141, 221-222 
Gamow factor for, 215, 607-608 
general method for, 213-222, 217/ 

Kemble approximation, 220-221, 220/ 

large argument evaluation of, 222-225, 225/, 225f 

numerical method for, 222-226, 225/, 225(, 226/, 226f 

over-the-barrier probability, 217-218 

small argument evaluation of, 225-226, 226/, 226 1 

step function barrier, 209 

transmission probability, 220-221, 220/, 221/, 227 
under-the-barrier probability, 218-219, 218/ 
triangular wells, 346-348, 348/ 
tridiagonal matrix equations, 224 
trigonometric functions, 585 
Trinity test, 103nl2 
Troian, J. Ken, 403 
true secondaries, 155-156 
trustworthiness, 29 
truth, 127, 324 
truthiness, 127n3 
Tsu-Esaki formula, 127, 232 


cathode ray tubes (CRTs), 4, 122 
in dispenser cathodes, 399, 399/ 


tungsten 
coated, 479, 480/ 
electronegativity, 268-269, 268/ 
resistivity, 653( 
temperature change, 366n5 
wire thickness, 399n22 
work function, 479, 480/ 
tunneling, 91, 103, 106-109 
band-to-band, 164 
conduction, 333-336, 335/, 336/ 
direct, 329-330, 330/ 

exact tunneling and transmission evaluation, 207-253 

metal-insulator-metal current, 306-307 

onset of, 332-333, 332/ 

phonon-assisted, 329-330, 330/, 334 

pre-factor, 317 

resonant, 336-339, 337/ 

tunneling diode/superlattice V(x) algorithm, 613-614 
wave packet problem, 107-108, 108/ 

Zener breakdown, 329 
tunneling electron microscopy, 525 
Twain, Mark, 198, 497, 542 

two-barrier superlattices, 232, 232/. see also resonant tunneling diodes (RTDs) 
two infinite barriers, 344-346, 345/, 346/ 
two-region potential barriers 
labeling associated with, 228/, 234/ 
with linear potential segments, 234/ 


TWTs. see traveling-wave tubes 


ultraviolet catastrophe, 20 
ultraviolet light, 460 
uncrtainty, 87-88 
undulating surfaces, 563-564 
uniform distribution, 577, 577/ 
uniform surfaces, 533-545 
unit(s), 7-12 
conversion of, 9, 10( 
fundamental constants, 8-9, 9 1 
unit charge q, 8, 9 1 
universal constants, 145 
universal reduced yield equation, 161 
University of Maryland's Electron Ring (UMER), 564 
upwind differencing scheme (UDS), 377 
US Census Bureau Data (2012), 26, 26(, 27/ 

vacuum electronics, 4, 4/ 
vacuum fields, 312 

Van de Graaft generator (MIT), 426n22 
van der Zeil, Aldert, 306 
variables, conjugate, 84-87 
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variations, 35 

statistical, 438^42, 439/-441/ 

Varshni's equation, 281 
Veblen, Thorstein, 311 
vectors, 74-75 
velocity, 39 

average positive, 15, 17 
root mean squared (rms), 17 
Virgil, 186nl 1 
Voltaire, 7, 293 
von Clausewitz, Carl, 431 
Vonnegut, Kurt, 408 
Vorpal (code), 532 

Walton, D. N„ 215n6 
Wang, Erdong, 487 
Wang, Yiman, 373n2 
WARP (code), 532 
waterfall plots, 107-108, 108/ 
wave functions, 214 
algorithm, 596-598 

for harmonic oscillator problem, 110-113 
incident on a barrier, 104-105, 105/ 

JWKB (Jeffries-Wentzel-Kramers-Brillouin) method, 129 
matrices and, 78-90 
over-to-under, 218, 218/ 
plane wave transfer matrix, 227-233 
under-to-over, 218, 218/, 219 
WKB (Wentzel-Kramers-Brillouin) method, 129 
wave packets, 107-108, 108/ 
weak fields, 135n2 
Wehnelt, Arthur, 122 
wells 
RTD, 230 

triangular, 346-348, 348/ 

Wentzel-Kramers-Brillouin (WKB) method, 129 

Whiddington's law, 157 

white dwarf stars, 57-60 

Whitehead, Alfred North, 104 

Wien's formula, 20 

Wigner, E„ 113 

Wigner distribution f(x, k, t), 114 
Wigner function, 113-117, 118/, 292 
Wigner-Seitz sphere, 357 
Wigner trajectory, 115 


Wilde, Oscar, 227 
Wilson, Brian, 508 
wire, tungsten, 399n22 
Wittgenstein, Ludwig, 164 

WKB method, see Wentzel-Kramers-Brillouin method 
Wooten, Frederick, 458 
Wordsworth, William, 471 
work function, 21, 373n3 
atomic number and, 480, 480/ 
charge redistribution and, 478, 479/ 
coverage and, 479, 480/, 483-484, 484/ 
effective, 365 

electronegativity and, 480, 480/, 481/ 
field and, 363-366, 365/-366/ 
field-dependent, 366-367, 367/, 632 
Gyftopoulos-Levine algorithm, 645-647 
low, 478-485, 482/, 484/, 485 
Miram curves and, 374, 374/ 
temperature and, 363, 364/, 3641, 366-367 
temperature-dependent, 366-367 
typical values, 478, 478/, 480, 480/ 
variations with crystal face, 478, 479/ 

World War I, 60, 60nl4, 78 
World War n, 87 
Wright, Robert, xiii 
Wright, Steve, 495 

Xerxes, 106 
X-ray, 525 

Yater, Joan, 492 
The Yi King, 11 
Young's approximation, 491 
Yukawa (screened) potentials, 274, 275/, 501, 506 
algorithm for, 614-616 

Zener breakdown, 329 

Zeno's paradox, 51-52 

zero. Airy function, 346, 3471, 627-629 

zero disorder or zero entropy, 31 

zero-sum games, 73 

zero temperature, 53 

Ziman, John M., 272, 291 

zoomas routine, 642 
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